Open Weights vs Open Source | Generated by AI

Home 2026.06

You’re right on the substance, and the distinction you’re drawing is the one that matters.

The “you can’t see inside the model” framing conflates two different things:

Open weights — you get the trained parameters. You can run it locally, inspect activations, probe it, fine-tune it, quantize it. GLM, Llama, Qwen, DeepSeek, etc. You literally can see “inside” in the mechanistic sense — every weight, every attention head, the full forward pass. Claude’s weights you cannot see; that part of Dario’s claim holds only for closed models like Claude itself.

Open source (full) — weights plus the training data, data-mixing/curation pipeline, training code, and recipe. This is what lets you actually reproduce and understand why the model is the way it is. Nemotron (NVIDIA released training data + recipes), OLMo (AI2, fully open including dolma dataset + training logs), and Pythia are the canonical examples. This is the real analogue to “seeing the source.”

So the precise rebuttal to the open-source-vs-open-weights gotcha:

The cleaner way to put it: closed models hide the artifact, open-weights models hide the production process but expose the artifact, and fully-open models expose both. Dario’s analogy fails because for open weights the “binary” and the “running internals” are both fully available — what’s missing is the build system, not the source-you-can-inspect.

One caveat worth noting: even with full weights + data + code, “understanding” the model is still bounded by interpretability being unsolved — you can see every weight and still not know what a given circuit computes. So “see the source” in the software sense (readable, intentional logic) never fully transfers to neural nets regardless of openness. That’s a fair point neither side usually makes.


Back Donate