Learning Objectives
- Explain the practical difference between open-weight and closed-source AI models
- Articulate the strategic logic behind Meta's open-source approach vs. OpenAI's and Anthropic's closed approach
- Apply a decision framework to choose between open and closed models for specific use cases
What "Open Source" Actually Means for AI
In traditional software, "open source" means the source code is publicly available. For AI models, the term is used loosely and means different things depending on context. Not all "open" models are equally open.
The most important distinction:
Open weights: The trained model parameters (weights) are publicly downloadable. You can run the model yourself, fine-tune it, and deploy it on your own infrastructure. Examples: Llama 4, Gemma 3, DeepSeek V3.2.
Open source (fully): Model weights AND training code AND training data AND detailed methodology are publicly available. This is rare. True full open source AI models (like EleutherAI's GPT-Neo series) are uncommon at frontier capability levels.
Closed source (API-only): You access the model only through the provider's API. You cannot download the weights, cannot run it yourself, cannot inspect its internals. Examples: GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro.
⚠️Warning
The "open source" label is often imprecise. Llama 4 Maverick's weights are downloadable, but Meta has not published the training data or full training methodology. DeepSeek V3.2 publishes weights and the technical paper, but not the complete training dataset. When someone says a model is "open source," ask: what specifically is open? Weights? Training code? Training data? License?
The License Spectrum
Even among open-weight models, licenses vary significantly. MIT and Apache 2.0 are the most permissive — both allow free commercial use and free modification. MIT covers Phi-4, DeepSeek V3, and DeepSeek R1; Apache 2.0 covers Mistral 7 billion, Mixtral, Gemma 3, Qwen 3.5, and GPT-OSS. The Meta Community License (covering the Llama 4 series) is similar but caps free commercial use at 1 million monthly active users. Research-only licenses block commercial use entirely and limit modification — those apply to some academic models. Closed API models like GPT-5.5, Claude, and Gemini 3.1 Pro charge per-API-call pricing and don't allow direct modification of the model.
| License | Commercial Use | Modification | Notable Models |
|---|---|---|---|
| MIT | ✅ Free | ✅ Free | Phi-4, DeepSeek V3, DeepSeek R1 |
| Apache 2.0 | ✅ Free | ✅ Free | Mistral 7 billion, Mixtral, Gemma 3, Qwen 3.5, GPT-OSS |
| Meta Community License | ✅ Up to 1 million MAU | ✅ Free | Llama 4 series |
| Research-only | ❌ No commercial | Limited | Some academic models |
| Closed API | Per API pricing | ❌ No | GPT-5.5, Claude, Gemini 3.1 Pro |
For most commercial use cases, MIT and Apache 2.0 licenses are the most permissive and least legally complex.
Why Companies Open-Source (or Don't)
Understanding the strategic logic helps you evaluate claims about open source.
Why Meta Open-Sources
Meta's reasoning is explicit and rational:
-
Commoditize the complement. Meta's business is advertising on social platforms, not selling AI models. By open-sourcing AI models, Meta makes AI infrastructure cheaper and more available to everyone — which commoditizes the AI model market and prevents competitors (like OpenAI or Anthropic) from building a durable moat in foundation models.
-
Build ecosystem loyalty. When millions of developers build on Llama architecture, they learn Meta's conventions, contribute improvements back, and create a gravitational pull toward Meta's continued model development.
-
Attract and retain research talent. The open-source AI research community is large, talented, and values open publication. Meta's open approach is a talent strategy as much as a product strategy.
-
Reduce regulatory risk. Open models are harder to regulate than proprietary models — they are already distributed. Meta's open-source stance gives it a political position in AI governance debates.
Why OpenAI and Anthropic Remain Closed
-
Capability moat. Keeping model weights private prevents competitors from directly copying the most capable model. The months of lead time between a frontier model release and competitors' responses translate directly to revenue.
-
Safety concerns. Anthropic and OpenAI both argue that releasing frontier model weights creates safety risks — models can be fine-tuned to remove safety guardrails and used for harmful purposes. This argument is contested but not entirely without merit for the most capable models.
-
Commercial incentive. API access creates a clear, recurring revenue stream. Open-sourcing the model eliminates that revenue stream while still incurring the cost of developing the model.
-
Enterprise trust. Enterprise customers often prefer closed, supported models with SLAs, compliance certifications, and vendor accountability — which open weights don't provide.
When to Choose Open-Weight Models
Data privacy and sovereignty: If your application involves sensitive data — healthcare records, financial information, legal documents, personally identifiable information — and you cannot transmit that data to an external API, open-weight models run locally are the solution. You control the data; it never leaves your infrastructure.
Cost control at scale: At very high inference volumes, self-hosting an open-weight model can be significantly cheaper than paying per-token API fees. The crossover point depends on your compute infrastructure, but for millions of daily queries, the economics often favor self-hosting.
Custom fine-tuning: If your application requires specialized behavior that general-purpose models don't provide well, fine-tuning an open-weight model on your domain-specific data is often more effective and efficient than prompt engineering on a closed model.
On-device and edge deployment: Applications running on phones, embedded systems, or in environments without reliable internet access require on-device models. Only open-weight models can be deployed this way.
Research and experimentation: Academic research, red-teaming, interpretability research — all require access to the model internals or the ability to modify the model in ways that closed APIs don't permit.
Regulatory and compliance requirements: Some regulated industries and government applications require full transparency into what model is being used and full data residency control — requirements that closed APIs cannot satisfy.
When to Choose Closed-Source API Models
Maximum capability at launch: Frontier closed-source models typically lead open-weight models by 3–12 months on capability benchmarks. If you need the best available performance for a critical application, closed models usually win at any given moment.
Ease of deployment: No infrastructure to manage, no scaling challenges, no model versioning — just an API call. For most applications, this simplicity is worth the per-token cost.
Safety and alignment: Closed models have been through more extensive safety evaluation and have stronger default content policies. For customer-facing applications where harmful outputs would be reputationally damaging, the safety investment that OpenAI, Anthropic, and Google have made in their closed models is valuable.
Enterprise support: SLAs, compliance certifications, vendor accountability, dedicated support — closed API providers offer these; open-weight models don't.
Rapid iteration: When you need to build quickly and don't want to manage infrastructure, closed APIs let you focus on product, not model operations.
The Open-Source Momentum: DeepSeek and GPT-OSS Change the Calculus
For much of 2023–2024, the choice between open and closed models involved a significant capability trade-off: closed models were materially better. The landscape shifted dramatically in 2025–2026.
DeepSeek V3 (MIT license) is DeepSeek's current flagship general-purpose model — competitive with frontier closed models at a fraction of the training cost (2.664 million H800 GPU hours). DeepSeek V4, a multimodal model with picture, video, and text generation, is expected as soon as March 2026. DeepSeek's continued demonstration that open-weight models can match proprietary alternatives has fundamentally changed enterprise AI adoption patterns.
GPT-OSS (Apache 2.0) marks OpenAI's entry into open-weight models — a strategic acknowledgment that the open-source movement cannot be ignored even by the most commercially oriented AI lab.
The capability gap between top open and top closed models has narrowed substantially. For many use cases, the performance difference is small enough that the advantages of open deployment — privacy, cost control, customization — can dominate the decision.
Meta's Potential Closed-Source Shift
Perhaps the most significant development in the open vs. closed debate: reports suggest Meta may release its next-generation model ("Avocado") as closed-source — a dramatic departure from the company that has been the open-source movement's most influential champion. If confirmed, this would challenge the narrative that open-source AI is an irreversible trend and could leave DeepSeek and Mistral as the primary standard-bearers for open-weight frontier models.
✅Tip
A practical heuristic: Start with a closed API (Claude Sonnet or GPT-5.5) for rapid prototyping and capability validation. Once your application is working and you understand your actual inference volume and requirements, evaluate whether migrating to an open-weight model is worth the operational complexity. For many production applications, the answer is no. For high-volume, privacy-sensitive, or heavily customized applications, the answer is often yes.
A Decision Framework
Ask these questions in order:
- Can your data leave your infrastructure? If no → open weights only
- Do you need to fine-tune on proprietary data? If yes → open weights preferred
- Is cost at scale a primary constraint? If yes → evaluate open weights
- Do you need maximum current capability? If yes → closed API
- Do you need enterprise compliance features? If yes → closed API (or Bedrock/Azure OpenAI for open models with enterprise wrapping)
Most enterprise applications end up in a hybrid: closed API for complex reasoning and customer-facing intelligence, open weights for high-volume classification and extraction tasks.
Key Takeaways
- "Open source" for AI most commonly means open weights — the model can be downloaded and run locally — which is different from open training data, open training code, or open source software
- Meta open-sources Llama for strategic reasons (commoditize the complement, ecosystem building, talent), not altruism
- OpenAI and Anthropic remain closed for capability moat, safety concerns, and commercial revenue reasons
- DeepSeek V3 (MIT license) and GPT-OSS (Apache 2.0) demonstrate that open-weight models now compete at the frontier; DeepSeek V4 (multimodal) is imminent
- Meta may shift to closed-source for its next-gen model — potentially the most significant change in the open vs. closed landscape since DeepSeek's emergence
- Choose open weights when: data can't leave your infrastructure, customization is needed, cost at scale is critical; choose closed APIs when: maximum capability, rapid deployment, enterprise compliance, or safety are priorities