Free to read. Sign up to save your progress and take knowledge-check quizzes.

Sign up free
7 min read·Updated May 9, 2026

Ling-2.6 (Ant Group)

Ant Group logoBy Ant Group

Ant Group / InclusionAI's 1 trillion parameter open-weights model under MIT license — hybrid Multi-head Latent Attention plus Linear Attention, 262,144-token context, 72.2 on SWE-bench Verified.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

Video coming soon

Learning Objectives

  • Understand what Ling-2.6 is and how Ant Group's InclusionAI fits into the broader Chinese open-weights story
  • Identify the technical and licensing differences between Ling-2.6 and other trillion-parameter models
  • Evaluate when an open-weights MIT-licensed Chinese model is the right fit versus US closed-weights alternatives

What Is Ling-2.6?

Ling-2.6 is a family of frontier-scale open-weights models published by InclusionAI, the AGI lab inside Ant Group. The flagship variant clocks in at 1 trillion parameters, ships under a permissive MIT license, and uses a hybrid attention architecture combining Multi-head Latent Attention with Linear Attention. Context window is 262,144 tokens.

The family was published on Hugging Face in early May 2026. A companion hosted-only sibling, Ring 2.6, surfaced on OpenRouter at the same trillion-parameter scale. The two share the family branding but follow different distribution strategies — Ling weights are downloadable, Ring is API-only.

💡Key Concept

Why this matters beyond the benchmark numbers: Ant Group is one of the largest fintech companies in the world (operator of Alipay), not a research-pure AI lab — and it is choosing to ship trillion-parameter weights under MIT. That distribution posture is meaningfully different from US frontier labs (OpenAI, Anthropic, Google), where flagship weights stay closed and only research-grade or older-generation models go open. Ling-2.6 is part of a broader Chinese pattern that includes DeepSeek V4, Qwen, GLM-5, and Kimi K2.6 — multiple major Chinese tech operators simultaneously shipping at the trillion-parameter scale under permissive licenses.

Headline Benchmarks

InclusionAI's published benchmark numbers position Ling-2.6 at the open-source state of the art on coding evaluations:

BenchmarkScoreNotes
SWE-bench Verified72.2Among the strongest scores any open-weights model has posted
AIME 2026Strong (specific score)Advanced math reasoning
BFCL-V4Strong (specific score)Function-calling evaluation
TAU2-BenchStrong (specific score)Multi-tool execution
IFBenchStrong (specific score)Instruction following
MRCR (16K to 256K)Strong (specific score)Long-context retrieval
Artificial Analysis Intelligence Index34Composite score across published evals

The SWE-bench Verified result is the headline: 72.2 on a real-world software engineering benchmark, achieved at 1 trillion parameters under MIT license, is a meaningful proof point that frontier-grade open weights can compete with closed flagships on engineering tasks.

Architecture

Ling-2.6 uses a hybrid attention architecture:

  • Multi-head Latent Attention (MLA) — the same attention compression technique that DeepSeek pioneered with V2 and V3, reducing the KV cache footprint at inference time
  • Linear Attention — additional attention layers with linear-time complexity, helping push the practical context window to 262,144 tokens without quadratic memory blow-up

Trained on 1 trillion tokens, with the 1 trillion parameter variant supporting tensor parallelism across 8 GPUs for inference. The model accepts F32, BF16, and F8_E4M3 tensor types — standard open-source flexibility.

Distribution and Access

Ling-2.6 follows the now-familiar Chinese open-weights distribution pattern:

  • Open weights on Hugging Face under MIT license at huggingface.co/inclusionAI/Ling-2.6-1T
  • Hosted inference via ZenMux (the recommended path) or OpenRouter
  • Companion hosted-only model — Ring 2.6 at the same trillion-parameter scale, currently visible on OpenRouter
  • No API key requirement for self-hosting — the open-weights track is fully unrestricted under MIT

The MIT license is materially more permissive than the Llama or Qwen licenses, which carry usage restrictions for very large operators. MIT lets any organization — research lab, enterprise, or competing AI company — deploy Ling-2.6 in production without licensing negotiation.

Hardware Requirements

Trillion-parameter models are not casual deployments. Practical inference requirements:

ConfigurationHardwareNotes
Recommended baseline8 GPUs with tensor parallelismInclusionAI's documented config
GPU classNVIDIA H100 / H200 / B200 or AMD MI300XSufficient HBM per device for the parameter shards
Quantized variantsFP8 (E4M3) supportedReduces memory pressure
Hosted inferenceZenMux or OpenRouterFor organizations that don't want to operate the cluster

For most enterprise users, hosted inference via ZenMux or OpenRouter is the practical access path. Self-hosting makes sense for organizations with existing GPU clusters and a data-residency or trade-secret reason to keep inference inside their own perimeter.

Strengths

  • Trillion-parameter scale at MIT license: One of the only frontier-scale open weights with no usage restrictions
  • Strong on coding: 72.2 on SWE-bench Verified is competitive with closed-weights flagships
  • Long context: 262,144 tokens via the hybrid Linear Attention layers
  • Distribution flexibility: Self-host the open weights, or use ZenMux / OpenRouter for hosted inference
  • Backed by Ant Group: A major financial-services operator with deep applied-AI expertise; not a small research lab that might disappear

Limitations & Considerations

  • Hardware floor is high: 8 GPUs for tensor parallelism puts self-hosting out of reach for most individual developers
  • Data privacy via Chinese hosted providers: If you use ZenMux or other Chinese-operated hosting, normal cross-border data-handling considerations apply (see Module 5 lesson on Chinese AI data privacy)
  • Newer release: Ling-2.6 was published in early May 2026; production patterns and tooling are still emerging
  • Limited public technical detail: InclusionAI publishes model cards and benchmark numbers, not the full reproducible training recipes published by Llama or some Mistral releases
  • Companion Ring 2.6 is hosted-only: The trillion-parameter Ring model is not currently distributed as open weights, despite the family branding

Best Use Cases

ScenarioWhy Ling-2.6
Self-hosted frontier model with no usage restrictionsMIT license clears commercial deployment without negotiation
Code-generation workloads requiring open weights72.2 on SWE-bench Verified at MIT license is a strong combination
Long-context tasks (large codebases, document analysis)262,144-token window via Linear Attention layers
Data-residency or trade-secret-sensitive enterprise inferenceRun the model inside your own perimeter on your own GPUs
Research and academic work requiring full weight accessMIT license and Hugging Face availability enable downstream fine-tuning

When to choose alternatives:

  • Need US-jurisdiction compliance with no Chinese-origin models → Llama 4, Mistral Large 3, or Anthropic Claude API
  • Smaller deployment footprint → Qwen 3.6 (35 billion parameter variant), Gemma 4, or DeepSeek-V3 distilled variants
  • Hosted inference with US or EU data residency → ChatGPT, Claude, Gemini, or AWS Bedrock

How Ling-2.6 Fits in the Open-Weights Landscape

ModelOriginLicenseParameter ScaleHeadline Benchmark
Ling-2.6 (Ant Group)ChinaMIT1 trillion72.2 on SWE-bench Verified
DeepSeek V4ChinaMIT685 billionStrong on math and coding
Qwen 3.6China (Alibaba)Apache 2.0 with use restrictionsUp to 235 billion (current largest open variant)Strong all-rounder
Llama 4US (Meta)Custom Llama license with use restrictionsUp to 405 billionStrong all-rounder
Mistral Large 3FranceCustom Mistral license123 billionEuropean frontier baseline
GLM-5China (Zhipu)Open weightsHundreds of billionsTrained entirely on Huawei Ascend hardware

Ling-2.6's combination of trillion-parameter scale plus MIT license places it at the more permissive end of the open-weights spectrum — closer to DeepSeek's distribution posture than to Llama or Qwen, which carry use restrictions.

Key Takeaways

  • Ling-2.6 is Ant Group's InclusionAI lab's open-weights frontier model family — including a 1 trillion parameter variant published on Hugging Face under MIT license in early May 2026
  • The 72.2 score on SWE-bench Verified is among the strongest results any open-weights model has posted on a coding evaluation; combined with the 262,144-token context window and MIT license, this puts Ling-2.6 in a small group of frontier-scale open-weights models suitable for serious enterprise deployment
  • The release is part of a broader Chinese pattern — Ant Group, DeepSeek, Alibaba, Moonshot, Zhipu — where multiple major operators are simultaneously shipping at the trillion-parameter scale under permissive licenses, in contrast to the US closed-weights majority

Save your progress & take the quiz

Sign up free to bookmark lessons, track which modules you've completed, and lock in what you learned with a quick knowledge-check quiz at the end of each lesson.

🧭Recommended for you