Name: Kimi Code
Availability: InStock
Author: Moonshot AI

Learning Objectives

Understand what Kimi Code is and its relationship to the Kimi K2.6 and K2.7 Code models
Evaluate Kimi Code's benchmarks, pricing, and open-source advantages
Assess Moonshot AI's explosive growth as China's fastest AI decacorn

What Is Kimi Code?

Kimi Code is an open-source AI coding agent from Moonshot AI, launched January 27, 2026 alongside the Kimi K2.5 model. Its engine has advanced quickly: the CLI moved to Kimi K2.6 in spring 2026 and is now powered by Kimi K2.7 Code, a coding-tuned model Moonshot released on June 12, 2026. Built on the K2.6 base, K2.7 Code is a 1 trillion parameter mixture-of-experts (MoE) model — 32 billion parameters active per token across 384 experts — with a 256,000 token context window, and it ships its open weights on Hugging Face under a Modified MIT license. The agent's tooling, IDE integrations, and CLI surface are unchanged; the underlying model is simply faster and stronger.

Think of it as China's answer to Claude Code: a terminal-first coding agent that can plan, write, debug, and test code autonomously, with IDE integrations for VS Code, Cursor, and Zed. It is fully open-source under Apache 2.0 with 6,400+ GitHub stars — making it the strongest open-source coding agent available.

As of July 2026, K2.7 Code is also available directly inside GitHub Copilot. Microsoft began rolling it out to Copilot Pro, Pro+, and Max subscribers, with Business and Enterprise plans to follow. GitHub hosts the model on Microsoft Azure and bills it by usage at provider list pricing; on Business and Enterprise plans an administrator has to turn the model on before a team can pick it from the model selector. Its arrival in Copilot is a notable milestone — a Chinese open-weight model now sits alongside the Western frontier models inside the world's most widely used coding assistant.

💡Key Concept

Terminal-First Coding Agent: Like Claude Code, Kimi Code runs primarily in your terminal — you describe what you want built, and it writes code, creates files, runs commands, and debugs errors. The terminal-first approach means it works with any project setup and integrates naturally into existing developer workflows without requiring a specialized IDE.

Key Capabilities

Terminal CLI agent — plan, write, debug, and test code from natural language descriptions
IDE integrations — VS Code (extension), Cursor, and Zed (via Agent Client Protocol)
Available in GitHub Copilot — selectable in Copilot's model picker on the Pro, Pro+, and Max plans (Business and Enterprise coming; admin opt-in required), hosted on Microsoft Azure
MCP support — full Model Context Protocol tool integration for connecting to external services
Dual mode — coding agent mode and shell mode (toggle with Ctrl-X)
Multimodal input — accepts images and videos (screenshot-to-code workflows)
256,000 token context window — handles large codebases and long conversations
1 trillion parameter MoE architecture — 32 billion parameters active per token across 384 experts, balancing capability with efficient inference
Roughly 30 percent fewer reasoning tokens — K2.7 Code reaches its answers with less intermediate "thinking" than K2.6, cutting latency and cost
100 tokens per second output — fast generation speed
Agent Swarm (research preview) — concurrent task execution via Parallel-Agent Reinforcement Learning (PARL), up to 4.5 times faster execution

Benchmark Performance

Moonshot reports that K2.7 Code posts double-digit gains over the previous K2.6 model on its coding-focused evaluations:

Benchmark	K2.6 → K2.7 Code	What It Measures
Kimi Code Bench v2	50.9 → 62.0 (+21.8%)	Real-world coding-agent tasks
Program Bench	48.3 → 53.6 (+11.0%)	General programming ability
MLS Bench Lite	26.7 → 35.1 (+31.5%)	Machine-learning engineering tasks

Moonshot benchmarked K2.7 Code against GPT-5.5 and Claude Opus 4.8, and highlights that it edges out Opus 4.8 on the MCP Mark Verified tool-use evaluation (81.1 versus 76.4) while using roughly 30 percent fewer reasoning tokens. For context, the prior K2.5 generation scored 76.8 percent on SWE-bench Verified — open-source state of the art at its launch.

⚠️Warning

Read vendor benchmarks with care. These figures are Moonshot's own reported results. After K2.7 Code shipped, some practitioners told VentureBeat the headline gains were hard to reproduce in everyday use — a recurring pattern as open-source labs race to claim coding leadership. Treat the numbers as a directional signal and test on your own workloads before switching.

Pricing

Plan	Price	Features
Kimi Code CLI	Free (open weights, Modified MIT)	—
API Input (cache miss)	$0.95 per million tokens	—
API Input (cached)	$0.19 per million tokens	—
API Output	$4.00 per million tokens	—
Kimi Pro Subscription	$8-19/month	—

Kimi Code CLIFree (open weights, Modified MIT)

—

API Input (cache miss)$0.95 per million tokens

—

API Input (cached)$0.19 per million tokens

—

API Output$4.00 per million tokens

—

Kimi Pro Subscription$8-19/month

—

At under one dollar per million input tokens, K2.7 Code's API stays aggressively priced for its tier — still far cheaper than most Western coding models, even after the increase from the K2.5 and K2.6 generations. Cached input drops to roughly 19 cents per million tokens for repeated context.

Kimi Code vs. Competitors

Tool	License	SWE-bench	Key Difference
Kimi Code	Modified MIT (open weights)	K2.5: 76.8%	Free CLI; aggressive API pricing; leading open-source coding agent
Claude Code	Proprietary	~80.9% (Opus)	Higher benchmark scores; backed by Anthropic; subscription-based
GitHub Copilot CLI	Proprietary	Not published	Largest user base; GitHub integration; $10-19/month
Devin	Proprietary	13.86%	Fully autonomous agent; browser + terminal; $20/month + ACU costs
Augment Code	Proprietary	70.6%	Enterprise focus; Context Engine; SOC 2 + ISO 42001

Kimi Code's advantage: The only top-tier coding agent that is fully open-source (Apache 2.0) with competitive benchmarks. You can self-host it for free or use the API at $0.60 per million input tokens.

⚠️Warning

Kimi K2.5 has faced controversy over allegations of not crediting open-source foundations used in its development. If open-source provenance and attribution matter to your organization, review the model's training disclosure before adopting.

Company Details

Detail	Info
Company	Moonshot AI
Founded	March 2023
CEO	Yang Zhilin (age 33; NLP researcher with transformer architecture background)
Headquarters	Beijing, China
Employees	~80-300
Total Raised	~$1.77 billion across 3 rounds
Latest Funding	$2 billion round (May 2026; led by Meituan's Long-Z Investments)
Valuation	$20 billion (May 2026)
Revenue	$240 million+ (2025); 20-day post-K2.5 revenue exceeded entire 2025 annual total
Growth	Overseas API revenue quadrupled; monthly paying users up 170%+
Status	China's fastest company to reach decacorn ($10 billion+) status
Website	kimi.com/code

Strengths

Open weights (Modified MIT) — free to self-host via vLLM, SGLang, or KTransformers; no vendor lock-in
Strong reported benchmarks — double-digit K2.7 Code gains over K2.6 on Moonshot's coding evaluations; edges out Claude Opus 4.8 on its MCP tool-use test
Aggressive API pricing — under one dollar per million input tokens (about 95 cents on a cache miss, 19 cents cached); still far cheaper than most Western alternatives
Broad IDE support — VS Code, Cursor, Zed, and terminal CLI with MCP integration
Explosive company growth — $240 million+ revenue; China's fastest decacorn; backed by Alibaba and Tencent

Limitations and Considerations

China-based — data privacy and regulatory considerations for organizations in regulated industries
Attribution controversy — allegations of insufficient crediting of open-source foundations in K2.5 development
Trails frontier models — 76.8% SWE-bench vs. Claude Opus at 80.9% and Qwen3-Max at 88.3%
Newer product — launched January 2026; less battle-tested than GitHub Copilot or Cursor
English documentation — while the CLI works globally, some documentation and community resources are primarily in Chinese

Key Takeaways

Kimi Code is a leading open-source coding agent — a CLI with VS Code, Cursor, and Zed integrations and MCP support, now powered by the Kimi K2.7 Code model (June 2026) under a Modified MIT open-weights license
K2.7 Code is a 1 trillion parameter mixture-of-experts model (32 billion active, 384 experts) with a 256,000 token context window, built on Kimi K2.6; Moonshot reports gains of about 22 percent on its Kimi Code Bench v2 and 32 percent on its machine-learning engineering benchmark, with roughly 30 percent fewer reasoning tokens
API pricing is about 95 cents per million input tokens (19 cents cached) and four dollars per million output — a step up from the prior generation but still aggressive for the tier
Independent reviewers have cautioned that Moonshot's reported benchmark gains can be hard to reproduce in everyday use, so test on your own workloads before switching
As of July 2026, K2.7 Code is also selectable inside GitHub Copilot (Pro, Pro+, and Max plans), hosted on Microsoft Azure — a Chinese open-weight model reaching the West's most widely used coding assistant
Best for developers wanting a fast, low-cost, open-weights alternative to Claude Code with strong tool-use performance

Kimi Code

Audio & video lessons are paid features