Free to read. Sign up to save your progress and take knowledge-check quizzes.

Sign up free
6 min read·Updated May 26, 2026

Kimi (Moonshot AI)

Moonshot AI logoBy Moonshot AI

Kimi is Moonshot AI's AI assistant — now powered by Kimi K2.6 (May 2026), the second-most-used model on OpenRouter, shipped alongside Moonshot's $2 billion raise at a $20 billion valuation. The K2 family combines a 256K context window, natively multimodal 1 trillion-parameter MoE architecture, and the open-source Kimi Code coding tool.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

Video coming soon

Learning Objectives

  • Understand what distinguishes Kimi from other international AI chatbots, particularly its long-context architecture and coding capabilities
  • Identify the use cases where Kimi's extended context and reasoning capabilities add clear value
  • Know the privacy implications of using Kimi's API vs. downloading open-weight models

What Is Kimi?

Kimi is the AI assistant from Moonshot AI (月之暗面), a Beijing-based AI startup founded in 2023 by Yang Zhilin, formerly a researcher at Tsinghua University and Carnegie Mellon University. Moonshot AI has received major investment from Alibaba and is considered one of China's leading frontier AI startups.

Kimi is notable for three core strengths:

  1. Long-context processing — Kimi offers up to 256K tokens natively, with extended modes for even longer inputs
  2. Multi-step reasoning — strong performance on complex tasks that require maintaining context across many reasoning steps
  3. Coding quality — the K2 family beats GPT 5.2 on SWE-Bench Multilingual and outperforms Gemini 3 Pro on SWE-Bench Verified

Kimi is available as a consumer chatbot at kimi.ai and through an API. The underlying K2 family has been released as open-weight models, making it downloadable and self-hostable.

Kimi K2.6 — Current Flagship

Moonshot AI ships Kimi K2.6 as its current flagship model — open-weights released on Hugging Face on April 20, 2026, with the commercial launch on May 7, 2026 alongside a $2 billion funding round at a $20 billion valuation led by Meituan's Long-Z Investments arm with Tsinghua Capital, China Mobile, and CPE Yuanfeng participating. K2.6 ranks as the second-most-used model on OpenRouter behind only the highest-volume US frontier vendors, reflecting both the model's technical position and the cost-pressure that open-weights Chinese labs increasingly exert on Western API pricing.

Moonshot has reported $200 million in annualized recurring revenue — driven by paid subscriptions and direct-API consumption — making it one of the few Chinese AI labs with publicly disclosed revenue at that scale. The round more than doubled Moonshot's prior valuation and positions the lab as a direct counterpart to DeepSeek's $45 billion round.

K2.6 retains the 1 trillion parameter Mixture-of-Experts architecture (32 billion active per token) from K2.5, extends the context window slightly to 262K tokens, adds native INT4 quantization, and introduces the Agent Swarm system — scaling to 300 sub-agents and 4,000 coordinated steps in 12-hour autonomous coding sessions (up from 100 sub-agents and 1,500 steps in K2.5). Released under a Modified MIT License on Hugging Face; also available via the official Moonshot API, the Kimi Code CLI, and Cloudflare Workers AI. For the full K2.6 deep-dive — architecture, Agent Swarm details, benchmarks vs Claude Opus 4.6 / GPT-5.4 / Gemini 3.1 Pro, and the Cursor Composer 2.5 generation-skip decision — see Kimi K2.6 (Moonshot AI).

Kimi K2.5 — Previous Generation

💡Key Concept

The long-context innovation: When Kimi first launched in 2023, its ability to process 200,000-character Chinese documents in a single context window was genuinely unprecedented for a Chinese AI product. While context window sizes have since become larger across the industry, Kimi's early focus on long-context usability — including the ability to upload and reason across entire books, legal contracts, or research papers — built a strong user base among professionals who need to analyze lengthy documents.

Tip

Try Kimi: kimi.ai — free with a phone number; API at platform.moonshot.cn

Core Capabilities

Long-Context Document Analysis

Kimi's most distinctive feature is its ability to process and reason across very long documents — up to 256K tokens in standard mode, with extended modes for even longer inputs. Practical applications:

  • Upload and analyze entire PDFs, research papers, or legal contracts
  • Ask questions across all sections of a long report simultaneously
  • Compare multiple long documents in a single conversation
  • Summarize and extract key points from book-length content

This is particularly valuable in professional contexts where document review is central — law, finance, research, and consulting.

Multi-Step Reasoning

Kimi demonstrates strong performance on tasks that require maintaining logical consistency across many reasoning steps:

  • Complex math and physics problems
  • Programming tasks with multiple interdependent components
  • Analysis tasks requiring synthesis across multiple sources
  • Strategic planning and scenario analysis

Coding Capabilities

Kimi K2.5 is ranked among the top coding models globally — beating GPT 5.2 on SWE-Bench Multilingual and outperforming Gemini 3 Pro on SWE-Bench Verified. Capabilities include:

  • Writing full functions and classes from natural language specifications
  • Reviewing code for bugs, security issues, and style problems
  • Explaining complex codebases and legacy code
  • Generating tests and documentation
  • Debugging across multiple programming languages (Python, JavaScript, TypeScript, Go, Rust, Java, and others)

Kimi Code — Open-Source Coding Tool

Moonshot AI released Kimi Code, an open-source coding tool with integrations for terminals, VS Code, Cursor, and Zed. This brings Kimi's coding capabilities directly into developer workflows, similar to how GitHub Copilot or Cursor integrate AI into editors.

Web Search Integration

Kimi can access the web to provide grounded, up-to-date answers. Useful for research queries where training data may be outdated.

In January 2026, Moonshot AI released Kimi K2.5, a significant upgrade over the earlier K2. K2.5 is natively multimodal — trained on 15 trillion mixed visual and text tokens from the ground up, rather than bolting vision capabilities onto a text-only model. The architectural fundamentals below remain the public reference point for the K2 family pending K2.6's full technical disclosure.

ModelParametersContextHighlights
Kimi K2.51 trillion total (MoE) / ~32 billion active256KNatively multimodal; beats GPT 5.2 on SWE-Bench Multilingual and VideoMMU; beats Claude Opus 4.5 on VideoMMU; outperforms Gemini 3 Pro on SWE-Bench Verified
Kimi K2.5 InstructSame256KInstruction-tuned; ready for immediate use
Kimi K2.5 BaseSame256KBase model for fine-tuning

Kimi K2.5 uses a Mixture-of-Experts architecture with 1 trillion total parameters and approximately 32 billion active per token — similar in structure to DeepSeek V3 but with native multimodal training and a doubled context window.

Pricing & Access

Access MethodCostDetails
kimi.ai (consumer)Free tier availableWeb and mobile app; Chinese phone number required for full features; global access available
Moonshot API (platform.moonshot.cn)Usage-based~$0.12–$2.50 per million tokens depending on context length and model; competitive pricing
Open-weight downloadFreeKimi K2.5 downloadable from Hugging Face; Moonshot permissive license; self-hostable
Third-party providersUsage-basedAvailable via Together.ai and other open-model hosting platforms

⚠️Warning

Data privacy note: Using Kimi's API or kimi.ai sends data to servers in China. Chinese data law applies. Download the Kimi K2.5 open-weight model and run locally for sensitive data, or use a third-party API host in your preferred jurisdiction.

Strengths

  • Long-context document processing: 256K native context window for document analysis use cases
  • Top-tier coding: Beats GPT 5.2 on SWE-Bench Multilingual and Gemini 3 Pro on SWE-Bench Verified
  • Natively multimodal: Trained on 15 trillion mixed visual and text tokens — beats GPT 5.2 and Claude Opus 4.5 on VideoMMU
  • Kimi Code: Open-source coding tool with VS Code, Cursor, Zed, and terminal integrations
  • Multi-step reasoning: Strong performance on problems requiring sustained logical chains
  • Open-weight model available: Kimi K2.5 downloadable under permissive license — privacy and customization friendly
  • Competitive API pricing: More affordable than US frontier models for equivalent capabilities

Limitations & Considerations

  • Registration friction: Full feature access requires a Chinese phone number for verification (though global access is improving)
  • Smaller ecosystem: Fewer integrations and English-language community resources than ChatGPT, Claude, or Gemini
  • Political content restrictions: Similar to other Chinese AI systems, Kimi avoids politically sensitive topics (Taiwan, Tiananmen Square) per Chinese regulations
  • Less multimodal generation: Image generation and voice output capabilities are more limited than US frontier models — K2.5's multimodal strength is in understanding, not generation
  • Western name recognition: Less established brand reputation outside China — fewer professional resources and tutorials in English

Best Use Cases

TaskWhy Kimi
Long document analysis256K native context; analyzes entire books, contracts, or research papers in one session
Complex coding projectsBeats GPT 5.2 on SWE-Bench Multilingual; Kimi Code integrates with VS Code, Cursor, and Zed
Video and multimodal understandingBeats GPT 5.2 and Claude Opus 4.5 on VideoMMU benchmarks
Multi-step technical reasoningSustained logical chain performance comparable to leading reasoning models
Open-weight deploymentKimi K2.5 available for on-premise or custom fine-tuned deployments
Asian-language technical tasksStrong Chinese-language quality with technical depth

When to choose alternatives:

  • Broadest capability range → Claude Opus 4.7, GPT-5.5
  • EU data sovereignty → Mistral Le Chat
  • Open-source with MIT license → DeepSeek R1
  • Source-cited research → Perplexity
  • Enterprise workplace integration → Microsoft 365 Copilot or Google Workspace AI

Getting Started

  1. Visit kimi.ai — create an account (global access available; some features require verification)
  2. Try uploading a long PDF and asking questions that require cross-referencing multiple sections
  3. Test coding tasks — paste in a function description and evaluate the generated code
  4. Try Kimi Code — install it in VS Code or Cursor for AI-assisted coding in your editor
  5. For developers: visit platform.moonshot.cn for API access
  6. For open-weight deployment: search for "Kimi K2.5" on Hugging Face and download via Ollama or vLLM

Key Takeaways

  • Kimi is Moonshot AI's frontier chatbot — Kimi K2.6 (May 2026) is now the flagship and ranks as the second-most-used model on OpenRouter, shipped alongside Moonshot's $2 billion raise at a $20 billion valuation
  • The K2 family beats GPT 5.2 on SWE-Bench Multilingual and VideoMMU and outperforms Gemini 3 Pro on SWE-Bench Verified — making it one of the strongest coding and multimodal models available
  • K2.6 retains the 1 trillion parameter MoE architecture (32 billion active per token), extends the context window to 262K, adds native INT4 quantization, and introduces an Agent Swarm system that scales to 300 sub-agents and 4,000 coordinated steps in 12-hour autonomous sessions — see Kimi K2.6 (Moonshot AI) for the full architecture and benchmark deep-dive
  • Kimi Code is an open-source coding tool with integrations for VS Code, Cursor, Zed, and terminals — bringing Kimi's coding strengths directly into developer workflows
  • The K2 family is released as open-weight models under a permissive license — making them downloadable, self-hostable, and fine-tunable for custom deployments
  • Data privacy concerns apply when using Kimi's cloud services — use the open-weight model locally for sensitive data
  • Moonshot reported $200 million annualized recurring revenue in April 2026, with the May $2 billion round more than doubling its valuation since earlier in 2026

Save your progress & take the quiz

Sign up free to bookmark lessons, track which modules you've completed, and lock in what you learned with a quick knowledge-check quiz at the end of each lesson.

🧭Recommended for you