Free to read. Sign up to save your progress and take knowledge-check quizzes.

Sign up free
7 min read·Updated March 24, 2026

Command A

Cohere logoBy Cohere

Command A is Cohere's flagship enterprise LLM — 111 billion parameters with 256K context, running on just 2 GPUs. It powers the North agentic AI platform behind customer firewalls for enterprise RAG and agentic workflows.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

Video coming soon

Learning Objectives

  • Understand how Command A advances Cohere's enterprise-first strategy and why running on just 2 GPUs matters for enterprise deployment
  • Identify Command A's core capabilities including the North platform, Command A Translate, and Command A Reasoning variants
  • Evaluate when Command A is the right choice versus general-purpose models like Claude, GPT-5.1, or open-source alternatives

What Is Command A?

Command A is the flagship large language model from Cohere, a Canadian AI company founded by former Google Brain researchers (including co-authors of the original Transformer paper). Released in March 2025 as the successor to Command R+, Command A represents a fundamental shift in how enterprise LLMs are designed — optimized not just for quality but for practical deployment economics.

At 111 billion parameters with a 256K token context window, Command A runs on just 2 GPUs — a 150% throughput improvement over its predecessor. This efficiency matters enormously for enterprise deployment: fewer GPUs means lower infrastructure costs, simpler operations, and the ability to run frontier-quality AI behind corporate firewalls without building a small data center.

Command A matches or exceeds GPT-4o and DeepSeek V3 on enterprise agentic task benchmarks while costing significantly less to deploy. Cohere has built Command A specifically for enterprise use cases where AI needs to work with company-specific data, comply with data residency requirements, and integrate into existing business workflows.

Cohere is now valued at $7 billion (September 2025), generating over $240 million in annual recurring revenue, with an IPO expected in 2026. The company's customers include RBC (Royal Bank of Canada), Dell, and LG CNS — large enterprises that prioritize data control and on-premise deployment.

💡Key Concept

Why 2-GPU deployment matters: Most frontier models require 8-16+ GPUs per instance, creating a significant infrastructure barrier for enterprise deployment. Command A's ability to run on just 2 GPUs means enterprises can deploy it on existing server infrastructure, behind their own firewalls, without massive capital expenditure. This is not just a technical detail — it fundamentally changes the economics of enterprise AI adoption.

Tip

Explore Command A: cohere.com — enterprise API access and North platform. Developer accounts available for prototyping.

Pricing & Access

OptionPriceDetails
Cohere API (Developer)Pay-per-tokenSelf-serve API access; free trial for prototyping
North PlatformEnterprise contractsAgentic AI workspace deployed behind customer firewalls
AWS BedrockPay-per-token (AWS pricing)Access alongside Claude, Llama, Nova in your AWS account
Azure AIPay-per-token (Azure pricing)Available through Azure AI model catalog
Google Cloud Vertex AIPay-per-token (GCP pricing)Available through Vertex AI Model Garden
On-Premise / Air-GappedCustom contractsFull deployment behind firewalls with no external data transfer

Cohere's any-cloud and on-premise deployment model means enterprises are not locked into a single vendor. The same Command A model runs identically whether accessed via API, deployed on a public cloud, or running in an air-gapped environment.

Core Capabilities

Enterprise Agentic Performance

Command A is designed for enterprise agentic tasks — multi-step workflows where the model plans, reasons, retrieves information, and takes action:

  • Matches GPT-4o and DeepSeek V3: On enterprise agentic benchmarks, Command A performs at parity with or better than models that cost significantly more to deploy
  • Tool use and function calling: Robust support for calling external APIs, querying databases, and executing multi-step business workflows
  • RAG-optimized: Built-in retrieval-augmented generation with automatic citation and document grounding — Cohere's long-standing strength

The North Platform

Launched in August 2025, North is Cohere's agentic AI workspace — the enterprise product that Command A powers:

  • Behind-firewall deployment: North runs inside the customer's own infrastructure — sensitive data never leaves the corporate network
  • Enterprise customers: RBC, Dell, LG CNS, and others use North for internal knowledge management, document analysis, and automated workflows
  • Agentic workflows: North agents can search internal documents, reason over complex business data, draft responses, and take action — not just answer questions

Command A Family

Command A is not a single model but a family of specialized variants:

  • Command A Translate: Translation across 23 languages with state-of-the-art quality — purpose-built for enterprise multilingual content rather than using a general model for translation
  • Command A Reasoning: A hybrid reasoning variant optimized for agentic tasks that require multi-step logic, planning, and structured decision-making

Efficient 2-GPU Deployment

Command A's 111 billion parameter architecture is optimized for practical enterprise deployment:

  • 2-GPU minimum: Runs on just 2 GPUs — a 150% throughput improvement over Command R+
  • Any-cloud flexibility: Deploy on AWS, Azure, Google Cloud, or on-premise with AMD Instinct GPU support
  • AMD partnership: Cohere's collaboration with AMD means Command A is optimized for AMD Instinct GPUs — providing an alternative to NVIDIA-dependent deployments
  • Air-gapped deployment: For defense, government, and highly regulated industries, Command A can run in fully disconnected environments

Strengths

  • Enterprise-optimized deployment: Runs on just 2 GPUs — dramatically lower infrastructure cost than competing frontier models
  • Any-cloud, any-hardware: AWS, Azure, GCP, on-premise, air-gapped, and AMD Instinct GPU support — maximum deployment flexibility
  • North platform: Production-ready agentic AI workspace that runs behind customer firewalls (RBC, Dell, LG CNS)
  • RAG leader: Built-in citation generation and document grounding — Cohere's defining capability since founding
  • Specialized variants: Command A Translate (23 languages SOTA) and Reasoning (agentic optimization) address specific enterprise needs
  • Canadian data governance: Headquartered in Canada under PIPEDA — a strong privacy framework for enterprise customers
  • Transformer pedigree: Founded by co-authors of the original Transformer paper — deep architectural expertise
  • $7 billion valuation, $240 million ARR: Proven enterprise traction with major customers, IPO expected 2026

Limitations & Considerations

  • Enterprise-focused: Not designed for individual users or small startups — the North platform and custom deployments target large organizations
  • No consumer chatbot: Unlike ChatGPT or Claude, Cohere does not offer a widely-used consumer chat product — Command A is an API and enterprise platform product
  • Closed model: Model weights are not publicly available; deployment requires a Cohere relationship (API or enterprise contract)
  • Narrower general capabilities: Optimized for enterprise RAG, agentic tasks, and business workflows — may not match the creative writing or casual conversation quality of general-purpose models

Best Use Cases

TaskWhy Command A
Behind-firewall enterprise AINorth platform deploys inside corporate networks — data never leaves the organization
Enterprise knowledge base Q&ARAG-optimized architecture with built-in citations for auditable answers
Multi-cloud enterprise deploymentRuns identically on AWS, Azure, GCP, on-premise, and air-gapped environments
Cost-efficient frontier deployment2-GPU requirement means lower infrastructure costs than 8-16 GPU alternatives
Enterprise translationCommand A Translate covers 23 languages at state-of-the-art quality
Regulated industry applicationsPIPEDA-governed Canadian company with air-gapped deployment for defense and government
AMD GPU infrastructureOptimized for AMD Instinct — an alternative for organizations not on NVIDIA hardware

When to choose alternatives:

  • General-purpose AI assistant for broad tasks → Claude or GPT-5.1
  • Consumer chatbot with broad ecosystem → ChatGPT
  • Open-source enterprise model → Llama 4 or Qwen 3.5
  • Research-focused with web citations → Perplexity
  • Individual developer or small team → Claude API or OpenAI API for simpler pricing

Getting Started

  1. Visit cohere.com and create a developer account for API access
  2. Use the Cohere Playground to test Command A with sample RAG queries and agentic tasks before writing code
  3. Prepare a small document collection (PDFs, internal docs) to test grounding quality on your own data
  4. Integrate using Cohere's SDK (Python, Node.js, Go, Java) — the RAG pipeline is built into the API
  5. Test citation output: ask questions about your uploaded documents and verify that every claim links to a source
  6. For North platform evaluation, contact Cohere sales to discuss behind-firewall deployment, cloud provider preference, and volume pricing

Tip

Enterprise deployment advantage: Command A's 2-GPU requirement changes the conversation from "can we afford frontier AI?" to "where should we deploy it?" For enterprises already running GPU-equipped servers for other workloads, adding Command A is an incremental infrastructure decision rather than a major capital project. Combined with AMD Instinct support and any-cloud flexibility, it removes the two biggest barriers to enterprise AI adoption: cost and vendor lock-in.

Key Takeaways

  • Command A is Cohere's flagship enterprise model — 111 billion parameters running on just 2 GPUs with 256K context, matching GPT-4o and DeepSeek V3 on enterprise agentic benchmarks
  • The North platform (August 2025) deploys agentic AI behind customer firewalls — RBC, Dell, and LG CNS are production customers handling sensitive data on-premise
  • Command A Translate (23 languages) and Command A Reasoning (agentic optimization) provide specialized capabilities beyond the base model
  • For individual users or creative tasks, general-purpose models like Claude or ChatGPT are better choices — Command A is designed for enterprises that need frontier AI deployed behind their own firewalls, on any cloud, with maximum deployment flexibility

Save your progress & take the quiz

Sign up free to bookmark lessons, track which modules you've completed, and lock in what you learned with a quick knowledge-check quiz at the end of each lesson.

🧭Recommended for you