Learning Objectives
- Understand ARM's architecture-licensing model and why it dominates mobile + edge AI
- Identify the Neoverse data-center platform's traction and the new AGI CPU's role
- Evaluate when ARM-based AI compute makes sense vs. x86 or specialized accelerators
What Is ARM AI?
ARM (majority-owned by SoftBank since 2016) is a UK chip-architecture company — but unlike Intel or AMD, ARM does not manufacture chips. Instead, it licenses CPU architectures (Cortex, Neoverse) and core IP that other companies build their own chips around. Apple's M-series, Amazon's Graviton, NVIDIA's Grace, Google's Tensor, Qualcomm Snapdragon, MediaTek Dimensity, and the SoCs in every iPhone and most Android phones are all built on ARM architecture.
For AI specifically, ARM has two strategic positions: Cortex dominates mobile and edge AI inference (where most consumer-facing AI compute happens), and Neoverse has aggressively expanded into AI server CPUs — by 2025 reaching nearly 50% market share of CPU shipments among the top hyperscalers. In March 2026, ARM took the next step by announcing AGI, its first in-house designed data-center processor.
💡Key Concept
ARM's architecture-license model: ARM doesn't compete with chipmakers — it sells them the foundation. AWS Graviton is an ARM-architecture chip designed by AWS; NVIDIA Grace is an ARM-architecture chip designed by NVIDIA; Apple M-series is ARM. This means ARM gets royalty revenue on virtually every advanced mobile + AI chip shipping today, without needing to manufacture anything.
✅Tip
Visit ARM: arm.com/markets/artificial-intelligence — architecture licenses sold through ARM enterprise sales; reference designs available for evaluation
Pricing & Access
ARM does not sell chips. ARM sells architecture licenses to chipmakers and system designers, who then design and fabricate their own silicon (typically at TSMC or Samsung).
- Allows licensee to design custom CPUs around ARM ISA
- Apple, NVIDIA, Qualcomm tier
- Highest customization rights
- Licensee uses ARM-designed cores (Cortex, Neoverse)
- Most chipmakers operate here
- Lower upfront cost
- Paid on every shipping chip
- Compounds across billions of devices
- Foundation of ARM's revenue
- 136 Neoverse V3 cores
- TSMC 3nm process
- Production silicon ready, volume shipments end of 2026
- Server CPU reference, mobile SoC reference
- Accelerates time-to-market for licensees
The new AGI CPU is a meaningful strategy shift — for the first time, ARM is selling its own chip rather than just licensing the design.
Core Architecture Components
Neoverse Data Center Platform
ARM's server-CPU platform — Neoverse V (max performance) and N (efficiency) cores. Used by AWS Graviton 4, Microsoft Cobalt 100, Google Axion, Oracle Ampere, and NVIDIA Grace. By 2025, Neoverse-based chips reached billion+ deployed cores and approximately 50% market share of CPU shipments to top hyperscalers — a stunning shift from x86 dominance just five years prior.
Cortex Mobile + Edge
The dominant CPU architecture for mobile AI inference globally. Apple A-series and M-series, Qualcomm Snapdragon, MediaTek Dimensity, and Samsung Exynos all build on Cortex cores. Most everyday AI inference (speech recognition, image processing, on-device LLM) runs on Cortex-based silicon.
AGI CPU (March 2026)
ARM's first in-house data-center processor, announced March 24, 2026. Specs: up to 136 Neoverse V3 cores, built on TSMC 3nm, designed specifically for AI inference orchestration and agentic computing workloads. Lead customer is Meta (using AGI to power AI search ranking and recommendation systems). OpenAI, Cerebras, and Cloudflare also announced as launch partners. Production silicon is ready to order; volume shipments by end of 2026.
NVIDIA NVLink Fusion Integration
Neoverse platform integrates NVIDIA's NVLink Fusion interconnect — letting ARM-based head nodes pair efficiently with NVIDIA GPU clusters in AI training and inference deployments.
SoftBank Vision Fund Synergy
SoftBank also owns Ampere Computing, another ARM server-chip designer, and has an extensive Vision Fund portfolio of AI startups — making the ARM/SoftBank ecosystem strategically connected to the broader AI investment landscape.
Strengths
- Architecture ubiquity: Almost every mobile device and increasing share of data centers run ARM — ARM royalties scale with the entire computing market
- Power efficiency: ARM's RISC heritage means lower power-per-watt vs. x86 for many workloads — critical for AI inference at scale and for mobile/edge AI
- Hyperscaler tailwind: AWS, Microsoft, Google, Oracle all designing custom ARM-based server CPUs — ARM's success compounds with hyperscaler AI capex
- AGI CPU launch (2026): First-party data-center chip lets ARM capture more value per deployment than pure architecture licensing
- NVIDIA partnership: Neoverse-NVLink Fusion integration positions ARM as the head-node CPU for NVIDIA AI clusters
Limitations & Considerations
- Not a chipmaker: ARM ships architecture, not finished chips — execution depends on partner foundries (TSMC, Samsung) and customer silicon teams
- Geopolitical sensitivity: UK-based, SoftBank-owned, with Chinese subsidiary (ARM China) operating semi-independently — geopolitical risk varies by jurisdiction
- AI accelerator gap: ARM is a CPU architecture; for actual AI training acceleration, NVIDIA GPUs / TPUs / Gaudi remain the workload — ARM is the orchestrator, not the accelerator
- Royalty rate sensitivity: Single-digit-cents-per-chip royalties mean ARM revenue is highly sensitive to royalty-rate negotiations with Apple, Qualcomm, and other big customers
- AGI is new: AGI CPU production silicon is fresh as of March 2026 — real-world performance and TCO data still emerging through 2026
Best Use Cases
| Use Case | Why ARM Fits | Caveat |
|---|---|---|
| Hyperscaler custom server CPUs | AWS Graviton, Azure Cobalt, Google Axion all designed on Neoverse | Requires in-house silicon design capability |
| Mobile AI inference | Cortex dominates phone SoCs, on-device AI runs here | Apple, Qualcomm, MediaTek control the actual chips |
| AI inference orchestration | Neoverse + AGI CPU specifically designed for it | For training, pair with NVIDIA / TPU / Gaudi GPUs |
| Edge AI deployment | Power efficiency advantage | Less common for high-throughput inference |
| AI agent compute | AGI CPU explicitly targets agentic workloads | New product, real-world data pending |
When to choose alternatives:
- AI training (large model) → NVIDIA H100/H200/B200 or Intel Gaudi 3 GPUs/accelerators
- Frontier inference at the largest scale → NVIDIA Grace Hopper (still ARM-based, but the GPU does the work)
- Pure-x86 compatibility required → Intel Xeon or AMD EPYC for legacy software stacks
Key Takeaways
- ARM is a chip architecture company, not a manufacturer — it licenses CPU IP that powers most mobile AI inference globally and increasingly AI server CPUs
- Neoverse server platform has captured approximately 50 percent of CPU shipments to top hyperscalers by 2025, deployed across AWS Graviton, Azure Cobalt, Google Axion, Oracle Ampere, and NVIDIA Grace
- AGI CPU (announced March 2026) is ARM's first in-house data-center processor — 136 Neoverse V3 cores on TSMC 3nm, with Meta as lead customer and OpenAI/Cerebras/Cloudflare as launch partners
- ARM is the AI orchestrator, not the AI accelerator — pair with NVIDIA GPUs / TPUs / Gaudi for actual training workloads
- SoftBank ownership ties ARM strategically to the broader AI investment landscape, including SoftBank's Ampere Computing and the Vision Fund AI portfolio