Name: ARM AI Processors
Availability: InStock
Author: SoftBank

Learning Objectives

Understand ARM's architecture-licensing model and why it dominates mobile + edge AI
Identify the Neoverse data-center platform's traction and the new AGI CPU's role
Evaluate when ARM-based AI compute makes sense vs. x86 or specialized accelerators

What Is ARM AI?

ARM (majority-owned by SoftBank since 2016) is a UK chip-architecture company — but unlike Intel or AMD, ARM does not manufacture chips. Instead, it licenses CPU architectures (Cortex, Neoverse) and core IP that other companies build their own chips around. Apple's M-series, Amazon's Graviton, NVIDIA's Grace, Google's Tensor, Qualcomm Snapdragon, MediaTek Dimensity, and the SoCs in every iPhone and most Android phones are all built on ARM architecture.

For AI specifically, ARM has two strategic positions: Cortex dominates mobile and edge AI inference (where most consumer-facing AI compute happens), and Neoverse has aggressively expanded into AI server CPUs — by 2025 reaching nearly 50% market share of CPU shipments among the top hyperscalers. In March 2026, ARM took the next step by announcing AGI, its first in-house designed data-center processor.

💡Key Concept

ARM's architecture-license model: ARM doesn't compete with chipmakers — it sells them the foundation. AWS Graviton is an ARM-architecture chip designed by AWS; NVIDIA Grace is an ARM-architecture chip designed by NVIDIA; Apple M-series is ARM. This means ARM gets royalty revenue on virtually every advanced mobile + AI chip shipping today, without needing to manufacture anything.

✅Tip

Visit ARM: arm.com/markets/artificial-intelligence — architecture licenses sold through ARM enterprise sales; reference designs available for evaluation

Pricing & Access

ARM does not sell chips. ARM sells architecture licenses to chipmakers and system designers, who then design and fabricate their own silicon (typically at TSMC or Samsung).

Plan	Price	Features
Architecture License	Tens of millions of dollars upfront	Allows licensee to design custom CPUs around ARM ISA Apple, NVIDIA, Qualcomm tier Highest customization rights
Core License	Per-design license fee + royalty per chip	Licensee uses ARM-designed cores (Cortex, Neoverse) Most chipmakers operate here Lower upfront cost
Per-Chip Royalty	Single-digit cents to dollars per chip	Paid on every shipping chip Compounds across billions of devices Foundation of ARM's revenue
AGI CPU (announced March 2026)	First-party ARM data-center processor	136 Neoverse V3 cores TSMC 3nm process Production silicon ready, volume shipments end of 2026
Reference Designs	Included with licenses	Server CPU reference, mobile SoC reference Accelerates time-to-market for licensees

Architecture LicenseTens of millions of dollars upfront

Allows licensee to design custom CPUs around ARM ISA
Apple, NVIDIA, Qualcomm tier
Highest customization rights

Core LicensePer-design license fee + royalty per chip

Licensee uses ARM-designed cores (Cortex, Neoverse)
Most chipmakers operate here
Lower upfront cost

Per-Chip RoyaltySingle-digit cents to dollars per chip

Paid on every shipping chip
Compounds across billions of devices
Foundation of ARM's revenue

AGI CPU (announced March 2026)First-party ARM data-center processor

136 Neoverse V3 cores
TSMC 3nm process
Production silicon ready, volume shipments end of 2026

Reference DesignsIncluded with licenses

Server CPU reference, mobile SoC reference
Accelerates time-to-market for licensees

The new AGI CPU is a meaningful strategy shift — for the first time, ARM is selling its own chip rather than just licensing the design.

Core Architecture Components

Neoverse Data Center Platform

ARM's server-CPU platform — Neoverse V (max performance) and N (efficiency) cores. Used by AWS Graviton 4, Microsoft Cobalt 100, Google Axion, Oracle Ampere, and NVIDIA Grace. By 2025, Neoverse-based chips reached billion+ deployed cores and approximately 50% market share of CPU shipments to top hyperscalers — a stunning shift from x86 dominance just five years prior.

Cortex Mobile + Edge

The dominant CPU architecture for mobile AI inference globally. Apple A-series and M-series, Qualcomm Snapdragon, MediaTek Dimensity, and Samsung Exynos all build on Cortex cores. Most everyday AI inference (speech recognition, image processing, on-device LLM) runs on Cortex-based silicon.

AGI CPU (March 2026)

ARM's first in-house data-center processor, announced March 24, 2026. Specs: up to 136 Neoverse V3 cores, built on TSMC 3nm, designed specifically for AI inference orchestration and agentic computing workloads. Lead customer is Meta (using AGI to power AI search ranking and recommendation systems). OpenAI, Cerebras, and Cloudflare also announced as launch partners. Production silicon is ready to order; volume shipments by end of 2026.

NVIDIA NVLink Fusion Integration

Neoverse platform integrates NVIDIA's NVLink Fusion interconnect — letting ARM-based head nodes pair efficiently with NVIDIA GPU clusters in AI training and inference deployments.

SoftBank Vision Fund Synergy

SoftBank also owns Ampere Computing, another ARM server-chip designer, and has an extensive Vision Fund portfolio of AI startups — making the ARM/SoftBank ecosystem strategically connected to the broader AI investment landscape.

Strengths

Architecture ubiquity: Almost every mobile device and increasing share of data centers run ARM — ARM royalties scale with the entire computing market
Power efficiency: ARM's RISC heritage means lower power-per-watt vs. x86 for many workloads — critical for AI inference at scale and for mobile/edge AI
Hyperscaler tailwind: AWS, Microsoft, Google, Oracle all designing custom ARM-based server CPUs — ARM's success compounds with hyperscaler AI capex
AGI CPU launch (2026): First-party data-center chip lets ARM capture more value per deployment than pure architecture licensing
NVIDIA partnership: Neoverse-NVLink Fusion integration positions ARM as the head-node CPU for NVIDIA AI clusters

Limitations & Considerations

Not a chipmaker: ARM ships architecture, not finished chips — execution depends on partner foundries (TSMC, Samsung) and customer silicon teams
Geopolitical sensitivity: UK-based, SoftBank-owned, with Chinese subsidiary (ARM China) operating semi-independently — geopolitical risk varies by jurisdiction
AI accelerator gap: ARM is a CPU architecture; for actual AI training acceleration, NVIDIA GPUs / TPUs / Gaudi remain the workload — ARM is the orchestrator, not the accelerator
Royalty rate sensitivity: Single-digit-cents-per-chip royalties mean ARM revenue is highly sensitive to royalty-rate negotiations with Apple, Qualcomm, and other big customers
AGI is new: AGI CPU production silicon is fresh as of March 2026 — real-world performance and TCO data still emerging through 2026

Best Use Cases

Use Case	Why ARM Fits	Caveat
Hyperscaler custom server CPUs	AWS Graviton, Azure Cobalt, Google Axion all designed on Neoverse	Requires in-house silicon design capability
Mobile AI inference	Cortex dominates phone SoCs, on-device AI runs here	Apple, Qualcomm, MediaTek control the actual chips
AI inference orchestration	Neoverse + AGI CPU specifically designed for it	For training, pair with NVIDIA / TPU / Gaudi GPUs
Edge AI deployment	Power efficiency advantage	Less common for high-throughput inference
AI agent compute	AGI CPU explicitly targets agentic workloads	New product, real-world data pending

When to choose alternatives:

AI training (large model) → NVIDIA H100/H200/B200 or Intel Gaudi 3 GPUs/accelerators
Frontier inference at the largest scale → NVIDIA Grace Hopper (still ARM-based, but the GPU does the work)
Pure-x86 compatibility required → Intel Xeon or AMD EPYC for legacy software stacks

Key Takeaways

ARM is a chip architecture company, not a manufacturer — it licenses CPU IP that powers most mobile AI inference globally and increasingly AI server CPUs
Neoverse server platform has captured approximately 50 percent of CPU shipments to top hyperscalers by 2025, deployed across AWS Graviton, Azure Cobalt, Google Axion, Oracle Ampere, and NVIDIA Grace
AGI CPU (announced March 2026) is ARM's first in-house data-center processor — 136 Neoverse V3 cores on TSMC 3nm, with Meta as lead customer and OpenAI/Cerebras/Cloudflare as launch partners
ARM is the AI orchestrator, not the AI accelerator — pair with NVIDIA GPUs / TPUs / Gaudi for actual training workloads
SoftBank ownership ties ARM strategically to the broader AI investment landscape, including SoftBank's Ampere Computing and the Vision Fund AI portfolio

ARM AI Processors

Audio & video lessons are paid features