Free to read. Sign up to save your progress and take knowledge-check quizzes.

Sign up free
6 min read·Updated May 4, 2026

AMD Instinct

AMD logoBy AMD

AMD Instinct is the company's datacenter AI accelerator line — currently the MI355X (CDNA 4, 288 GB HBM3e, GA Q3 2025), with the MI400 series announced at CES 2026 to compete with NVIDIA Vera Rubin. Major customer wins include Oracle, Microsoft Azure, OpenAI, and Meta.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

Video coming soon

Learning Objectives

  • Understand where AMD Instinct fits in the datacenter AI accelerator market and how it compares to NVIDIA's H200 and Blackwell generation
  • Identify the MI355X's headline specs and the announced MI400 family, including the memory advantage that drives AMD's competitive pitch
  • Evaluate the major 2025-2026 customer wins (Oracle, Microsoft Azure, OpenAI, Meta) and what they say about AMD's structural position

What Is AMD Instinct?

AMD Instinct is AMD's datacenter AI accelerator line — the company's direct response to NVIDIA's H100, H200, and Blackwell GPUs. It is the only at-scale alternative shipping today to NVIDIA in the datacenter AI training and inference market, and Instinct deployments back the bulk of AMD's case to be the credible number-two AI silicon vendor.

The current shipping flagship is the MI355X (4th-generation CDNA architecture, generally available Q3 2025). At CES 2026, AMD unveiled the MI400 series — the MI430X, MI440X, and MI455X — built on a new architecture and aimed at NVIDIA's Vera Rubin generation. The MI400 series begins shipping mid-to-second-half 2026; until then, MI355X is what hyperscalers actually buy.

💡Key Concept

CDNA versus RDNA: AMD splits its GPU architectures into two families. CDNA (Compute DNA) is the datacenter-only AI training and inference architecture used in Instinct. RDNA (Radeon DNA) is the gaming and graphics architecture used in Radeon consumer GPUs. CDNA strips out the graphics-only logic and adds matrix-engine acceleration, larger HBM stacks, and Infinity Fabric scaling — all the pieces datacenter AI workloads need.

MI355X — Current Shipping Flagship

The MI355X is built on a 3nm process and ships with 288 GB of HBM3e memory — about 50% more capacity than NVIDIA's Blackwell B200 (192 GB) and the headline number AMD leads with in customer pitches. Memory capacity matters because the largest models (frontier 400-billion-plus parameter dense models, 1-trillion-plus MoE models) routinely overflow per-GPU memory on H100 and H200, forcing slower model-parallel splits.

SpecMI355XNVIDIA B200
ArchitectureCDNA 4 (3nm)Blackwell
HBM memory288 GB HBM3e192 GB HBM3e
Memory bandwidth8 TB/sec8 TB/sec
FP16 throughput2.3 petaFLOPS~5 petaFLOPS
FP8 throughput4.6 petaFLOPS~10 petaFLOPS
FP4 throughput9.2 petaFLOPS~20 petaFLOPS
GA timingQ3 2025H1 2025

NVIDIA still leads on raw throughput at every precision tier; AMD's pitch is the memory ceiling plus aggressive pricing on rack-scale deployments.

MI400 Series — Announced at CES 2026

At CES 2026 (January 5), AMD unveiled the MI400 family — three SKUs aimed at the datacenter, deployed in the new "Helios" rack-scale architecture. AMD's announced flagship MI455X claims 432 GB of HBM4, 320 billion transistors, and up to 40 petaFLOPS of FP4 — roughly 4 times the platform peak of the MI300X family.

⚠️Warning

Vendor-claimed, pre-shipping. MI400 series numbers come from AMD's own announcement decks and have not yet been independently benchmarked. Production volume begins mid-2026, with Helios rack systems targeting Q3 2026. Treat MI400 specs as design targets until silicon ships in third-party hands.

Major 2025-2026 Customer Wins

The deployments below validate AMD as a serious second source rather than a bench warmer:

  • Oracle Cloud Infrastructure — General availability on MI355X via OCI, with single clusters of 130,000-plus MI355X GPUs (announced as the world's largest single-cluster Instinct deployment). Oracle has further committed to deploying 50,000 MI450 GPUs beginning Q3 2026.
  • Microsoft Azure — MI300X is in production for select Azure AI inference workloads; Microsoft is reportedly evaluating MI355X for the next refresh.
  • OpenAI — A signed multi-year supply deal in October 2025 for 6 gigawatts of AMD AI compute, with the first 1-gigawatt MI450 datacenter starting deployment in 2026.
  • Meta — Public commitment to MI350-class deployments for Llama-family training and inference.
  • xAI — Named as a Helios architecture customer for the MI400 era.

📝Note

The OpenAI deal is signed, not yet deployed. The 6-gigawatt headline number is the multi-year contracted capacity; actual silicon comes online in tranches starting 2026.

ROCm — The Software Side

Hardware does not run AI on its own. AMD's open software stack ROCm is what lets PyTorch, vLLM, and the broader open-source LLM ecosystem actually run on Instinct. The vLLM CI pass rate on Instinct jumped from 37 percent in November 2025 to 93 percent by January 2026 — the most-cited proof point that ROCm has closed the gap on NVIDIA's CUDA on the LLM-inference path. Strong ROCm availability is what makes Instinct a credible NVIDIA alternative rather than just a memory-ceiling differentiator.

Pricing

Cloud (OCI, Azure, others)Per-GPU-hour
  • Pay-as-you-go via hyperscaler
  • MI300X / MI355X
  • No upfront commitment
Direct purchaseEnterprise quote
  • Volume server OEM channel
  • Dell, HPE, Supermicro, Lenovo
  • Multi-year support
Helios rack-scaleEnterprise quote
  • MI400 series via Helios architecture
  • Q3 2026 onward
  • For frontier-model operators

AMD does not publish list prices for Instinct accelerators. Pricing is set per deal — competitive pressure on NVIDIA pricing is widely reported as the reason hyperscalers cite for adding AMD as a second source.

Strengths

  • Memory capacity headline — 288 GB on MI355X versus 192 GB on B200; 432 GB announced for MI455X. Frontier and trillion-parameter MoE models fit in fewer GPUs.
  • Real hyperscaler footprint — OCI, Azure, OpenAI, Meta, xAI deployments are all public and large.
  • Open software stack (ROCm) — Permissive licensing, native consumer-GPU support, and rapidly improving framework parity with CUDA on the inference path.
  • Cross-stack vendor leverage — Customers running EPYC plus Instinct plus Pensando NICs get a single-vendor hardware stack and matching support contracts.
  • ACE standards play — AMD co-authored the new x86 AI Compute Extensions (ACE) standard with Intel in April 2026, signaling cross-vendor cooperation on the CPU side that complements Instinct's GPU position.

Limitations and Considerations

  • Lower raw throughput than B200 — At every precision tier, NVIDIA Blackwell still wins on FLOPS. Memory advantage matters more for inference of very large models than for training throughput.
  • CUDA ecosystem gap — Hyperscaler-grade open-source LLMs run well on ROCm in 2026; the long tail of research code, custom kernels, and vendor-specific libraries still favors NVIDIA. Migrating an existing CUDA-heavy training stack is a real engineering project.
  • MI400 is not yet shipping — Architecture, headline specs, and Helios rack design are public, but production silicon arrives mid-to-second-half 2026. Buyers committing to MI400 today are committing to a roadmap rather than benchmarked hardware.
  • Software toolchain maturity — Profilers, debuggers, and large-model training utilities (DeepSpeed, Megatron-LM) work, but the CUDA versions are typically the reference implementations.

Key Takeaways

  • AMD Instinct is the only credible at-scale NVIDIA alternative for datacenter AI training and inference — currently shipping MI355X with 288 GB of HBM3e memory, the highest per-GPU memory capacity in production
  • The MI400 series unveiled at CES 2026 (MI430X, MI440X, MI455X) targets NVIDIA Vera Rubin in mid-to-second-half 2026, with the MI455X claiming 432 GB of HBM4 and 40 petaFLOPS of FP4
  • Major 2025-2026 customer commitments — Oracle (130,000-plus GPU clusters and 50,000 MI450), Microsoft Azure (MI300X production), OpenAI (6-gigawatt multi-year deal), Meta, and xAI — validate AMD as a structural second source rather than a niche alternative
  • AMD's pitch is a combination of memory-ceiling advantage on frontier models plus the rapidly maturing open ROCm software stack — vLLM CI pass rate on Instinct went from 37 percent to 93 percent across late 2025

Save your progress & take the quiz

Sign up free to bookmark lessons, track which modules you've completed, and lock in what you learned with a quick knowledge-check quiz at the end of each lesson.

🧭Recommended for you