Free to read. Sign up to save your progress and take knowledge-check quizzes.

Sign up free
6 min read·Updated April 29, 2026

Micron HBM3e

Micron logoBy Micron

Micron HBM3e is the high-bandwidth memory powering NVIDIA H200 and B200 AI accelerators — the third major HBM supplier alongside SK hynix and Samsung, with Micron capturing approximately 20-25 percent global market share by end of 2025 and HBM4 transition arriving late 2026.

Listen to this lesson

Free preview · first 0:30
0:00 / 0:30

Audio & video lessons are paid features

Plus unlocks audio streaming. Pro adds downloadable audio, video, certificates, and more.

Plus adds:
  • Audio streaming
  • Downloadable PDFs
  • All AI Playbooks
  • Personalized content
Pro also adds:
  • Certificates of completion
  • Audio MP3 downloads
  • Video lessonssoon
  • & More…soon

Watch this lesson

Video coming soon

Learning Objectives

  • Understand why high-bandwidth memory (HBM) is the dominant constraint on AI accelerator performance
  • Identify Micron's HBM3e generation specs and its role in NVIDIA H200/B200
  • Evaluate the HBM4 transition timeline and its implications for AI compute capacity

What Is Micron HBM3e?

HBM3e is the latest production-grade generation of High Bandwidth Memory — a stacked-DRAM technology designed for the extreme memory bandwidth that AI accelerators require. Micron is one of three companies in the world (alongside SK hynix and Samsung) that manufactures HBM at scale.

For AI specifically, HBM is the gating constraint on accelerator performance more often than raw compute. NVIDIA H200, B200, AMD MI300X, and most other modern AI accelerators are bounded by how fast they can move weights and activations between HBM and the compute cores. Micron's HBM3e ships in NVIDIA H200 GPUs (introduced 2024) and is also qualified for NVIDIA B100/B200.

💡Key Concept

Why HBM matters more than peak FLOPS: A modern AI accelerator can do trillions of floating-point operations per second — but only if the data is sitting in HBM ready to be consumed. For most large-model inference workloads, the time the GPU spends waiting for memory I/O exceeds the time it spends computing. Bigger and faster HBM directly translates to faster real-world performance — which is why H200's 141 GB at 4.8 TB/s materially outperforms H100's 80 GB at 3.35 TB/s on memory-bound workloads.

Tip

Visit Micron HBM3e: micron.com/products/memory/hbm/hbm3e — sold to AI accelerator vendors (NVIDIA, AMD); not sold direct to consumers

Pricing & Access

HBM is sold to a small number of large customers (chip designers and system integrators) under long-term supply agreements. Pricing is not publicly disclosed.

HBM3e 8-Hi 24GBSold to chip designers
  • Volume pricing not public
  • Used in NVIDIA H200
  • Mass production from Q2 2024
HBM3e 12-Hi 36GBHigher-capacity variant
  • Higher per-stack price
  • Used in NVIDIA B100 / B200
  • Targets the highest-tier AI accelerators
HBM4 (transition late 2026)Generational shift incoming
  • 2,048-bit interface (vs HBM3e's 1,024)
  • Up to 2 TB/s per stack
  • Re-architected packaging
Supply allocationsMulti-quarter contracts
  • HBM is the AI compute bottleneck
  • Customers compete for capacity
  • Pricing favors larger commitments

HBM supply has been the rate-limiter on AI accelerator output for the past two years. Customer queues and allocations matter as much as nominal pricing.

Core Capabilities

HBM3e 8-Hi 24 GB Cubes

The mainstream production tier as of 2025-2026. Stacks 8 DRAM dies to deliver 24 GB per cube at over 1.2 TB/s of bandwidth per cube. Used in NVIDIA H200 GPUs (which contain 6 HBM3e cubes for 141 GB total at 4.8 TB/s).

HBM3e 12-Hi 36 GB Cubes

Higher-density variant stacking 12 dies for 36 GB per cube. Targets the highest-tier AI accelerators including NVIDIA B100/B200 (Blackwell). NVIDIA B200 specs 192 GB of HBM3e at 8 TB/s aggregate bandwidth — using these higher-density cubes.

Power Efficiency Lead

Micron has differentiated on power efficiency — early HBM3e benchmarks showed Micron's products consuming meaningfully less power per GB-throughput than competitors, which matters at AI-data-center scale where HBM contributes substantially to total accelerator power.

Market Share Trajectory

Micron entered HBM late — at approximately 5% market share in 2023, with SK hynix and Samsung dominating. By end of 2025 Micron targeted 20-25% market share through aggressive capacity expansion. For NVIDIA's H200 and B100/B200, Micron has secured material design wins, breaking the SK hynix near-monopoly that prevailed earlier.

HBM4 Transition (Late 2026)

The HBM4 specification represents the largest architectural change since HBM's introduction:

  • Interface width doubled from 1,024 to 2,048 bits per stack
  • Transfer speed up to 8 Gbps across the wider interface
  • Total bandwidth up to 2 TB/s per stack (vs. HBM3e's ~1.2 TB/s)
  • New packaging required — meaningful re-tooling for both DRAM makers and accelerator vendors

The HBM4 transition will reshape AI accelerator roadmaps and create another supply-allocation contest among NVIDIA, AMD, Google, and other major buyers.

Stable Supply Agreements

Micron's HBM business runs on multi-quarter (sometimes multi-year) supply agreements with NVIDIA and other accelerator vendors. Customers lock in capacity early; spot purchases are rare.

Strengths

  • Third major HBM supplier: One of only three companies globally producing HBM at scale — NVIDIA, AMD, and other accelerator makers benefit from supply diversity
  • NVIDIA H200 + B200 design wins: Material share of NVIDIA's flagship accelerator memory now sources from Micron, breaking SK hynix near-monopoly
  • Power efficiency: Micron HBM3e benchmarks favorably on power-per-GB-throughput, meaningful at AI-data-center scale
  • Capacity expansion: Aggressive investment in HBM production capacity targeting 20-25% market share
  • HBM4 readiness: Engaged in HBM4 spec development; positioned for late-2026 generational transition
  • Stable customer base: Multi-quarter supply agreements with major AI accelerator vendors

Limitations & Considerations

  • Three-supplier oligopoly: Micron, SK hynix, Samsung control all HBM supply — pricing and allocation dynamics are concentrated and opaque
  • Long lead times: HBM capacity additions take years; customers cannot hedge near-term shortages by buying spot capacity
  • Geopolitical exposure: Most HBM is produced in South Korea (SK hynix, Samsung) and the US (Micron); export-control regimes affect cross-border supply
  • HBM4 transition risk: Generational shifts are disruptive; customer roadmaps must align with HBM4 timing or face dual-architecture supply complexity
  • Customer concentration: NVIDIA is the dominant HBM3e buyer; demand cyclicality at NVIDIA flows directly to Micron HBM revenue
  • Not a finished product: HBM is sold as memory cubes to chip designers — end customers buy GPUs, not HBM directly

Best Use Cases

StakeholderWhy Micron HBM3e MattersHow They Engage
AI accelerator vendors (NVIDIA, AMD)HBM3e is the memory in flagship GPUs (H200, B200, MI300X)Long-term supply agreements; capacity reservations
AI infrastructure operatorsH200/B200 capacity depends on HBM supply chainTrack HBM allocations as a leading indicator of GPU availability
Memory-bound AI workloadsLarger HBM (141GB H200, 192GB B200) enables longer context windowsPick GPUs with larger HBM for long-context inference
National policymakersHBM is a chokepoint technology in US-China AI rivalryExport controls, CHIPS Act incentives shape HBM supply
InvestorsHBM revenue is a leading indicator of AI compute capex cycleTrack Micron + SK hynix HBM revenue trends

When to choose alternatives:

  • HBM is the dominant memory for AI accelerators — there are no direct alternatives at the same bandwidth tier
  • For lower-cost inference at smaller scale, GDDR6X (consumer GPUs) is meaningfully cheaper but bandwidth-limited
  • For specialty AI ASICs (Cerebras, Groq), on-chip SRAM partially replaces HBM for very specific workloads

Key Takeaways

  • Micron HBM3e is the high-bandwidth memory powering NVIDIA H200 (141GB at 4.8 TB/s) and NVIDIA B100/B200 (192GB at 8 TB/s aggregate)
  • HBM is the dominant constraint on AI accelerator real-world performance — bigger and faster HBM materially outperforms equivalent compute with smaller memory
  • Micron is one of three HBM suppliers globally (alongside SK hynix and Samsung), targeting 20-25 percent market share by end of 2025 vs ~5 percent in 2023
  • HBM4 generation arrives late 2026 with doubled interface width (2,048 bits) and up to 2 TB/s per stack — a generational shift requiring new packaging
  • HBM supply is the rate-limiter on global AI accelerator output; customer allocations matter as much as nominal pricing

Save your progress & take the quiz

Sign up free to bookmark lessons, track which modules you've completed, and lock in what you learned with a quick knowledge-check quiz at the end of each lesson.

🧭Recommended for you