Name: Micron HBM3e
Availability: InStock
Author: Micron

Learning Objectives

Understand why high-bandwidth memory (HBM) is the dominant constraint on AI accelerator performance
Identify Micron's HBM3e generation specs and its role in NVIDIA H200/B200
Evaluate the HBM4 transition timeline and its implications for AI compute capacity

What Is Micron HBM3e?

HBM3e is the latest production-grade generation of High Bandwidth Memory — a stacked-DRAM technology designed for the extreme memory bandwidth that AI accelerators require. Micron is one of three companies in the world (alongside SK hynix and Samsung) that manufactures HBM at scale.

For AI specifically, HBM is the gating constraint on accelerator performance more often than raw compute. NVIDIA H200, B200, AMD MI300X, and most other modern AI accelerators are bounded by how fast they can move weights and activations between HBM and the compute cores. Micron's HBM3e ships in NVIDIA H200 GPUs (introduced 2024) and is also qualified for NVIDIA B100/B200.

💡Key Concept

Why HBM matters more than peak FLOPS: A modern AI accelerator can do trillions of floating-point operations per second — but only if the data is sitting in HBM ready to be consumed. For most large-model inference workloads, the time the GPU spends waiting for memory I/O exceeds the time it spends computing. Bigger and faster HBM directly translates to faster real-world performance — which is why H200's 141 GB at 4.8 TB/s materially outperforms H100's 80 GB at 3.35 TB/s on memory-bound workloads.

✅Tip

Visit Micron HBM3e: micron.com/products/memory/hbm/hbm3e — sold to AI accelerator vendors (NVIDIA, AMD); not sold direct to consumers

Pricing & Access

HBM is sold to a small number of large customers (chip designers and system integrators) under long-term supply agreements. Pricing is not publicly disclosed.

Plan	Price	Features
HBM3e 8-Hi 24GB	Sold to chip designers	Volume pricing not public Used in NVIDIA H200 Mass production from Q2 2024
HBM3e 12-Hi 36GB	Higher-capacity variant	Higher per-stack price Used in NVIDIA B100 / B200 Targets the highest-tier AI accelerators
HBM4 (transition late 2026)	Generational shift incoming	2,048-bit interface (vs HBM3e's 1,024) Up to 2 TB/s per stack Re-architected packaging
Supply allocations	Multi-quarter contracts	HBM is the AI compute bottleneck Customers compete for capacity Pricing favors larger commitments

HBM3e 8-Hi 24GBSold to chip designers

Volume pricing not public
Used in NVIDIA H200
Mass production from Q2 2024

HBM3e 12-Hi 36GBHigher-capacity variant

Higher per-stack price
Used in NVIDIA B100 / B200
Targets the highest-tier AI accelerators

HBM4 (transition late 2026)Generational shift incoming

2,048-bit interface (vs HBM3e's 1,024)
Up to 2 TB/s per stack
Re-architected packaging

Supply allocationsMulti-quarter contracts

HBM is the AI compute bottleneck
Customers compete for capacity
Pricing favors larger commitments

HBM supply has been the rate-limiter on AI accelerator output for the past two years. Customer queues and allocations matter as much as nominal pricing.

Core Capabilities

HBM3e 8-Hi 24 GB Cubes

The mainstream production tier as of 2025-2026. Stacks 8 DRAM dies to deliver 24 GB per cube at over 1.2 TB/s of bandwidth per cube. Used in NVIDIA H200 GPUs (which contain 6 HBM3e cubes for 141 GB total at 4.8 TB/s).

HBM3e 12-Hi 36 GB Cubes

Higher-density variant stacking 12 dies for 36 GB per cube. Targets the highest-tier AI accelerators including NVIDIA B100/B200 (Blackwell). NVIDIA B200 specs 192 GB of HBM3e at 8 TB/s aggregate bandwidth — using these higher-density cubes.

Power Efficiency Lead

Micron has differentiated on power efficiency — early HBM3e benchmarks showed Micron's products consuming meaningfully less power per GB-throughput than competitors, which matters at AI-data-center scale where HBM contributes substantially to total accelerator power.

Micron entered HBM late — at approximately 5% market share in 2023, with SK hynix and Samsung dominating. By end of 2025 Micron targeted 20-25% market share through aggressive capacity expansion. For NVIDIA's H200 and B100/B200, Micron has secured material design wins, breaking the SK hynix near-monopoly that prevailed earlier.

Through mid-2026 the bottleneck has only tightened: Micron has reported its HBM output effectively sold out for the year, with memory emerging as the single scarcest component in an AI server and HBM now its fastest-growing product line. That demand has made memory makers — not just GPU designers — central to how fast the industry can add AI capacity.

HBM4 Transition (Late 2026)

The HBM4 specification represents the largest architectural change since HBM's introduction:

Interface width doubled from 1,024 to 2,048 bits per stack
Transfer speed up to 8 Gbps across the wider interface
Total bandwidth up to 2 TB/s per stack (vs. HBM3e's ~1.2 TB/s)
New packaging required — meaningful re-tooling for both DRAM makers and accelerator vendors

The HBM4 transition will reshape AI accelerator roadmaps and create another supply-allocation contest among NVIDIA, AMD, Google, and other major buyers.

Stable Supply Agreements

Micron's HBM business runs on multi-quarter (sometimes multi-year) supply agreements with NVIDIA and other accelerator vendors. Customers lock in capacity early; spot purchases are rare.

Strengths

Third major HBM supplier: One of only three companies globally producing HBM at scale — NVIDIA, AMD, and other accelerator makers benefit from supply diversity
NVIDIA H200 + B200 design wins: Material share of NVIDIA's flagship accelerator memory now sources from Micron, breaking SK hynix near-monopoly
Power efficiency: Micron HBM3e benchmarks favorably on power-per-GB-throughput, meaningful at AI-data-center scale
Capacity expansion: Aggressive investment in HBM production capacity targeting 20-25% market share
HBM4 readiness: Engaged in HBM4 spec development; positioned for late-2026 generational transition
Stable customer base: Multi-quarter supply agreements with major AI accelerator vendors

Limitations & Considerations

Three-supplier oligopoly: Micron, SK hynix, Samsung control all HBM supply — pricing and allocation dynamics are concentrated and opaque
Long lead times: HBM capacity additions take years; customers cannot hedge near-term shortages by buying spot capacity
Geopolitical exposure: Most HBM is produced in South Korea (SK hynix, Samsung) and the US (Micron); export-control regimes affect cross-border supply
HBM4 transition risk: Generational shifts are disruptive; customer roadmaps must align with HBM4 timing or face dual-architecture supply complexity
Customer concentration: NVIDIA is the dominant HBM3e buyer; demand cyclicality at NVIDIA flows directly to Micron HBM revenue
Not a finished product: HBM is sold as memory cubes to chip designers — end customers buy GPUs, not HBM directly

Best Use Cases

Stakeholder	Why Micron HBM3e Matters	How They Engage
AI accelerator vendors (NVIDIA, AMD)	HBM3e is the memory in flagship GPUs (H200, B200, MI300X)	Long-term supply agreements; capacity reservations
AI infrastructure operators	H200/B200 capacity depends on HBM supply chain	Track HBM allocations as a leading indicator of GPU availability
Memory-bound AI workloads	Larger HBM (141GB H200, 192GB B200) enables longer context windows	Pick GPUs with larger HBM for long-context inference
National policymakers	HBM is a chokepoint technology in US-China AI rivalry	Export controls, CHIPS Act incentives shape HBM supply
Investors	HBM revenue is a leading indicator of AI compute capex cycle	Track Micron + SK hynix HBM revenue trends

When to choose alternatives:

HBM is the dominant memory for AI accelerators — there are no direct alternatives at the same bandwidth tier
For lower-cost inference at smaller scale, GDDR6X (consumer GPUs) is meaningfully cheaper but bandwidth-limited
For specialty AI ASICs (Cerebras, Groq), on-chip SRAM partially replaces HBM for very specific workloads

Key Takeaways

Micron HBM3e is the high-bandwidth memory powering NVIDIA H200 (141GB at 4.8 TB/s) and NVIDIA B100/B200 (192GB at 8 TB/s aggregate)
HBM is the dominant constraint on AI accelerator real-world performance — bigger and faster HBM materially outperforms equivalent compute with smaller memory
Micron is one of three HBM suppliers globally (alongside SK hynix and Samsung), targeting 20-25 percent market share by end of 2025 vs ~5 percent in 2023
HBM4 generation arrives late 2026 with doubled interface width (2,048 bits) and up to 2 TB/s per stack — a generational shift requiring new packaging
HBM supply is the rate-limiter on global AI accelerator output; customer allocations matter as much as nominal pricing

Micron HBM3e

Audio & video lessons are paid features

Learning Objectives

What Is Micron HBM3e?

Pricing & Access

Core Capabilities

HBM3e 8-Hi 24 GB Cubes

HBM3e 12-Hi 36 GB Cubes

Power Efficiency Lead

HBM4 Transition (Late 2026)

Stable Supply Agreements

Strengths

Limitations & Considerations

Best Use Cases

Key Takeaways

Save your progress & take the quiz

Audio & video lessons are paid features

Learning Objectives

What Is Micron HBM3e?

Pricing & Access

Core Capabilities

HBM3e 8-Hi 24 GB Cubes

HBM3e 12-Hi 36 GB Cubes

Power Efficiency Lead

Market Share Trajectory

HBM4 Transition (Late 2026)

Stable Supply Agreements

Strengths

Limitations & Considerations

Best Use Cases

Key Takeaways

Save your progress & take the quiz