TL;DR

High Bandwidth Memory has become the main pressure point in the 2026 memory squeeze, according to Thorsten Meyer AI’s late-June report. The report says AI accelerators are absorbing wafer capacity that would otherwise support DDR5 RAM and some GDDR7 graphics-card supply.

High Bandwidth Memory has become the main capacity drain in the 2026 memory crunch, according to Thorsten Meyer AI’s late-June report, as AI-chip demand pulls fab output away from DDR5 RAM and leaves related GDDR7 GPU memory tight.

The hardware facts are well established: HBM is stacked DRAM, not a flat memory stick. It layers 8 to 16 DRAM dies, connects them with through-silicon vias, and places the stack beside an AI accelerator to feed data at far higher bandwidth than standard graphics memory.

The report says the trade-off is manufacturing efficiency. One bit of HBM can consume about three to four times the wafer area of one bit of DDR5, and one defect in a stacked package can spoil the entire memory tower. Pricing cited in the report puts HBM3 near $200 per stack, HBM3E near $300, and HBM4 at an estimated $500.

The supply claims are more provisional. Thorsten Meyer AI, citing Silicon Analysts, TrendForce, DigiTimes, Introl, Unibetter, Astute Group and Reuters, says SK Hynix leads the HBM market, with Samsung and Micron competing for the next wave. The report also says all three have qualified for HBM4, while DigiTimes-cited reporting says Nvidia cut RTX 50-series output in early 2026 as GDDR7 supply tightened.

At a glance

analysisWhen: late June 2026, fast-moving

The developmentA late-June report identifies High Bandwidth Memory as the component driving much of the 2026 memory crunch and related GPU supply pressure.

AI Dispatch · Reality Check · The Memory Squeeze · Part 2 of 10

HBM ate the fab

The thing the factories make instead of your RAM is a tower of stacked memory bolted to every AI chip. In three years it went from niche part to the component that sets the price of nearly all the world’s memory — and now a chunk of its GPUs.

What it is — and why it’s so wafer-hungry

BASE LOGIC DIE

8–16 DRAM dies · TSVs · 1 stack

A tower, not a sheet

HBM stacks DRAM dies vertically, links them with thousands of through-silicon vias, and sits beside the GPU to deliver 5–10× the bandwidth of normal graphics memory. AI is bandwidth-bound — without it, the world’s most expensive silicon sits starved for data. But stacking is inefficient: one HBM bit eats 3–4× the wafer area of DDR5, and one defect can ruin a whole tower.

≈ 8 HBM stacks wrap every AI GPU

The annual arms race — faster, denser, dearer

HBM3

~819 GB/s

per stack · the H100 era

~$200 / stack

HBM3E

~1.18 TB/s

2026 workhorse · H200, B200

~$300 / stack (+20% for ’26)

HBM4

~2.8 TB/s

new logic base die · Nvidia “Rubin”

~$500 / stack (est.)

The three-horse race for the most coveted chip

SK Hynix

~50–62%

the leader; ~90% of its HBM goes to Nvidia

Samsung

~28–40%

2026 comeback; qualified for Rubin HBM4

Micron

~5–10%

sold out for 2026; HBM4 for inference chips

June 2026: all three qualified for HBM4 — the question shifts from “can you ship?” to “who ships best?”

−30–40%

It didn’t just eat your RAM — it ate your GPU too. With suppliers prioritizing HBM, the GDDR7 memory consumer cards need went short; Nvidia reportedly cut RTX 50-series production by a third or more in H1 2026.

The take

This isn’t artificial scarcity — AI really is bandwidth-bound, HBM really is the fix, and it really does eat 3–4× its weight in fab capacity. The discomfort is structural: one component, coupled to one customer’s demand, now sets the price of nearly all memory and a slice of GPUs. The market is now $35B → ~$100B by 2028, ~41% of all DRAM revenue (was 8% in 2023), and sold out through 2026. The one hope: with all three suppliers finally racing on HBM4, competition can add supply. The matching risk: if AI demand corrects, HBM is where it breaks first. Next: DDR5 now, DDR6 soon.

Sources: Silicon Analysts; Introl; TrendForce; DigiTimes; Unibetter; Astute Group; Reuters. Per-stack pricing is estimated/point-in-time; bandwidth per JEDEC/vendor specs. As of late June 2026, fast-moving.

thorstenmeyerai.com

AI Memory Is Repricing DRAM

The shift matters because HBM competes for fab capacity with the memory used in ordinary PCs, servers and graphics cards. If suppliers can earn far more by dedicating wafers to AI accelerators, less capacity remains for commodity DRAM and consumer GPU memory.

The report estimates the HBM market could rise from about $35 billion to nearly $100 billion by 2028, reaching around 41% of DRAM revenue after accounting for only 8% in 2023. For readers, that means the AI buildout is no longer just a data-center story; it can affect RAM prices, graphics-card availability, and upgrade costs.

Amazon

High Bandwidth Memory (HBM) modules

View Latest Price

As an affiliate, we earn on qualifying purchases.

From H100 To Rubin

The report frames HBM’s rise around three product waves. HBM3 powered the Nvidia H100 era at roughly 819 GB/s per stack. HBM3E pushed bandwidth past 1.18 TB/s for newer accelerators such as H200 and B200.

The next step is HBM4, tied in the report to Nvidia’s coming Rubin platform and estimated at about 2.8 TB/s per stack. The report says the industry’s focus has shifted from whether major suppliers can make HBM4 to which supplier can ship it with the best yield, volume and customer alignment.

“The thing the factories make instead of your RAM is a tower of stacked memory bolted to every AI chip.”
— Thorsten Meyer AI

Supply Race Still Has Gaps

Several details remain unsettled. The exact wafer allocation between HBM, DDR5 and GDDR7 is not public, and the cited per-stack prices are estimates tied to a fast-changing market. Nvidia’s reported RTX 50-series cuts also remain based on industry reporting rather than a full public breakdown from the company.

It is also unclear how durable AI accelerator demand will be through 2027 and 2028. If orders keep rising, HBM supply may stay tight. If demand slows, the same concentrated market could be the first place where pricing pressure appears.

HBM4 Output Sets The Test

The next milestone is the HBM4 ramp from SK Hynix, Samsung and Micron. Investors, PC buyers and data-center customers will watch whether added supply eases DDR5 pricing, improves GDDR7 availability, and reduces dependence on a small group of memory suppliers.

The series says the next installment will turn to DDR5 now and DDR6 soon, which should show how the HBM squeeze is feeding into the memory products most consumers and server buyers actually purchase.

Key Questions

What does ‘HBM ate the fab’ mean?

It means High Bandwidth Memory is taking a larger share of memory wafer capacity, leaving less room for products such as DDR5 and some GDDR7 supply.

Why does AI need HBM?

AI accelerators need huge data flow. HBM sits next to the GPU and provides far more memory bandwidth than conventional graphics memory, helping keep expensive compute chips busy.

Is the shortage confirmed to be artificial?

No. The report argues the squeeze is mainly structural: AI chips need HBM, HBM uses more wafer area, and suppliers earn more from it. Claims about supplier conduct would require separate evidence.

Why are graphics cards affected?

Consumer GPUs use GDDR7, not HBM, but the same memory suppliers and fab decisions influence availability. The report cites industry reporting that RTX 50-series output was cut as GDDR7 supply tightened.

What could ease the memory crunch?

More HBM4 output, better yields, and added wafer capacity could help. The timing depends on how quickly SK Hynix, Samsung and Micron can ship at scale.

Source: Thorsten Meyer AI

Pet-care content is informational — consult your veterinarian for advice about your animal.

HBM Ate the Fab

Up next

15 Best Pet Supplies in 2026

Author

The Puppy Store Team

Share article