Google TPU v8 Designs Put HBM Capacity at Center of AI Scaling

Price impact: 5Direction: upSource: StorageReview

StorageReview's coverage of Google's TPU v8t and TPU v8i platforms shows memory capacity and bandwidth becoming central design constraints for large AI systems. The training-focused TPU v8t is described as scaling to 9,600 chips per superpod with 2PB of HBM, while the inference-focused TPU v8i pairs 288GB of HBM with 384MB of on-chip SRAM inside a 1,152-chip scale-up domain. For RamTrend, the important signal is that custom AI accelerators are not reducing the importance of advanced memory. Instead, they are increasing the amount of HBM tied to each large training and inference deployment. Google's Virgo data center fabric and v8 architecture are presented as ways to scale compute, but the platform still depends on very large pools of HBM to keep accelerators fed. The likely market impact is upward for HBM and advanced DRAM demand. Even though the article is about Google accelerators rather than a memory supplier announcement, TPU deployments at this scale can consume substantial HBM volume. If more hyperscalers build custom AI silicon with similar memory footprints, HBM allocation will remain a strategic pressure point for suppliers. This draft should be treated as an AI infrastructure memory signal rather than a direct pricing report. The source provides concrete HBM figures for TPU v8 systems, which makes it useful for RamTrend's view of future server memory demand.

GoogleNVIDIAHBMHBM4HBM4EHBM3EDRAMAI Infrastructure

Original source Back to news archive