StorageReview's coverage of Google's TPU v8t and TPU v8i platforms shows memory capacity and bandwidth becoming central design constraints for large AI systems. The training-focused TPU v8t is described as scaling to 9,600 chips per superpod with 2PB of HBM, while the inference-focused TPU v8i pairs 288GB of HBM with 384MB of on-chip SRAM inside a 1,152-chip scale-up domain. For RamTrend, the important signal is that custom AI accelerators are not reducing the importance of advanced memory. Instead, they are increasing the amount of HBM tied to each large training and inference deployment. Google's Virgo data center fabric and v8 architecture are presented as ways to scale compute, but the platform still depends on very large pools of HBM to keep accelerators fed. The likely market impact is upward for HBM and advanced DRAM demand. Even though the article is about Google accelerators rather than a memory supplier announcement, TPU deployments at this scale can consume substantial HBM volume. If more hyperscalers build custom AI silicon with similar memory footprints, HBM allocation will remain a strategic pressure point for suppliers. This draft should be treated as an AI infrastructure memory signal rather than a direct pricing report. The source provides concrete HBM figures for TPU v8 systems, which makes it useful for RamTrend's view of future server memory demand.
AI Infrastructure · May 4, 2026
Google TPU v8 Designs Put HBM Capacity at Center of AI Scaling
Google's next TPU generation highlights how much AI accelerator roadmaps now depend on high-bandwidth memory. StorageReview reports that TPU v8t superpods carry petabyte-scale HBM capacity, while v8i inference chips pair large HBM pools with on-chip SRAM for low-latency serving.
Price impact: 5Direction: upSource: StorageReview
GoogleNVIDIAHBMHBM4HBM4EHBM3EDRAMAI Infrastructure
Original sourceBack to news archive