Nvidia Groq 3 Shows AI Inference Designs Moving Beyond HBM

Price impact: 1Direction: neutralSource: IEEE Spectrum Semiconductors

The IEEE Spectrum item describes Nvidia Groq 3 as an AI inference processor that relies on SRAM integrated in the processor rather than HBM placed beside GPUs. It contrasts that with Rubin, described as having access to 288GB of HBM. For RamTrend, the point is that inference and training may put different pressure on memory architecture. SRAM-focused inference chips can reduce dependence on external HBM for some workloads, while HBM-heavy GPU systems remain central to AI infrastructure.

NvidiaGroqHBMSRAMAI inference

Original source Back to news archive