Nvidia CEO Jensen Huang said at GTC 2026 that AI has shifted from an early phase centered on model training to an era defined by inference and agent computing. To meet growing inference demand, Nvidia integrated Groq, a strategic acquisition, and launched the Groq 3 LPU Rack, a token accelerator designed for ultra-low-latency inference. Huang also announced that the LPU chip is manufactured by Samsung Electronics.
Explainer: Why Nvidia’s Groq LPU runs on Samsung silicon – Groq’s scale and inference strategy
23 Mar