GTC 2026
Jensen Huang Declares “The Age of Inference” at GTC, Unveils Rubin Architecture and Predicts $1 Trillion AI Compute Demand
NVIDIA’s CEO keynoted the biggest AI infrastructure event of the year at SAP Center, announcing DLSS 5, the Rubin next-generation GPU platform, and a sweeping vision for inference-era computing that he says will reshape every industry on Earth.
Jensen Huang strode onto the stage at San Jose’s SAP Center on Monday morning in his trademark leather jacket, but the message he delivered was anything but routine. In a keynote that stretched past two hours and drew thousands of developers, researchers, and enterprise executives, the NVIDIA CEO declared that the AI industry has crossed a decisive threshold — from the era of training to the era of inference. “We’ve reached an inflection point,” Huang told the audience, his voice carrying the conviction of someone who has watched his company’s market capitalization climb past $3 trillion on the back of exactly this bet. He predicted that cumulative demand for AI compute would exceed $1 trillion through 2027, driven not by a handful of hyperscalers training frontier models but by millions of enterprises deploying AI agents, reasoning systems, and real-time decision engines at the edge and in the cloud.
The centerpiece hardware announcement was Rubin, NVIDIA’s next-generation GPU architecture and the successor to the Blackwell platform that has dominated data center AI for the past eighteen months. Huang offered architectural details that sent ripples through the technical audience: Rubin will feature a new interconnect fabric designed for multi-chip inference clusters, dramatically improved memory bandwidth for serving large language models, and native support for mixture-of-experts routing at the hardware level. Alongside Rubin, Huang unveiled DLSS 5, the latest iteration of NVIDIA’s AI-powered graphics upscaling technology, which he described as leveraging a dedicated neural rendering pipeline that can reconstruct photorealistic frames from as little as one-eighth of the natively rendered pixels. For gamers and creative professionals, DLSS 5 promises a generational leap in visual fidelity at a fraction of the traditional compute cost.
But it was the enterprise partnerships that signaled the broader strategic shift. Huang announced expanded collaborations with SAP, ServiceNow, Snowflake, and a roster of Fortune 500 companies that are embedding NVIDIA’s inference stack directly into their production workflows. The message was unmistakable: NVIDIA is no longer content to sell shovels in the AI gold rush. It intends to be the operating system of the inference economy — the layer between raw silicon and the intelligent applications that enterprises are racing to deploy. For an industry that has spent the past three years fixated on who can train the biggest model, GTC 2026 marked a decisive pivot toward who can serve the most intelligence, fastest, at the lowest cost.