NVIDIA Shows More AI Infrastructure at GTC 2025: ‘Every Single Layer of Computing Has Been Transformed' | eWeek

NVIDIA Shows More AI Infrastructure at GTC 2025: ‘Every Single Layer of Computing Has Been Transformed’

NVIDIA CEO Jensen Huang.

Image: NVIDIA

Written By
Megan Crouse
Megan Crouse
Mar 18, 2025
4 minute read
eWeek content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More

NVIDIA CEO and cofounder Jensen Huang took to the stage at the NVIDIA GTC conference in San Jose on March 18 to show the upcoming roadmap for the company, which has thrived over the last few years by providing infrastructure for generative AI.

“Every single layer of computing has been transformed by the arrival of generative AI,” Huang said.

NVIDIA announced two new chips for running AI, a unifying operating system for “AI factory” data centers, and a reasoning platform for robotics.

“Where we had a retrieval computing model, we now have a generative computing model,” he said.

Vera Rubin and Blackwell Ultra are NVIDIA’s next chips

NVIDIA’s next-generation GPU architecture will be Vera Rubin, named after the astronomer who discovered dark matter. Vera Rubin will be available in the second half of 2026, and will provide 3.3 times the performance of a Blackwell Ultra in a similar rack setup. Everything is new except the chassis, Huang said Vera Rubin will be followed in the second half of 2027 by an enhanced version: Rubin Ultra. NVIDIA stated it offers 14x the performance of Blackwell Ultra.

NVIDIA will “gracefully glide” into production of the tuned version of the Blackwell chip later this year, Huang said. Blackwell Ultra will have 1.5 more FLOPs, 1.5 times more memory, and two times more bandwidth than the original Blackwell.

The Vera Rubin will be available in late 2026.
The Vera Rubin will be available in late 2026. Image: NVIDIA

The Vera Rubin will be available in late 2026. Image: NVIDIA

The next-generation NVIDIA chips will be named Feynman, after the physicist Richard Feynman.

Blackwell Ultra AI Factory built out with Blackwell Ultra

In order to provide the backbone for the massive number of generative AI workloads, including agentic AI, being processed in data centers across the globe today, NVIDIA unified several technologies, including Blackwell Ultra, into its AI factory platform. It includes the Blackwell Ultra with the NVIDIA GB300 NVL72 rack-scale solution and HGX B300 NVL16. Inside, 72 Blackwell Ultra GPUs and 36 ArmNeoverse-based NVIDIA Grace CPUs are linked together. It also links together the NVIDIA Spectrum-X Ethernet networking platform and NVIDIA Quantum-X800 InfiniBand platform.

The Blackwell Ultra AI factory will be available after the Blackwell Ultra chip is in the second half of 2025.

Huang framed NVIDIA’s AI infrastructure projects as “scaling up” to “scale out,” allowing for great efficiency and therefore greater revenue for its customers. NVIDIA partnered with most of the major cloud providers, including Microsoft Azure and Google, to develop the platform. 

NVIDIA also announced a Quantum-X photonics switch, which has the Spectrum-X Photonics Ethernet and Quantum-X Photonics InfiniBand packaged inside. Quantum-X integrated silicon is coming in the second half of 2026.

Advertisement

Introducing Dynamo, the “operating system” of the AI factory

Another aspect of the Blackwell AI factory will be Dynamo, open-source inference software that Huang called the “operating system” of the AI factory concept. It replaces the NVIDIA Triton Inference Server for orchestrating inference for reasoning AI and maximizing revenue as measured against the cost of generating tokens. (Tokens are the chunks of information generative AI puts together to make content, such as symbols, letters, or words.) It can be used to accelerate the use of AI inference at AWS, Meta, Microsoft Azure, and other major cloud platforms.

GROOT N1 is a foundation model that teaches robots to walk

Huang also introduced Isaac Groot N1, a foundation model for generative AI in robotics. It employs a dual system modeled after the human brain, with one slow-thinking system that makes a plan and another fast-thinking system that carries out actions. It is an effort to solve three problems – training data, model architecture, and scaling – that face efforts to make robots adaptable enough to work in factories or do chores.

GROOT N1 is “for very fine grain, rigid and soft bodies, designed for being able to train tactile feedback and fine motor skills and actuator controls,” Huang said.

Groot N1 is open source.

NVIDIA partnered with GM on autonomous driving

GM will partner with NVIDIA to build its self-driving car fleet. “The time for autonomous vehicles has arrived,” Huang said.

AI will be used in manufacturing, enterprise, and in the vehicles. A product called NVIDIA Halos provides in-chip safety systems for self-driving vehicles. In Halos, “student” and “teacher” models are used to tune the AI for real-world situations.

GM will also use NVIDIA Cosmos, the physical AI foundation model platform.

NVIDIA’s stock fell after the GTC keynote

NVIDIA stock fell after the keynote, following a pattern of deceasing confidence in the tech sector’s finances. The Chinese AI company DeepSeek’s reveal of its reasoning model R1 heralded an era of uncertainty in Silicon Valley as pundits wait nervously for the AI bubble to burst or for Chinese rivals to overtake U.S. ambitions. According to The Information, Amazon offered discounts on its own AI chips to try to undercut NVIDIA.

On the other hand, NVIDIA saw record-breaking revenue in Q4 2024.

Megan Crouse

Megan Crouse has a decade of experience in business-to-business news and feature writing, including as first a writer and then the editor of Manufacturing.net. Her news and feature stories have appeared in Military & Aerospace Electronics, Fierce Wireless, TechRepublic, and eWeek. She copyedited cybersecurity news and features at Security Intelligence. She holds a degree in English Literature and minored in Creative Writing at Fairleigh Dickinson University.

eWeek Logo

eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site's focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.