
What makes a robot gripper useful isn't that it can pick up one object - it's that it can pick up the next one, and the one after that, with a tool it's never held before.
What makes an autonomous vehicle system safe isn't just that it can reason through a situation - it's that it can do so quickly enough on the hardware actually installed in the car.
What makes a virtual agent capable is exposure to as many different environments as possible before it faces the real world.
At this year's Computer Vision and Pattern Recognition (CVPR) conference, NVIDIA Research is presenting three papers that address each of these challenges - and share a common theme: training at scale creates systems that generalize across diverse applications.
The three papers cover different challenges in physical AI research:
GraspGen-X, the first foundation model for zero-shot grasping, was trained on billions of simulated grasps to work with any gripper it's shown.
LCDrive introduces a model that replaces expensive text-based reasoning with compact latent representations, letting autonomous vehicles think faster on embedded hardware.
NitroGen is a generalized gameplay AI foundation model that harnesses the NVIDIA Isaac GR00T robot foundation model architecture to help train embodied agents in virtual environments across tens of thousands of hours of interaction.
NVIDIA also unveiled at CVPR new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems.
The First Foundation Model for Grasping Most AI systems for robotic grasping are specialists.
A vision-language-action policy trained for a two-finger gripper only learns to grasp with those two fingers. Similarly, a policy for dextrous grasping will only work for the bespoke multi-fingered gripper it's trained on. For every new embodiment, the process typically needs to be repeated - requiring new training data, fine-tuning and validation. This constraint means most robotics companies pick a gripper, train for it and stick with it.
GraspGen-X is the first foundation model for grasping built to eliminate this bottleneck.
Like a large language model that can apply its understanding of language to a new task without retraining, GraspGen-X applies its understanding of geometry and contact to any robotic gripper it encounters. Given the geometry of a new gripper and an unknown object it's never seen before, the model generates reliable grasp pose proposals to enable the robot to grasp the object.
https://blogs.nvidia.com/wp-content/uploads/2026/06/GraspGenX.mp4
To get there, the researchers needed a dataset that's impossible to collect in the real world at scale. They generated 2 billion simulated grasps across thousands of object shapes and synthetic gripper configurations, spanning the diversity of form factors a deployed robot might encounter.
For robot developers, this foundation model eliminates the need for per-gripper training cycles and can be applied out of the box for several commonly used grippers. GraspGenX can be used in conjunction with curoboV2, a new CUDA-accelerated motion planning library, to achieve these grasp poses in unknown environments.
Building on the GraspGen research foundation, another paper, Grasp-MPC - presented at ICRA 2026 - advances the next step in the pipeline: moving from grasp generation to closed-loop grasp execution.
Teaching Autonomous Vehicles to Think Faster In recent years, researchers have found that letting an AI reason - generating intermediate thinking steps before committing to an answer - reliably improves its decision-making.
For autonomous vehicles, the challenge is doing that reasoning on the hardware inside an actual vehicle. Text-based chain-of-thought reasoning generates words, and every word is a token that takes time to produce. On the processor running inside a car, token count is a real constraint on how fast the system can respond.
LCDrive tackles this problem by replacing words with compressed latent representations.
Instead of generating human-readable reasoning steps, the system thinks in a compact latent space - states that capture spatial information rather than producing text. The architecture alternates between two kinds of thinking: proposing candidate actions, then predicting what the world will look like if those actions are taken.
It uses that predicted world state to refine its next step. It's the same reasoning loop - just in a more computationally efficient form than natural language.
The result: comparable output trajectory quality to text-based reasoning, using roughly half the tokens.
The model was built on NVIDIA Alpamayo and trained using supervision derived from existing vehicle data.
Embodied Agents Trained in Virtual Worlds Isaac GR00T - NVIDIA's open foundation model for humanoid robots - is built on a simple principle: expose a model to enough diverse situations, and it will generalize to ones it hasn't seen.
NitroGen extends that principle to virtual environments, using the GR00T architecture to train a foundation model for embodied agents across a breadth of virtual worlds.
Video games offer something that's hard to build from scratch: structured, varied worlds with defined goals and well-specified success conditions. They're high-quality training environments, available at scale.
NitroGen treats them that way - as a training ground for agents that will eventually be trained to handle novel real- or simulated-world situations, like powering a robot that helps with housework based on broad instructions such as, Put these items away in the pantry.
Trained across more than 1,000 games and 40,000 hours of interaction using a model based on GR00T, the resulting agents learn to generalize across environments. Th
More from Nvidia
03/06/2026
At CVPR, NVIDIA is unveiling new physical AI agent skills that help researchers ...
03/06/2026
What makes a robot gripper useful isn't that it can pick up one object - it&...
02/06/2026
The agentic AI moment has arrived, but delivering on its promise requires more t...
02/06/2026
Accelerated computing has revolutionized industrial engineering, compressing sim...
01/06/2026
Agentic AI is getting physical.
At COMPUTEX on Tuesday, NVIDIA announced NVIDIA JetPack 7.2 and NVIDIA NemoClaw support on NVIDIA Jetson.
JetPack 7.2 brings a...
01/06/2026
Financial institutions have spent years building AI: fraud models, credit models...
31/05/2026
Taiwan is home to more than 500 NVIDIA ecosystem partners. More than 1 million N...
31/05/2026
As factories move from isolated automation to plant-wide intelligence, manufacturers need AI systems that can connect live machine signals, quality systems, wor...
31/05/2026
The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet growing demand from ente...
28/05/2026
License to stream, shaken and stirred.
GeForce NOW is dialing up the espionage with the launch of 007 First Light, letting members slip into James Bond's r...
28/05/2026
Robotics is entering a new phase: moving from controlled demos and scripted automation toward generalizable, reliable embodied autonomy in the real world.
At ...
26/05/2026
The shift to agentic AI creates a new CPU requirement for the AI factory: fast cores, massive memory bandwidth and the ability to sustain high performance when ...
21/05/2026
The future of AI is landing in Taipei. At NVIDIA GTC Taipei at COMPUTEX, the world's developers, researchers and industry leaders are converging to dive int...
21/05/2026
The mission begins now.
GeForce NOW is dialing up the action with a blockbuster...
19/05/2026
At this year's Google I/O conference, NVIDIA and Google Cloud are accelerating the work of more than 100,000 developers in the companies' joint develope...
18/05/2026
Agentic AI inference at one-tenth the cost per token with NVIDIA Vera Rubin NVL7...
14/05/2026
Editor's note: The Gaijin single sign-on feature is now up and running.
Dive masks on - Subnautica 2 is making a splash on GeForce NOW day-and-date with la...
13/05/2026
Agentic AI is changing the way users get work done. Following the success of OpenClaw, the community is embracing new open source agentic frameworks. The latest...
13/05/2026
Reinforcement-learning agents - AI systems that learn by trial and error - can c...
12/05/2026
From finance and procurement to supply chain and manufacturing, specialized AI agents are moving into the enterprise systems where business decisions are made, ...
07/05/2026
AI will help build the energy it needs.
That's the case U.S. Energy Secreta...
07/05/2026
Less typing, more tanking.
Faster logins mean more time in the gaming action - and this week provides GeForce NOW members with a smoother path straight into th...
06/05/2026
The race to build the world's most powerful AI factories demands networking ...
05/05/2026
Enterprise AI has learned to generate. It has learned to reason. Now companies are asking the next question: How should AI act?
Early agent systems have shown ...
30/04/2026
Editor's note: This post is part of the Nemotron Labs blog series, which explores how the latest open models, datasets and training techniques help business...
30/04/2026
[Editor's note] The blog has been updated to note that GeForce RTX 5080-powe...
28/04/2026
Editor's note: This post is part of Into the Omniverse, a series focused on how developers, 3D practitioners, and enterprises can transform their workflows ...
28/04/2026
AI agent systems today juggle separate models for vision, speech and language - ...
23/04/2026
AI agents have revolutionized developer workflows, and their next frontier is kn...
23/04/2026
GeForce NOW is doubling down on what matters most: gamers. This week's upgra...
22/04/2026
NVIDIA and Google Cloud have collaborated for more than a decade, co engineering a full stack AI platform that spans every technology layer - from performance o...
20/04/2026
Manufacturing is at an inflection point. Across every major industrial economy, ...
20/04/2026
AI agents are transforming how work gets done across all industries, acceleratin...
16/04/2026
Head straight for orbit with GeForce NOW - no space helmet required.
PRAGMATA,...
15/04/2026
Traditional data centers only stored, retrieved and processed data. In the generative and agentic AI era, these facilities have evolved into AI token factories....
15/04/2026
The NAB Show 2026 trade show, running April 18-22 in Las Vegas, is set to showcase a wave of new features and optimizations for top video editing applications. ...
09/04/2026
A timeless story of grit, faith and rebellion takes center stage as Samson: A Ty...
02/04/2026
Open models are driving a new wave of on-device AI, extending innovation beyond the cloud to everyday devices. As these models advance, their value increasingly...
02/04/2026
No joke - GFN Thursday is skipping the tricks and heading straight into the games. April kicks off with ten new titles, bringing fresh adventures to GeForce NOW...
31/03/2026
CERAWeek - dubbed the Davos of energy - is where policymakers, producers, techno...
26/03/2026
Editor's note: This post is part of Into the Omniverse, a series focused on ...
26/03/2026
That gaming backlog won't clear itself - GeForce NOW is here to help. Stream the latest titles straight from the cloud across a variety of devices.
This we...
25/03/2026
AI is the defining technology of our time, quickly becoming core business infrastructure. It's fueled by a diverse ecosystem of models: large and small, ope...
25/03/2026
At the half-time whistle of the UEFA EURO 2020 round of 16 football match betwee...
24/03/2026
Artificial intelligence has rapidly emerged as one of the most critical workload...
23/03/2026
Autonomous agents mark a new inflection point in AI. Systems are no longer limited to generating responses or reasoning through tasks. They can take action: Age...
19/03/2026
It's a double feature on GFN Thursday. This week, GeForce NOW offers smoother sights in virtual reality (VR) and a sprawling new land to conquer.
Streaming...
17/03/2026
As AI native applications scale to more users, agents and devices, the telecommu...
17/03/2026
The features on social media apps like Snapchat evolve nearly as fast as what...
17/03/2026
The paradigm of consumer computing has revolved around the concept of a personal...