
What makes a robot gripper useful isn't that it can pick up one object - it's that it can pick up the next one, and the one after that, with a tool it's never held before.
What makes an autonomous vehicle system safe isn't just that it can reason through a situation - it's that it can do so quickly enough on the hardware actually installed in the car.
What makes a virtual agent capable is exposure to as many different environments as possible before it faces the real world.
At this year's Computer Vision and Pattern Recognition (CVPR) conference, NVIDIA Research is presenting three papers that address each of these challenges - and share a common theme: training at scale creates systems that generalize across diverse applications.
The three papers cover different challenges in physical AI research:
GraspGen-X, the first foundation model for zero-shot grasping, was trained on billions of simulated grasps to work with any gripper it's shown.
LCDrive introduces a model that replaces expensive text-based reasoning with compact latent representations, letting autonomous vehicles think faster on embedded hardware.
NitroGen is a generalized gameplay AI foundation model that harnesses the NVIDIA Isaac GR00T robot foundation model architecture to help train embodied agents in virtual environments across tens of thousands of hours of interaction.
NVIDIA also unveiled at CVPR new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems.
The First Foundation Model for Grasping Most AI systems for robotic grasping are specialists.
A vision-language-action policy trained for a two-finger gripper only learns to grasp with those two fingers. Similarly, a policy for dextrous grasping will only work for the bespoke multi-fingered gripper it's trained on. For every new embodiment, the process typically needs to be repeated - requiring new training data, fine-tuning and validation. This constraint means most robotics companies pick a gripper, train for it and stick with it.
GraspGen-X is the first foundation model for grasping built to eliminate this bottleneck.
Like a large language model that can apply its understanding of language to a new task without retraining, GraspGen-X applies its understanding of geometry and contact to any robotic gripper it encounters. Given the geometry of a new gripper and an unknown object it's never seen before, the model generates reliable grasp pose proposals to enable the robot to grasp the object.
https://blogs.nvidia.com/wp-content/uploads/2026/06/GraspGenX.mp4
To get there, the researchers needed a dataset that's impossible to collect in the real world at scale. They generated 2 billion simulated grasps across thousands of object shapes and synthetic gripper configurations, spanning the diversity of form factors a deployed robot might encounter.
For robot developers, this foundation model eliminates the need for per-gripper training cycles and can be applied out of the box for several commonly used grippers. GraspGenX can be used in conjunction with curoboV2, a new CUDA-accelerated motion planning library, to achieve these grasp poses in unknown environments.
Building on the GraspGen research foundation, another paper, Grasp-MPC - presented at ICRA 2026 - advances the next step in the pipeline: moving from grasp generation to closed-loop grasp execution.
Teaching Autonomous Vehicles to Think Faster In recent years, researchers have found that letting an AI reason - generating intermediate thinking steps before committing to an answer - reliably improves its decision-making.
For autonomous vehicles, the challenge is doing that reasoning on the hardware inside an actual vehicle. Text-based chain-of-thought reasoning generates words, and every word is a token that takes time to produce. On the processor running inside a car, token count is a real constraint on how fast the system can respond.
LCDrive tackles this problem by replacing words with compressed latent representations.
Instead of generating human-readable reasoning steps, the system thinks in a compact latent space - states that capture spatial information rather than producing text. The architecture alternates between two kinds of thinking: proposing candidate actions, then predicting what the world will look like if those actions are taken.
It uses that predicted world state to refine its next step. It's the same reasoning loop - just in a more computationally efficient form than natural language.
The result: comparable output trajectory quality to text-based reasoning, using roughly half the tokens.
The model was built on NVIDIA Alpamayo and trained using supervision derived from existing vehicle data.
Embodied Agents Trained in Virtual Worlds Isaac GR00T - NVIDIA's open foundation model for humanoid robots - is built on a simple principle: expose a model to enough diverse situations, and it will generalize to ones it hasn't seen.
NitroGen extends that principle to virtual environments, using the GR00T architecture to train a foundation model for embodied agents across a breadth of virtual worlds.
Video games offer something that's hard to build from scratch: structured, varied worlds with defined goals and well-specified success conditions. They're high-quality training environments, available at scale.
NitroGen treats them that way - as a training ground for agents that will eventually be trained to handle novel real- or simulated-world situations, like powering a robot that helps with housework based on broad instructions such as, Put these items away in the pantry.
Trained across more than 1,000 games and 40,000 hours of interaction using a model based on GR00T, the resulting agents learn to generalize across environments. Th
North America Stories
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
For more than three decades, Re-recording Mixer Andrew Wilson, AMPS, CAS, has helped bring the natural world to the screen with exceptional audio enjoyed by mil...
03/06/2026
Telestream, a global leader in media workflow technologies, will showcase its latest innovations for modern AV production environments at InfoComm 2026 (Booth N...
03/06/2026
DPA Microphones will present a comprehensive portfolio of integrated audio solutions designed to meet the evolving needs of today's professional AV environm...
03/06/2026
Lightware announces the GVN-HC-TX220AP, a new transmitter in the Gemini GVN 1G AV-over-IP family that introduces full-featured USB-C for professional 1Gb AV-ove...
03/06/2026
Evergent, the customer management and monetization leader for streaming and digital subscription businesses, and Minno, the global leader in faith-based content...
03/06/2026
Alfalite, Europe's only LED display manufacturer, has completed a new broadcast installation with the deployment of two UHD Finepix 1.5 MATIX AlfaCOB LED di...
03/06/2026
Broadcast Solutions, a leading systems integrator and provider of innovative solutions for the broadcast media industry, is completing a contract to build eight...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
03/06/2026
Creamsource, known for its tried-and-true Vortex Series of cinematic lighting, has announced the Vortex2 (V2) and Vortex2 Soft (V2S), two compact additions to t...
03/06/2026
Faster, More Flexible AI Matting Fuels Boris FX Silhouette
Jessie Electa Petrov June 2, 2026
0 Comments
The 2026 release helps artists tackle complex ...
03/06/2026
La T l 's National League Hockey Expansion Powered by Blackmagic Design
Brie Clayton June 2, 2026
0 Comments
Blackmagic Videohub 120 120 12G provi...
03/06/2026
ZY Optics to Offer Exclusive First Look at Zone T1 Cine Kit at Cine Gear Expo 20...
03/06/2026
Berklee Alumni St. Vincent and Ruby Plume to Perform Together on St. Vincents 20...
03/06/2026
At CVPR, NVIDIA is unveiling new physical AI agent skills that help researchers ...
03/06/2026
What makes a robot gripper useful isn't that it can pick up one object - it&...
02/06/2026
Tennis Channel has completed a transition from satellite-based distribution to a...
02/06/2026
Daktronics has announced the 2026 High School Video Summit, a two-day educational event for high school educators and student production teams, taking place Jun...
02/06/2026
Fandango will bring Telemundo's live Spanish-language coverage of the FIFA W...
02/06/2026
All Women's Sports Network (AWSN) has announced the live television schedule...
02/06/2026
Spalk, a cloud-based multilingual commentary and production platform, has announced three new partnerships: Ligue 1 (English and Portuguese highlights), Eurolea...
02/06/2026
NHL Network's 2026 Stanley Cup Final coverage began June 1 with NHL Tonight: Stanley Cup Final Media Day from the Carolina Hurricanes' Lenovo Center at ...
02/06/2026
Matrox Video has announced the Maevex MGX Series, a lineup of IPMX-ready video e...
02/06/2026
Marshall Electronics will exhibit at InfoComm 2026 (Booth C7521), showcasing a lineup of 4K and HD compact POV cameras for corporate, education, hospitality, wo...
02/06/2026
The Professional Audio Manufacturers Alliance (PAMA) and Shure Incorporated have...
02/06/2026
MultiDyne Video and Fiber Optic Systems will exhibit at InfoComm 2026 (Booth C50...
02/06/2026
Advanced Systems Group (ASG) has announced the promotion of Joe Marchitto to Western Regional CTO. In his new role, Marchitto will oversee system design across ...
02/06/2026
The NHL has announced that Brothers Osborne will headline a free outdoor concert...
02/06/2026
CP Communications has announced the appointment of its first two female executives in the company's 40-year history. Tabitha Coleman has been named Vice Pre...
02/06/2026
The Lighting Design Group (LDG) has announced the completion of Studio C at Yahoo's headquarters at 770 Broadway in Manhattan. The studio launched April 24 ...
02/06/2026
Zee Entertainment Enterprises Ltd. (Z) has announced a partnership with FIFA to broadcast FIFA World Cup 2026, FIFA World Cup 2030, FIFA Women's World Cup 2...
02/06/2026
Behind The Mic provides a roundup of recent news regarding on-air talent, including new deals, departures, and assignments compiled from press releases and repo...
02/06/2026
The department is broadcasting NCAA Baseball Super Regionals at Plainsman Park this weekend
Broadcast and production crews at Division I institutions are busy ...
02/06/2026
The initiative is extends beyond the league's arenas to team training rinks,...
02/06/2026
Olivia Wilde, Seth Rogen, Pen lope Cruz, and Edward Norton appear in The Invite by Olivia Wilde, an official selection of the 2026 Sundance Film Festival. (Co...
02/06/2026
blueCORE standalone processors headline solutions designed to simplify the trans...
02/06/2026
The TV agency was one of the earliest adopters of Nielsen's local television...
02/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
02/06/2026
Grass Valley today announced that Australian News Channel (ANC), operator of Sky News Australia, has deployed Grass Valley AMPP to transform its newsroom produc...
02/06/2026
Studio Technologies, a leading manufacturer of high-quality audio, video, and fiber-optic solutions, announces its new Model 385 Mic/Intercom Beltpack. The Mode...
02/06/2026
The Riedel Group today announced the appointment of Gudrun Scharler as CEO of Riedel Networks. She succeeds Michael Martens, who has led Riedel Networks since 2...
02/06/2026
More signals, higher quality, and outstanding ingest and streaming flexibility deliver professional results in a small, all-in-one footprint...