NVIDIA Research Unlocks Advanced Grasping, Smarter Autonomous Driving and Agent Training at Scale

03/06/2026

What makes a robot gripper useful isn't that it can pick up one object - it's that it can pick up the next one, and the one after that, with a tool it's never held before.

What makes an autonomous vehicle system safe isn't just that it can reason through a situation - it's that it can do so quickly enough on the hardware actually installed in the car.

What makes a virtual agent capable is exposure to as many different environments as possible before it faces the real world.

At this year's Computer Vision and Pattern Recognition (CVPR) conference, NVIDIA Research is presenting three papers that address each of these challenges - and share a common theme: training at scale creates systems that generalize across diverse applications.

The three papers cover different challenges in physical AI research:

GraspGen-X, the first foundation model for zero-shot grasping, was trained on billions of simulated grasps to work with any gripper it's shown.

LCDrive introduces a model that replaces expensive text-based reasoning with compact latent representations, letting autonomous vehicles think faster on embedded hardware.

NitroGen is a generalized gameplay AI foundation model that harnesses the NVIDIA Isaac GR00T robot foundation model architecture to help train embodied agents in virtual environments across tens of thousands of hours of interaction.

NVIDIA also unveiled at CVPR new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems.

NitroGen and another NVIDIA-authored paper, PixelDIT, were named best paper finalists at the conference - an accolade given to just 15 of over 4,000 accepted papers at CVPR.

The First Foundation Model for Grasping Most AI systems for robotic grasping are specialists.

A vision-language-action policy trained for a two-finger gripper only learns to grasp with those two fingers. Similarly, a policy for dextrous grasping will only work for the bespoke multi-fingered gripper it's trained on. For every new embodiment, the process typically needs to be repeated - requiring new training data, fine-tuning and validation. This constraint means most robotics companies pick a gripper, train for it and stick with it.

GraspGen-X is the first foundation model for grasping built to eliminate this bottleneck.

Like a large language model that can apply its understanding of language to a new task without retraining, GraspGen-X applies its understanding of geometry and contact to any robotic gripper it encounters. Given the geometry of a new gripper and an unknown object it's never seen before, the model generates reliable grasp pose proposals to enable the robot to grasp the object.

https://blogs.nvidia.com/wp-content/uploads/2026/06/GraspGenX.mp4

To get there, the researchers needed a dataset that's impossible to collect in the real world at scale. They generated 2 billion simulated grasps across thousands of object shapes and synthetic gripper configurations, spanning the diversity of form factors a deployed robot might encounter.

For robot developers, this foundation model eliminates the need for per-gripper training cycles and can be applied out of the box for several commonly used grippers. GraspGenX can be used in conjunction with curoboV2, a new CUDA-accelerated motion planning library, to achieve these grasp poses in unknown environments.

Building on the GraspGen research foundation, another paper, Grasp-MPC - presented at ICRA 2026 - advances the next step in the pipeline: moving from grasp generation to closed-loop grasp execution.

Teaching Autonomous Vehicles to Think Faster In recent years, researchers have found that letting an AI reason - generating intermediate thinking steps before committing to an answer - reliably improves its decision-making.

For autonomous vehicles, the challenge is doing that reasoning on the hardware inside an actual vehicle. Text-based chain-of-thought reasoning generates words, and every word is a token that takes time to produce. On the processor running inside a car, token count is a real constraint on how fast the system can respond.

LCDrive tackles this problem by replacing words with compressed latent representations.

Instead of generating human-readable reasoning steps, the system thinks in a compact latent space - states that capture spatial information rather than producing text. The architecture alternates between two kinds of thinking: proposing candidate actions, then predicting what the world will look like if those actions are taken.

It uses that predicted world state to refine its next step. It's the same reasoning loop - just in a more computationally efficient form than natural language.

The result: comparable output trajectory quality to text-based reasoning, using roughly half the tokens.

The model was built on NVIDIA Alpamayo and trained using supervision derived from existing vehicle data.

Embodied Agents Trained in Virtual Worlds Isaac GR00T - NVIDIA's open foundation model for humanoid robots - is built on a simple principle: expose a model to enough diverse situations, and it will generalize to ones it hasn't seen.

NitroGen extends that principle to virtual environments, using the GR00T architecture to train a foundation model for embodied agents across a breadth of virtual worlds.

Video games offer something that's hard to build from scratch: structured, varied worlds with defined goals and well-specified success conditions. They're high-quality training environments, available at scale.

NitroGen treats them that way - as a training ground for agents that will eventually be trained to handle novel real- or simulated-world situations, like powering a robot that helps with housework based on broad instructions such as, Put these items away in the

LINK:	https://blogs.nvidia.com/blog/cvpr-research-grasping-driving-agent-tra...
	See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

07/10/2026

Dalet Flex LTS Delivers Smarter Media Operations from Ingest to Distribution

Dalet, a leading technology and service provider for media-rich organizations, today announced the latest Long-Term Supported (LTS) release of Dalet Flex. Build...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

18/07/2026

More Than Just a Game: FIFA World Cups Lance Brass Breaks Down Stadium Production & Entertainment

Topics include pre-match ceremonies, live performances, the tournament's fir...

18/07/2026

As the Final Approaches, FIFA and HBS Take Stock of a World Cup That Rewrote the Production Playbook

When FIFA and HBS set out to produce the 2026 FIFA World Cup, the numbers alone ...

18/07/2026

IK Multimedia add Brown Panel Signature Collection to TONEX

Captures nine sought-after Fender amps IK Multimedia's latest TONEX expansion captures a selection of nine rare Brown Panel' Fender amps that were ...

18/07/2026

Frap Tools update the Magnolia

Latest batch ships alongside firmware update Since being unveiled at Superbooth 2025, Frap Tools' debut polysynth has been met with widespread praise, a...

18/07/2026

Netflix Viewing Hit Record 97 Billion Hours in First Half of 2026

Share Copy link Facebook X Linkedin Bluesky Email...

18/07/2026

YouTube's Creative Ecosystem Contributed $60 Billion to U.S. GDP

Share Copy link Facebook X Linkedin Bluesky Email...

17/07/2026

SVG GameDay, Ep. 24: Mercedes-Benz Stadiums Cole Gallagher - Supporting Shows in the ATL

In-venue and creative video staffers at the professional and collegiate level ha...

17/07/2026

Brooklyn Bowl Williamsburg Stagehands Vote to Join IATSE Local 4

Production workers at Brooklyn Bowl's Williamsburg location voted 15-1 to join IATSE Local 4. The bargaining unit covers 24 production workers at the venue,...

17/07/2026

DAZN and ADI Predictstreet Announce Exclusive Global Prediction Market Partnership

DAZN and ADI Predictstreet have announced an exclusive global strategic partners...

17/07/2026

Zixi and Comcast Technology Solutions Announce Integration for C-Band Satellite Replacement

Zixi and Comcast Technology Solutions (CTS) have announced a strategic integrati...

17/07/2026

Professional Fighters League Announces Multi-Year Partnership with ESPN in Brazil

Professional Fighters League (PFL) has announced a multi-year partnership with E...

17/07/2026

Spectrum Business Launches Spectrum TV Control Pro for Multi-Screen Venue Management

Spectrum Business has announced Spectrum TV Control Pro, a centralized app-based...

17/07/2026

Clark Wire and Cable Appoints Rick Fernandez as Latin American Representative

Clark Wire and Cable has announced that Rick Fernandez, Managing Director of Axxion Consulting, will serve as Independent Manufacturers Representative for Centr...

17/07/2026

TikTok, NBA, and WNBA Announce Multi-Year Global Content Partnership

TikTok, the NBA, and the WNBA have announced a multi-year global content partnership covering highlights distribution, creator access to marquee events, live-ga...

17/07/2026

MSG Entertainment Files Defamation Lawsuit Against Wired Over July 9 Article

Company alleges article contained false and misleading claims regarding customer data...

17/07/2026

Ratings Roundup: Argentina-England Semifinal Breaks Records for FOX; MLB All-Star Game Is Most Watched Since 2018

Ratings Roundup is a rundown of recent rating news and is derived from press rel...

17/07/2026

American Pachuco Gives a Legend of Stage and Screen His Due

(L-R) Edward James Olmos, Luis Valdez, Lou Diamond Phillips and Lupe Valdez attend American Pachuco: The Legend Of Luis Valdez Premiere during the 2026 Sundan...

17/07/2026

Sonuscore introduce Elysion Elements

Full engine access with reduced soundset Sonuscore have steadily been introducing a selection of reduced-cost and free versions of their flagship products r...

17/07/2026

Arturia AstroLab Silver & KeyLab Mk3 Ultra

Two new special-edition models revealed Over the past week, Arturia have launched special-edition versions of both their premium MIDI controller and stage p...

17/07/2026

Dirk Ulrich reacquires Plugin Alliance & Brainworx

Original founder now back at the helm In a message posted on his personal social media accounts, Dirk Ulrich has announced that both Plugin Alliance and Bra...

17/07/2026

Legendary presenter Anton Enus to sign off from SBS World News after 27 years with the network

Legendary presenter Anton Enus to sign off from SBS World News after 27 years wi...

17/07/2026

Millions rise before dawn to watch FIFA World Cup 2026 semi-finals on SBS

Millions rise before dawn to watch FIFA World Cup 2026 semi-finals on SBS 17 July, 2026 Media releases England v Argentina attracted a Total TV reach of a...

17/07/2026

Spectrum Simplifies Control Of Multiple TVs At Bars, Restaurants

Share Copy link Facebook X Linkedin Bluesky Email...

17/07/2026

Foundry Releases SmartRoto for Nuke

Foundry Releases SmartRoto for Nuke Brie Clayton July 17, 2026 0 Comments Spline-based AI powered plugin accelerates time-consuming rotoscoping, helping...

17/07/2026

A Short Documentary About a Giant Pencil Edited with DaVinci Resolve Studio

A Short Documentary About a Giant Pencil Edited with DaVinci Resolve Studio Brie Clayton July 17, 2026 0 Comments SBIFF jury award winner finished fro...

17/07/2026

Wheatstone to Feature Virtual Mixing Platform at IBC 2026

Share Copy link Facebook X Linkedin Bluesky Email...

17/07/2026

Trump Threatens Networks Over Speech Coverage

Share Copy link Facebook X Linkedin Bluesky Email...

17/07/2026

Lightware Sustainability Report Details Product Efficienc...

Lightware has published its second voluntary Sustainability Report, showing how energy efficiency, product longevity and responsible material use are increasing...

17/07/2026

Foundry Unlocks Professional-Grade AI for VFX with Gripta...

Creative software developer Foundry today announced Griptape Enterprise, a new tier of the Griptape AI workflow orchestration platform, extended to meet the str...

17/07/2026

Foundry Releases SmartRoto for Nuke

Creative software developer Foundry today announced the availability of SmartRoto, a new AI-powered plugin upgrade for Nuke, NukeX, Nuke Studio, and Nuke Indie ...

17/07/2026

The Final Release of Blackmagic Fairlight Live is Now Available!

The Final Release of Blackmagic Fairlight Live is Now Available! Brie Clayton July 16, 2026 0 Comments Fairlight Live is a new audio mixer designed fo...

17/07/2026

Leveraging All of Entertainment's IP

Leveraging All of Entertainment's IP Andy Marken July 16, 2026 0 Comments Nobody ever wins the games. Period. There are survivors. There\s no win...

17/07/2026

Few mics are trusted on an antique Stradivarius copy 4099 is

From intimate acoustic settings to large-scale festival stages, her focus has remained constant: preserving the true natural voice of the violin in every perfor...

17/07/2026

RT Radio 1 continues its summer of stories, music, sport and conversation

New and returning series, live festival broadcasts, powerful documentaries and major sporting moments headline RT Radio 1's summer schedule RT Radio 1 is...

17/07/2026

NVIDIA Vera Rubin Maximizes Intelligence per Dollar for Post-Training Workloads - a Key Metric for Agentic AI

Think of a professional athlete. What separates elite performers is what happens...

17/07/2026

Dith, Jacqui and Louise bring hurling fans together ahead of the All-Ireland Finals

Up For The Match - Live From Croke Park this Saturday at 8pm on RT One and RT ...

17/07/2026

DiscoverIreland.ie announced as official show sponsor of The Traitors Ireland series 2

RT Commercial has today announced DiscoverIreland.ie, as sponsor of The Traitor...

16/07/2026

Business Innovation Synergizer: Grants of up to 30,000 for media organisations

Media organisations with tested plans for new products, services or revenue models can now apply for grants of up to 30,000 through the Business Innovation Syn...

16/07/2026

SVG Students To Watch: Kaden Sherman, Albany State University

This recent graduate and Georgia native has turned an early fascination with live production into a growing passion for camera work In the live-sports-video in...

16/07/2026

TNDV Marks First Year of Aspiration 35 Cinematic Production Truck

TNDV is marking the first anniversary of Aspiration 35, a mobile production truck built around ARRI Alexa 35 Live camera systems. Over its first year, the truck...

16/07/2026

ATSC Releases First Major Revision to A/85 Audio Loudness Recommended Practice Since 2013

ATSC has completed a major revision to its A/85 Recommended Practice: Techniques...

16/07/2026

FloSports Named Exclusive Global Media Partner for 2026 CrossFit Games

FloSports has announced an exclusive global media partnership with CrossFit for the 2026 CrossFit Games, presented by Air National Guard, beginning July 21 in S...

16/07/2026

Sennheiser Releases Spectera Firmware 1.4 and Command Button

Sennheiser has released firmware version 1.4 for its Spectera wireless system and announced the Spectera Command Button (Cat. No. 701014). The update adds comma...

16/07/2026

ESPN and Southwestern Athletic Conference Announce Multi-Year Media Rights Agreement Through 2030-31

ESPN and the Southwestern Athletic Conference (SWAC) have announced a multi-year...

16/07/2026

Audio-Technica Releases 20 Series Control Application for Elgato Stream Deck

Audio-Technica has announced the 20 Series Control Application, a Stream Deck plug-in for compatible Audio-Technica USB microphones. The application is compatib...

16/07/2026

SSL Live L550 Plus Consoles Power Audio Production for The Voice France Season 15

The fifteenth season of The Voice - La plus belle voix on TF1 used four Solid St...

View most recent headlines