Sony Pixel Power calrec Sony

NVIDIA Advances Robot Learning and Humanoid Development With New AI and Simulation Tools

06/11/2024

www.1x.tech

Robotics developers can greatly accelerate their work on AI-enabled robots, including humanoids, using new AI and simulation tools and workflows that NVIDIA revealed this week at the Conference for Robot Learning (CoRL) in Munich, Germany.

The lineup includes the general availability of the NVIDIA Isaac Lab robot learning framework; six new humanoid robot learning workflows for Project GR00T, an initiative to accelerate humanoid robot development; and new world-model development tools for video data curation and processing, including the NVIDIA Cosmos tokenizer and NVIDIA NeMo Curator for video processing.

The open-source Cosmos tokenizer provides robotics developers superior visual tokenization by breaking down images and videos into high-quality tokens with exceptionally high compression rates. It runs up to 12x faster than current tokenizers, while NeMo Curator provides video processing curation up to 7x faster than unoptimized pipelines.

Also timed with CoRL, NVIDIA presented 23 papers and nine workshops related to robot learning and released training and workflow guides for developers. Further, Hugging Face and NVIDIA announced they're collaborating to accelerate open-source robotics research with LeRobot, NVIDIA Isaac Lab and NVIDIA Jetson for the developer community.

Accelerating Robot Development With Isaac Lab NVIDIA Isaac Lab is an open-source, robot learning framework built on NVIDIA Omniverse, a platform for developing OpenUSD applications for industrial digitalization and physical AI simulation.

Developers can use Isaac Lab to train robot policies at scale. This open-source unified robot learning framework applies to any embodiment - from humanoids to quadrupeds to collaborative robots - to handle increasingly complex movements and interactions.

Leading commercial robot makers, robotics application developers and robotics research entities around the world are adopting Isaac Lab, including 1X, Agility Robotics, The AI Institute, Berkeley Humanoid, Boston Dynamics, Field AI, Fourier, Galbot, Mentee Robotics, Skild AI, Swiss-Mile, Unitree Robotics and XPENG Robotics.

Project GR00T: Foundations for General-Purpose Humanoid Robots Building advanced humanoids is extremely difficult, demanding multilayer technological and interdisciplinary approaches to make the robots perceive, move and learn skills effectively for human-robot and robot-environment interactions.

Project GR00T is an initiative to develop accelerated libraries, foundation models and data pipelines to accelerate the global humanoid robot developer ecosystem.

Six new Project GR00T workflows provide humanoid developers with blueprints to realize the most challenging humanoid robot capabilities. They include:

GR00T-Gen for building generative AI-powered, OpenUSD-based 3D environments

GR00T-Mimic for robot motion and trajectory generation

GR00T-Dexterity for robot dexterous manipulation

GR00T-Control for whole-body control

GR00T-Mobility for robot locomotion and navigation

GR00T-Perception for multimodal sensing

Humanoid robots are the next wave of embodied AI, said Jim Fan, senior research manager of embodied AI at NVIDIA. NVIDIA research and engineering teams are collaborating across the company and our developer ecosystem to build Project GR00T to help advance the progress and development of global humanoid robot developers.

New Development Tools for World Model Builders Today, robot developers are building world models - AI representations of the world that can predict how objects and environments respond to a robot's actions. Building these world models is incredibly compute- and data-intensive, with models requiring thousands of hours of real-world, curated image or video data.

NVIDIA Cosmos tokenizers provide efficient, high-quality encoding and decoding to simplify the development of these world models. They set a new standard of minimal distortion and temporal instability, enabling high-quality video and image reconstructions.

Providing high-quality compression and up to 12x faster visual reconstruction, the Cosmos tokenizer paves the path for scalable, robust and efficient development of generative applications across a broad spectrum of visual domains.

1X, a humanoid robot company, has updated the 1X World Model Challenge dataset to use the Cosmos tokenizer.

NVIDIA Cosmos tokenizer achieves really high temporal and spatial compression of our data while still retaining visual fidelity, said Eric Jang, vice president of AI at 1X Technologies. This allows us to train world models with long horizon video generation in an even more compute-efficient manner.

Other humanoid and general-purpose robot developers, including XPENG Robotics and Hillbot, are developing with the NVIDIA Cosmos tokenizer to manage high-resolution images and videos.

NeMo Curator now includes a video processing pipeline. This enables robot developers to improve their world-model accuracy by processing large-scale text, image and video data.

Curating video data poses challenges due to its massive size, requiring scalable pipelines and efficient orchestration for load balancing across GPUs. Additionally, models for filtering, captioning and embedding need optimization to maximize throughput.

NeMo Curator overcomes these challenges by streamlining data curation with automatic pipeline orchestration, reducing processing time significantly. It supports linear scaling across multi-node, multi-GPU systems, efficiently handling over 100 petabytes of data. This simplifies AI development, reduces costs and accelerates time to market.

Advancing the Robot Learning Community at CoRL The nearly two dozen research papers the NVIDIA robotics team released with CoRL cover breakthroughs in integrating vision language models for improved environmental understanding and task execution, temporal robot navigati
LINK: https://blogs.nvidia.com/blog/robot-learning-humanoid-development/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

30/06/2026

Supreme Court Gives Trump Tight Control over Independent Regulators

Share Copy link Facebook X Linkedin Bluesky Email...

30/06/2026

Rocket Lab to Acquire Iridium in $8 Billion Deal

Share Copy link Facebook X Linkedin Bluesky Email...

30/06/2026

Kyocera AVX Releases New Web-Based Antenna Integration Tool

Share Copy link Facebook X Linkedin Bluesky Email...

30/06/2026

YouTube Shorts Get a Makeover

Share Copy link Facebook X Linkedin Bluesky Email...

30/06/2026

Rise Announces 2026 Worldwide Mentoring Cohorts

Share Copy link Facebook X Linkedin Bluesky Email...

30/06/2026

Other World Computing Launches New Atlas Core Line with 256GB CFExpress 4.0 Type B Memory Card

Other World Computing Launches New Atlas Core Line with 256GB CFExpress 4.0 Type...

30/06/2026

DaVinci Resolve Studio Used for Taketoshi Sado's Perfume Cold Sleep -25 years Document-

DaVinci Resolve Studio Used for Taketoshi Sado's Perfume Cold Sleep -25 year...

30/06/2026

June 29, 2026

Scripps Research scientists demonstrate a faster, cheaper route to making critical drugs using common table sugar New method illustrates how to build a tough ch...

29/06/2026

Op-Ed: Why the 2026 World Cup Is Redefining the Economics of Live Sports Production

By Andy Rayner, CTO, Appear The 2026 FIFA World Cup is the largest football tou...

29/06/2026

Study: Esports Plays Major Role in Gen Z Media Habits, Purchasing Behavior

A new multi-country study from ESL FACEIT Group, Hero Esports, and Niko Partners estimates that 400 million Gen Z consumers regularly engage with esports, under...

29/06/2026

ESPN Sets Multiplatform Plans for America 250 Celebration

ESPN will mark America's 250th anniversary with a series of content initiatives across its linear, digital, and streaming platforms, including a special edi...

29/06/2026

OBSBOT Named Official Camera and Webcam Partner of Esports World Cup 2026

The Esports Foundation has named OBSBOT the Official Camera and Webcam Partner for the Esports World Cup 2026, bringing the company's AI-powered imaging tec...

29/06/2026

Insight Productions Launches Insight Storm, 53-Foot Esports Broadcast Truck

Insight Productions has launched Insight Storm, a 53-foot mobile broadcast unit designed specifically for esports production, live entertainment, and digital-fi...

29/06/2026

Gravity Media Delivers Global Distribution, Streaming Services for World Economic Forum in Dalian

Gravity Media once again provided broadcast, streaming, and content-distribution...

29/06/2026

Wimbledon Introduces AI-Powered Fan Features, Modernized Digital Platforms for 2026 Championships

The All England Lawn Tennis Club and IBM have introduced new and enhanced digita...

29/06/2026

Evolve Dark Matter from Excite Audio

Four-layer instrument aimed at dark electronic music Excite Audio's latest software instrument has been designed with dark drum and bass, atmospheric te...

29/06/2026

Tracktion unleashes Waveform 14 DAW

New AI Assistant, Multi-channel Audio, ARA2 improvements & more Tracktion's DAW software has just received its latest major update, gaining a selection ...

29/06/2026

Focusrite publish 2026 Sustainability Report

Details environmental policies & results The Focusrite Group have just announced that following a long audit process, they have published their 2026 sustain...

29/06/2026

Comcast to Spin Off NBCUniversal, Sky

Share Copy link Facebook X Linkedin Bluesky Email...

29/06/2026

Supreme Court Rules Trump Can Fire FTC Commissioner Without Cause

Share Copy link Facebook X Linkedin Bluesky Email...

29/06/2026

CVP Launches Warranty plus to Protect Professional Equipm...

CVP, one of Europes leading suppliers of professional video and broadcast solutions, has announced the launch of CVP Warranty , a new extended warranty programm...

29/06/2026

TitanTV Hires Mark Hadley as Technology Specialist

Share Copy link Facebook X Linkedin Bluesky Email...

29/06/2026

Fox Sports Delivers Another FIFA Mens World Cup Audience Record

Share Copy link Facebook X Linkedin Bluesky Email...

29/06/2026

Claude Meets Blackwell Ultra: Anthropic's Models Now Run on NVIDIA GB300 in Azure

Anthropic's Claude models in Microsoft Foundry - hosted on Microsoft Azure a...

29/06/2026

Arvato Systems Achieves AWS Cloud Operations Competency

Arvato Systems Achieves AWS Cloud Operations Competency The team behind the AWS Cloud Operations Competency (from left to right: Philipp Hellmich, Johanna Bod...

29/06/2026

Ger Gilroy to join RT to present a Daily Sports Podcast

RT today announced that Ger Gilroy will join the RT Podcasts team to present a daily sports podcast. Launching later this year, the new show will set the spor...

29/06/2026

OUTsurance announced as sponsors of Oliver Callan on RT Radio 1

RT Commercial announced OUTsurance as sponsor of the Oliver Callan show on RT Radio 1 from Wednesday 1 July. Oliver Callan doubles the fun, delivering two ...

29/06/2026

Open Models, Closed Environments: Palantir Brings Secure AI to US Agencies With NVIDIA Nemotron

Showcasing the importance of open source innovation in American AI, Palantir'...

28/06/2026

Softube unveil lower cost Console 1 Compact

Half-size model joins Console 1 line-up Shortly after the release of their new Flow Studio controller, Softube have announced the launch of another new surf...

28/06/2026

EVE Audio introduce EVE Origin

New EXO Series DSP control software announced EVE Audio's EXO Series monitor range has just gained a new software element that provides remote access to...

28/06/2026

Freedman Labs Releases PrepMyMedia and ViewMyAttic for Post Production Professionals - 50% off for COWs!

Freedman Labs Releases PrepMyMedia and ViewMyAttic for Post Production Professio...

27/06/2026

Through Their Lens: What Cinematographer Amy Vincent Saw at the 2026 Directors Lab

There's no doubt that you've seen the world through Amy Vincent's ey...

27/06/2026

UJAM release Retrocraft

Brings together saturation & lo-fi effects Following on from the release of their Voxcraft vocal-processing plug-in, UJAM have announced the launch of Retro...

27/06/2026

A record 4.84 million Australians choose SBS as the Socceroos advance at FIFA World Cup 2026

A record 4.84 million Australians choose SBS as the Socceroos advance at FIFA Wo...

27/06/2026

Apogee CRAS Symphony Mkii Education Feature Blog

Why CRAS Upgraded to Symphony I/O MK II When an audio school runs studios all day, every day, gear doesn't just need to sound good , it needs to survive rea...

27/06/2026

MultiDyne Acquires the Assets of MRMC

Share Copy link Facebook X Linkedin Bluesky Email...

27/06/2026

Spectrum Intelligence Ventures Launches Latis

Share Copy link Facebook X Linkedin Bluesky Email...

27/06/2026

Krotos Video to Sound Plugin Now Available for Adobe Premiere Pro

Krotos Video to Sound Plugin Now Available for Adobe Premiere Pro Brie Clayton June 26, 2026 0 Comments Editors can analyze footage, generate synchron...

27/06/2026

Mirai Media Elevates Digital and Broadcast Productions with Blackmagic Design

Mirai Media Elevates Digital and Broadcast Productions with Blackmagic Design Brie Clayton June 26, 2026 0 Comments Studio uses Ultimatte 12 HD and Po...

27/06/2026

Lutra Cafe & Bakery Opens At American Tobacco Campus

DURHAM, N.C. - JUNE 26, 2026 - Lutra Cafe & Bakery has opened its first brick-and-mortar location at American Tobacco Campus after owner Chris McLaurin operated...

26/06/2026

SVG GameDay, Ep. 21: Minnesota Vikings Allan Wertheimer - Large-Scale Shows in Minny

In-venue and creative video staffers at the professional and collegiate level ha...

26/06/2026

Strike Fighter League Announces Second Online Tournament, Set for July 25 in Las Vegas

Strike Fighter League (SFL), a professional air combat digital sport combining f...

26/06/2026

InfoComm 2026: Wisycom Announces MPR60 Firmware Update, MATF Antenna Matrix, and PFL RFoF Box

Wisycom has announced three new additions to its professional wireless ecosystem...

26/06/2026

Eurovision Services Inaugurates Expanded Master Control Room in Madrid

Eurovision Services inaugurated an expanded Master Control Room (MCR) in Madrid on June 1, 2026, building on a broadcast hub the company has operated in the cit...

26/06/2026

Midco Sports and University of North Dakota Renew Broadcast and Sponsorship Partnership

Midco Sports and the University of North Dakota (UND) have announced a two-year ...

26/06/2026

G&D and VuWall Appoint Vutec as Exclusive South Africa Distributor

Guntermann and Drunck (G&D) and VuWall, both part of the Panoptec Technologies Group, have appointed Vutec (Pty) Ltd as exclusive distributor for their KVM and ...