Sony Pixel Power calrec Sony

NVIDIA Advances Robot Learning and Humanoid Development With New AI and Simulation Tools

06/11/2024

www.1x.tech

Robotics developers can greatly accelerate their work on AI-enabled robots, including humanoids, using new AI and simulation tools and workflows that NVIDIA revealed this week at the Conference for Robot Learning (CoRL) in Munich, Germany.

The lineup includes the general availability of the NVIDIA Isaac Lab robot learning framework; six new humanoid robot learning workflows for Project GR00T, an initiative to accelerate humanoid robot development; and new world-model development tools for video data curation and processing, including the NVIDIA Cosmos tokenizer and NVIDIA NeMo Curator for video processing.

The open-source Cosmos tokenizer provides robotics developers superior visual tokenization by breaking down images and videos into high-quality tokens with exceptionally high compression rates. It runs up to 12x faster than current tokenizers, while NeMo Curator provides video processing curation up to 7x faster than unoptimized pipelines.

Also timed with CoRL, NVIDIA presented 23 papers and nine workshops related to robot learning and released training and workflow guides for developers. Further, Hugging Face and NVIDIA announced they're collaborating to accelerate open-source robotics research with LeRobot, NVIDIA Isaac Lab and NVIDIA Jetson for the developer community.

Accelerating Robot Development With Isaac Lab NVIDIA Isaac Lab is an open-source, robot learning framework built on NVIDIA Omniverse, a platform for developing OpenUSD applications for industrial digitalization and physical AI simulation.

Developers can use Isaac Lab to train robot policies at scale. This open-source unified robot learning framework applies to any embodiment - from humanoids to quadrupeds to collaborative robots - to handle increasingly complex movements and interactions.

Leading commercial robot makers, robotics application developers and robotics research entities around the world are adopting Isaac Lab, including 1X, Agility Robotics, The AI Institute, Berkeley Humanoid, Boston Dynamics, Field AI, Fourier, Galbot, Mentee Robotics, Skild AI, Swiss-Mile, Unitree Robotics and XPENG Robotics.

Project GR00T: Foundations for General-Purpose Humanoid Robots Building advanced humanoids is extremely difficult, demanding multilayer technological and interdisciplinary approaches to make the robots perceive, move and learn skills effectively for human-robot and robot-environment interactions.

Project GR00T is an initiative to develop accelerated libraries, foundation models and data pipelines to accelerate the global humanoid robot developer ecosystem.

Six new Project GR00T workflows provide humanoid developers with blueprints to realize the most challenging humanoid robot capabilities. They include:

GR00T-Gen for building generative AI-powered, OpenUSD-based 3D environments

GR00T-Mimic for robot motion and trajectory generation

GR00T-Dexterity for robot dexterous manipulation

GR00T-Control for whole-body control

GR00T-Mobility for robot locomotion and navigation

GR00T-Perception for multimodal sensing

Humanoid robots are the next wave of embodied AI, said Jim Fan, senior research manager of embodied AI at NVIDIA. NVIDIA research and engineering teams are collaborating across the company and our developer ecosystem to build Project GR00T to help advance the progress and development of global humanoid robot developers.

New Development Tools for World Model Builders Today, robot developers are building world models - AI representations of the world that can predict how objects and environments respond to a robot's actions. Building these world models is incredibly compute- and data-intensive, with models requiring thousands of hours of real-world, curated image or video data.

NVIDIA Cosmos tokenizers provide efficient, high-quality encoding and decoding to simplify the development of these world models. They set a new standard of minimal distortion and temporal instability, enabling high-quality video and image reconstructions.

Providing high-quality compression and up to 12x faster visual reconstruction, the Cosmos tokenizer paves the path for scalable, robust and efficient development of generative applications across a broad spectrum of visual domains.

1X, a humanoid robot company, has updated the 1X World Model Challenge dataset to use the Cosmos tokenizer.

NVIDIA Cosmos tokenizer achieves really high temporal and spatial compression of our data while still retaining visual fidelity, said Eric Jang, vice president of AI at 1X Technologies. This allows us to train world models with long horizon video generation in an even more compute-efficient manner.

Other humanoid and general-purpose robot developers, including XPENG Robotics and Hillbot, are developing with the NVIDIA Cosmos tokenizer to manage high-resolution images and videos.

NeMo Curator now includes a video processing pipeline. This enables robot developers to improve their world-model accuracy by processing large-scale text, image and video data.

Curating video data poses challenges due to its massive size, requiring scalable pipelines and efficient orchestration for load balancing across GPUs. Additionally, models for filtering, captioning and embedding need optimization to maximize throughput.

NeMo Curator overcomes these challenges by streamlining data curation with automatic pipeline orchestration, reducing processing time significantly. It supports linear scaling across multi-node, multi-GPU systems, efficiently handling over 100 petabytes of data. This simplifies AI development, reduces costs and accelerates time to market.

Advancing the Robot Learning Community at CoRL The nearly two dozen research papers the NVIDIA robotics team released with CoRL cover breakthroughs in integrating vision language models for improved environmental understanding and task execution, temporal robot navigati
LINK: https://blogs.nvidia.com/blog/robot-learning-humanoid-development/...
See more stories from nvidia

North America Stories

03/04/2026

TelevisaUnivision Signs New Nielsen Media Intelligence Deal

Share Copy link Facebook X Linkedin Bluesky Email...

03/04/2026

The WNET Group, JIB Launch NHK World-Japan in New York

Share Copy link Facebook X Linkedin Bluesky Email...

03/04/2026

NAB Leadership Foundation Welcomes New Board Members

Share Copy link Facebook X Linkedin Bluesky Email...

03/04/2026

EverPass Media Expands Distribution Deal with Netflix

Share Copy link Facebook X Linkedin Bluesky Email...

03/04/2026

Versant Acquires AI-Data Platform StockStory

Share Copy link Facebook X Linkedin Bluesky Email...

03/04/2026

CVP Grows European Footprint with Strategic Expansion in...

CVP, one of Europe's leading suppliers of professional video and broadcast solutions, today announces the launch of its new German operation and the formati...

03/04/2026

MRMC Announces Appointment of Chief Operating Officer

Mark Roberts Motion Control (MRMC) today announces the appointment of Nick Barthee as Chief Operating Officer, strengthening its leadership as the company conti...

03/04/2026

Net Insight Introduces Programmable Trust Boundaries for...

Net Insight introduces programmable Trust Boundaries that make live media interconnection predictable as traffic moves between facilities, networks and cloud en...

03/04/2026

Winning in the new media economy: Avid showcases AI-powered, connected intelligence to unlock media value at NAB Show 2026

Winning in the new media economy: Avid showcases AI-powered, connected intellige...

03/04/2026

NUGEN Audio CEO Dr. Paul Tapper to Lead Presentation About Dialog Intelligibility and Loudness at NAB 2026

NUGEN Audio CEO Dr. Paul Tapper to Lead Presentation About Dialog Intelligibilit...

03/04/2026

NAB Show 2026: PlayBox Neo Highlights Workflow, Security, and IP Advances

NAB Show 2026: PlayBox Neo Highlights Workflow, Security, and IP Advances Brie Clayton April 2, 2026 0 Comments PlayBox Neo will showcase the latest i...

03/04/2026

For Taku Hirano, Everything Is Connected

For Taku Hirano, Everything Is Connected From touring and composition to teaching and instrument design, the in-demand percussionist sees it all as one body o...

03/04/2026

Berklee Honors Humberto Ramirez with Master of Latin Music Award

Berklee Honors Humberto Ramirez with Master of Latin Music Award The alumnus and acclaimed trumpeter is honored for his influence as a performer, composer, an...

02/04/2026

HBO and NFL Films Announce Hard Knocks: Training Camp with the Seattle Seahawks, Debuting August 11

HBO and NFL Films have announced Hard Knocks: Training Camp with the Seattle Sea...

02/04/2026

NAB 2026: Haivision Unveils Makito ONE Video Transport Platform

Haivision has announced the Makito ONE, a single-blade video encoding and decoding platform, at NAB Show 2026. The platform combines dual-channel video encoding...

02/04/2026

NAB 2026: Telestream Introduces UP.Lens Cloud-Based Multiviewer and Monitoring Service

Telestream has introduced UP.Lens, a cloud-based multiviewer and monitoring serv...

02/04/2026

NAB 2026: MRMC to Showcase Robotic Camera Technology and Mark 60th Anniversary

Mark Roberts Motion Control (MRMC) will exhibit at NAB Show 2026 (Booth C5220, April 19-22, Las Vegas Convention Center), marking the company's 60th anniver...

02/04/2026

NAB 2026: Net Insight Introduces Programmable Trust Boundaries for Live Media Interconnection

Net Insight has introduced programmable Trust Boundaries, a feature integrated i...

02/04/2026

NAB 2026: Bitmovin Adds SGAI Support to Playback Products

Bitmovin has announced support for SGAI (Server-Guided Ad Insertion) in its playback products, using HLS interstitials. SGAI combines elements of client-side an...

02/04/2026

Binghamton University Athletics Adds Riedel SimplyLive RiMotion R12 for Student-Run Productions

Riedel Communications' SimplyLive RiMotion R12 replay system is supporting B...

02/04/2026

NAB 2026: LTN and Ateme Announce Integration of Video Processing with IP Transport

LTN, a managed IP video transport company, and Ateme, a video compression and de...

02/04/2026

TDF Expands Channel Capacity on Terrestrial Broadcast Network with Harmonic

Harmonic has announced that TDF, a broadcast infrastructure operator in France, has deployed Harmonic's XOS Advanced Media Processor and ProStream X Video S...

02/04/2026

United Rugby Championship Reports First-Year Results with Eluvio Streaming Platform

Eluvio and the United Rugby Championship (URC) have announced first-year results...

02/04/2026

Kansas City Current and Scripps Sports Announce ION as Broadcast Home of 2026 Teal Rising Cup

The Kansas City Current and Scripps Sports have announced that ION will broadcas...

02/04/2026

ESPN Announces Courtside Alt-Cast for Women's Final Four

ESPN will debut Courtside at the Women's Final Four Presented by AT&T, an alt-cast airing Friday, April 3 at 7 p.m. and 9:30 p.m. ET on ESPN2, and Sunday, A...

02/04/2026

PAMA and Shure Accept Applications for 6th Annual Mark Brunner Professional Audio Scholarship

The Professional Audio Manufacturers Alliance (PAMA) and Shure Incorporated are ...

02/04/2026

ESPN's MegaCast Coverage of 2026 NCAA Women's Final Four Begins Friday, April 3 in Phoenix

ESPN's MegaCast Coverage of 2026 NCAA Women's Final Four Begins Friday, ...

02/04/2026

DAZN Launches Playmakers Creator Program

DAZN has announced the launch of DAZN Playmakers, a global influencer program designed to build a network of sports content creators. The programme will give cr...

02/04/2026

NFL Network Unveils New Production Ops Leadership Structure as ESPN Takes Over

Tony Cole, Jessica Lee shift into new roles reporting to ESPN SVP/Content Operations Chris Calcinari....

02/04/2026

TNT Sports and CBS Sports to Reunite Michigan's Iconic Fab Five for Special NCAA Men's Final Four Altcast on truTV & HBO Max

Michigan's Fab Five will reunite for an alternate presentation of the Mich...

02/04/2026

Coming of Age: ESPN and NHL's Inside Out Classic Marks Another Step Forward for the Animated Alternative Broadcast

Real-time tracking, virtual production, and Pixar storytelling converge for Apri...

02/04/2026

Streaming Around the Moon: NASA+ Goes Live From Historic Artemis II Mission

NASA's long-awaited Artemis II mission has launched four astronauts on a 10-day journey around the moon, marking the first manned launch toward the moon sin...

02/04/2026

Release Rundown: What to Watch in April, From Bunnylovr to Omaha

(L-R) Molly Belle Wright, Wyatt Solis, and John Magaro appear in Omaha by Cole Webley, an official selection of the 2025 Sundance Film Festival. (Photo courte...

02/04/2026

L3Harris Powers First Crewed Mission Around the Moon in 50 Years

L3Harris has successfully powered the historic launch of the Artemis II mission, providing propulsion and avionics....

02/04/2026

Scripps Completes Sale of WRTV to Circle City Broadcasting

Share Copy link Facebook X Linkedin Bluesky Email...

02/04/2026

GoVertical! AiDi Powers Real-Time 9:16 Autocropping for I...

Already deployed extensively by NBC Sports, FOR-A Corporation will demonstrate GoVertical! AiDi, the real-time 9:16 autocropping feature of viztrick AiDi, durin...

02/04/2026

Elite Media Technologies Selects Interra Systems BATON Fi...

Interra Systems, a provider of end-to-end quality assurance solutions for the digital media industry, announced that Elite Media Technologies has selected its B...

02/04/2026

TDF Expands Broadcast Channel Lineup with Harmonic

Harmonic's Media Processing Solutions Maximize Bandwidth Efficiency for Terrestrial Broadcast Delivery Harmonic (NASDAQ: HLIT) today announced that TDF, a...

02/04/2026

FOR-A's Software-Defined, AI-Powered Development Advances...

NBC Sports Deploys viztrick AiDi to Stream Live Events in 9:16 Mobile-First Formats with Auto Tracking, Development Signals Strategic Shift for FOR-A Long reco...

02/04/2026

Evergent showcases innovations in sports streaming and mo...

Evergent will showcase new innovations in subscriber lifecycle management and monetization at NAB Show 2026 (Las Vegas, April 18 22), including: New advances i...

02/04/2026

Binghamton University Strengthens Student Run Productions...

Riedel Communications is proud to be part of Binghamton University, State University of New York, Athletics' milestone year, celebrating the university'...

02/04/2026

Techex and Encompass Launch Industry-Leading Cloud-Based...

Encompass Digital Media and Techex have today announced new, fully managed, cloud-native Master Control services designed to meet the growing operational demand...

02/04/2026

Winning in the new media economy - Avid debuts fully avai...

Avid today announced it will showcase new innovations designed to help media companies win in the new media economy at NAB Show 2026 (April 18 22, Las Vegas Co...

02/04/2026

PlayBox Neo reinforces MIMO Tech with new Playout capabil...

PlayBox Neo helps AIS PLAY kick-off premier football content direct to fans PlayBox Neo has provided MIMO Tech with a brand-new major installation to extend it...

02/04/2026

Globo transitions primary distribution to SRT over IP wit...

Globo has transitioned its primary content distribution to Secure Reliable Transport over a fully IP-based managed backbone using Synamedia's Quortex PowerV...

02/04/2026

Nexstar Says Pausing Tegna Merger Creates 'Impossible' Challenges

Share Copy link Facebook X Linkedin Bluesky Email...

02/04/2026

FCC Launches Efforts to Strengthen U.S. Drone Ecosystem

Share Copy link Facebook X Linkedin Bluesky Email...

02/04/2026

WAPA+ to Launch on Dish, DishLatino, Sling TV and Sling Freestream

Share Copy link Facebook X Linkedin Bluesky Email...

02/04/2026

Student Spotlight: Al-Fadl Salem

Student Spotlight: Al-Fadl Salem The Danish singer recently performed for the queen of Denmark. April 1, 2026 By Editorial Staff Image by Junia Morrow Wh...

02/04/2026

Taku Hirano's Career Is Defined by Identity

Taku Hirano's Career Is Defined by Identity Whether he's performing, composing, teaching, or developing instruments, the do-it-all percussionist sees ...