Sony Pixel Power calrec Sony

NVIDIA Advances Robot Learning and Humanoid Development With New AI and Simulation Tools

06/11/2024

www.1x.tech

Robotics developers can greatly accelerate their work on AI-enabled robots, including humanoids, using new AI and simulation tools and workflows that NVIDIA revealed this week at the Conference for Robot Learning (CoRL) in Munich, Germany.

The lineup includes the general availability of the NVIDIA Isaac Lab robot learning framework; six new humanoid robot learning workflows for Project GR00T, an initiative to accelerate humanoid robot development; and new world-model development tools for video data curation and processing, including the NVIDIA Cosmos tokenizer and NVIDIA NeMo Curator for video processing.

The open-source Cosmos tokenizer provides robotics developers superior visual tokenization by breaking down images and videos into high-quality tokens with exceptionally high compression rates. It runs up to 12x faster than current tokenizers, while NeMo Curator provides video processing curation up to 7x faster than unoptimized pipelines.

Also timed with CoRL, NVIDIA presented 23 papers and nine workshops related to robot learning and released training and workflow guides for developers. Further, Hugging Face and NVIDIA announced they're collaborating to accelerate open-source robotics research with LeRobot, NVIDIA Isaac Lab and NVIDIA Jetson for the developer community.

Accelerating Robot Development With Isaac Lab NVIDIA Isaac Lab is an open-source, robot learning framework built on NVIDIA Omniverse, a platform for developing OpenUSD applications for industrial digitalization and physical AI simulation.

Developers can use Isaac Lab to train robot policies at scale. This open-source unified robot learning framework applies to any embodiment - from humanoids to quadrupeds to collaborative robots - to handle increasingly complex movements and interactions.

Leading commercial robot makers, robotics application developers and robotics research entities around the world are adopting Isaac Lab, including 1X, Agility Robotics, The AI Institute, Berkeley Humanoid, Boston Dynamics, Field AI, Fourier, Galbot, Mentee Robotics, Skild AI, Swiss-Mile, Unitree Robotics and XPENG Robotics.

Project GR00T: Foundations for General-Purpose Humanoid Robots Building advanced humanoids is extremely difficult, demanding multilayer technological and interdisciplinary approaches to make the robots perceive, move and learn skills effectively for human-robot and robot-environment interactions.

Project GR00T is an initiative to develop accelerated libraries, foundation models and data pipelines to accelerate the global humanoid robot developer ecosystem.

Six new Project GR00T workflows provide humanoid developers with blueprints to realize the most challenging humanoid robot capabilities. They include:

GR00T-Gen for building generative AI-powered, OpenUSD-based 3D environments

GR00T-Mimic for robot motion and trajectory generation

GR00T-Dexterity for robot dexterous manipulation

GR00T-Control for whole-body control

GR00T-Mobility for robot locomotion and navigation

GR00T-Perception for multimodal sensing

Humanoid robots are the next wave of embodied AI, said Jim Fan, senior research manager of embodied AI at NVIDIA. NVIDIA research and engineering teams are collaborating across the company and our developer ecosystem to build Project GR00T to help advance the progress and development of global humanoid robot developers.

New Development Tools for World Model Builders Today, robot developers are building world models - AI representations of the world that can predict how objects and environments respond to a robot's actions. Building these world models is incredibly compute- and data-intensive, with models requiring thousands of hours of real-world, curated image or video data.

NVIDIA Cosmos tokenizers provide efficient, high-quality encoding and decoding to simplify the development of these world models. They set a new standard of minimal distortion and temporal instability, enabling high-quality video and image reconstructions.

Providing high-quality compression and up to 12x faster visual reconstruction, the Cosmos tokenizer paves the path for scalable, robust and efficient development of generative applications across a broad spectrum of visual domains.

1X, a humanoid robot company, has updated the 1X World Model Challenge dataset to use the Cosmos tokenizer.

NVIDIA Cosmos tokenizer achieves really high temporal and spatial compression of our data while still retaining visual fidelity, said Eric Jang, vice president of AI at 1X Technologies. This allows us to train world models with long horizon video generation in an even more compute-efficient manner.

Other humanoid and general-purpose robot developers, including XPENG Robotics and Hillbot, are developing with the NVIDIA Cosmos tokenizer to manage high-resolution images and videos.

NeMo Curator now includes a video processing pipeline. This enables robot developers to improve their world-model accuracy by processing large-scale text, image and video data.

Curating video data poses challenges due to its massive size, requiring scalable pipelines and efficient orchestration for load balancing across GPUs. Additionally, models for filtering, captioning and embedding need optimization to maximize throughput.

NeMo Curator overcomes these challenges by streamlining data curation with automatic pipeline orchestration, reducing processing time significantly. It supports linear scaling across multi-node, multi-GPU systems, efficiently handling over 100 petabytes of data. This simplifies AI development, reduces costs and accelerates time to market.

Advancing the Robot Learning Community at CoRL The nearly two dozen research papers the NVIDIA robotics team released with CoRL cover breakthroughs in integrating vision language models for improved environmental understanding and task execution, temporal robot navigati
LINK: https://blogs.nvidia.com/blog/robot-learning-humanoid-development/...
See more stories from nvidia

North America Stories

07/05/2026

CNN Founder Ted Turner Dies at 87

Share Copy link Facebook X Linkedin Bluesky Email...

07/05/2026

FCC Urges Appeals Court to Toss Challenges to Nexstar-Tegna Deal

Share Copy link Facebook X Linkedin Bluesky Email...

07/05/2026

Carr Announces FCC Staff Promotions

Share Copy link Facebook X Linkedin Bluesky Email...

07/05/2026

Recreating the 1974 Doctor Who Time Tunnel in After Effects

Recreating the 1974 Doctor Who Time Tunnel in After Effects Graham Quince May 6, 2026 0 Comments The Time Tunnel from Doctor Who titles is one of th...

06/05/2026

Wisycom RF Solutions Support Gravity Medias Live Cycling and Marathon Broadcasts

Gravity Media Chief RF Communications Engineer Glenn Willems uses Wisycom RF over Fiber and wireless solutions across major cycling events and international mar...

06/05/2026

Sennheiser Spectera Module Now Available in Bitfocus Companion and Buttons

A Sennheiser Spectera module is now available in Bitfocus Companion and Buttons, enabling direct integration of Spectera with the two software platforms. The mo...

06/05/2026

Ted Turner, Cable Television Pioneer, Sports Broadcasting Hall of Famer, Dead at 87

Ted Turner, the visionary media entrepreneur whose appetite for disruption helpe...

06/05/2026

FIFA World Cup 2026: Peacock Launches Visin de Campo (aka Pitchside Live), Will Stream All 104 Matches in Spanish

Peacock is going all-in on the beautiful game - streaming all 104 FIFA World Cup...

06/05/2026

L3Harris to Boost Polish Navy Combat Power with Advanced Ship System

Polands Miecznik-class frigates are part of the largest contract in Polish shipbuilding history. (Image Credit: PGZ Stocznia Wojenna)...

06/05/2026

FCC's Anna Gomez Urges Rigorous Review of Paramount-WBD Merger

Share Copy link Facebook X Linkedin Bluesky Email...

06/05/2026

Riedel Ups Marc Engroff to CFO, Shifts Frank Eischet to Group COO

Share Copy link Facebook X Linkedin Bluesky Email...

06/05/2026

Amagi Launches In-Content Ads' to Attract More CTV Advertisers

Share Copy link Facebook X Linkedin Bluesky Email...

06/05/2026

Riedel Expands Leadership Structure Appoints Marc Engroff...

Riedel Communications today announced the expansion of its leadership structure as part of a strategic initiative to strengthen both its operational management ...

06/05/2026

Production Sound Mixer Dirk Sciarrotta Delivers Camera Re...

For nearly three decades, Veteran Production Sound Mixer and Five-time Emmy Award Winner Dirk Sciarrotta has helped define the sonic identity of the long-runnin...

06/05/2026

ZEISS CinCraft LensCore: Cinema Lens Looks for Compositing

ZEISS CinCraft LensCore: Cinema Lens Looks for Compositing Brie Clayton May 6, 2026 0 Comments ZEISS announces the launch of CinCraft LensCore, a nove...

06/05/2026

Wisycom Solves Extreme RF Challenges Across Miles of Live Action for Gravity Media

Wisycom Solves Extreme RF Challenges Across Miles of Live Action for Gravity Med...

06/05/2026

NAB Launches Weekly Podcast on Local Broadcast Policy

Share Copy link Facebook X Linkedin Bluesky Email...

06/05/2026

Mavis Launches Mavis Studio iPad For Media Production

Share Copy link Facebook X Linkedin Bluesky Email...

06/05/2026

Narrative Entertainment partners with Encompass to provid...

Narrative Entertainment has partnered with Encompass to deliver high-quality subtitling of its Great! network content using the Altitude Intelligence AI assiste...

06/05/2026

SipRadius extends its seamless creation and connectivity...

SipRadius, widely recognized for making content processing and connectivity secure and seamless, is proud to launch a dramatic new approach to AI content creati...

06/05/2026

Big Blue Marble at ANGA COM - TV as a Service in the spot...

When the broadband and media industry gathers at ANGA COM in Cologne from May 19 to 21, Big Blue Marble will be at the forefront. The international broadcast an...

06/05/2026

Cinegy makes its MPTS debut with software-defined televis...

Cinegy GmbH, a leading developer of software-defined television technology, is proud to exhibit at MPTS for the first time. Visitors to the stand will discover ...

06/05/2026

Val Jeanty Receives 2026 Doris Duke Artist Award

Val Jeanty Receives 2026 Doris Duke Artist Award Jeanty, a composer, percussionist, and turntablist, is the fourth Berklee recipient of the prestigious award ...

06/05/2026

Zeiss Launches CinCraft LensCore

Share Copy link Facebook X Linkedin Bluesky Email...

06/05/2026

Gomez Urges Rigorous FCC Review of Paramount-WBD Merger

Share Copy link Facebook X Linkedin Bluesky Email...

06/05/2026

Wisycom Solves Extreme RF Challenges Across Miles of Live...

When live cycling races and international marathons stretch for miles across cities and countryside, there is no margin for RF failure in live broadcast. As Chi...

06/05/2026

ZEISS CinCraft LensCore - Cinema Lens Looks for Compositi...

Oberkochen/Germany, May 5, 2026 ZEISS announces the launch of CinCraft LensCore, a novel solution for creating physically based cinematic lens looks for visual...

06/05/2026

NVIDIA Spectrum-X - the Open, AI-Native Ethernet Fabric - Sets the Standard for Gigascale AI, Now With MRC

The race to build the world's most powerful AI factories demands networking ...

06/05/2026

May 05, 2026

How changes to proteins can alter drug interactions for new precision therapies Scripps Research team maps how chemical modifications to proteins affect drug bi...

05/05/2026

Samsung Galaxy S26 Ultra Phone Cameras Bring New Excitement to Street League Skateboarding

Three phones were hardwired for power and transmission to the truck; camera feat...

05/05/2026

Case Study: How Zaki Rose Rebuilt Its Production Infrastructure, and What It Means for Sports Content Creators

The creative studio behind campaigns for the NBA, Fanatics Sportsbook & Casino, ...

05/05/2026

Nielsen Co-Viewing Pilot Shows Average 4% Viewership Increase for February Live Events

Nielsen has announced results from a co-viewing pilot program covering February&...

05/05/2026

Nippon TV and FOR-A Win NAB Product of the Year and Future Best of Show Awards for viztrick AiDi

viztrick AiDi, an on-device AI solution developed by Nippon TV, delivered global...

05/05/2026

ARRI Introduces Omnibar LED Linear Fixture for Film, Live Entertainment, and Content Creation

ARRI has announced Omnibar, a battery-powered, IP65-rated multi-color LED linear...

05/05/2026

France Tlvisions Becomes First Broadcaster to Deploy Imagine Communications SNP-XS

Imagine Communications has announced that France T l visions is the first broadc...

05/05/2026

WNBA Announces Historic Canadian Media Rights Agreement with Bell Media

The Women's National Basketball Association (WNBA) and Bell Media today announced a multiyear agreement to broadcast and stream WNBA games in Canada beginni...

05/05/2026

Save the Date: SVG Remote Production Forum Heads to WBD's Techwood Studios in Atlanta on Sept. 23-24

SVG is proud to announce Warner Bros. Discovery's Techwood Studios in Atlant...

05/05/2026

Look Who's Talking: ESPN Integrates New Automated Commentator-ID Technology Into Scorebar Graphic for UFL Coverage

With no operator required, AutoMic workflow automates talent identification on U...

05/05/2026

Return Flight: How Live Broadcast Drones Died - and Were Reborn - on the Ski Slopes of Northern Italy

A crash in 2015 set the industry back, but this winter proved that drones are he...

05/05/2026

L3Harris Provides Key Technologies for Newly Commissioned Navy Submarines

L3Harris provides communications, electronic warfare, sensors and mission systems that enable Virginia-class submarine crews to operate with confidence in conte...

05/05/2026

Gray Media Closes Purchase of 10 Allen Media Group Stations

Share Copy link Facebook X Linkedin Bluesky Email...

05/05/2026

Dang Ly Joins Operative as Chief Product Officer

Share Copy link Facebook X Linkedin Bluesky Email...

05/05/2026

CIMM, TVB Release Local TV Currency Measurement Guidelines

Share Copy link Facebook X Linkedin Bluesky Email...

05/05/2026

ARRI Introduces Omnibar LED Linear Fixture

Share Copy link Facebook X Linkedin Bluesky Email...

05/05/2026

France Televisions Continues ST 2110 Migration With Imagi...

Project Marks First Major Broadcast Deployment of Latest Addition to SNP Lineup Imagine Communications today announced that France T l visions is the first br...

05/05/2026

Shotoku Broadcast Systems Wins 2026 NAB Show Product of t...

Shotoku Broadcast Systems Wins 2026 NAB Show Product of the Year Award Shotoku Broadcast Systems announced today that its Swoop range of robotic cranes has be...

05/05/2026

DigitalGlues creativespace Intelligence Wins Futures Best...

DigitalGlue's creative.space Intelligence Wins Future's Best of Show Award, Presented by TV Tech creative.space Intelligence (CSI), part of the creativ...

05/05/2026

Zixi Showcases Next-Generation Live Video Workflows and M...

Zixi, a leader in live video delivery and workflow orchestration, will showcase next-generation broadcast workflows at the Media Production and Technology Show ...

05/05/2026

Stingr marks its launch with a new approach to second-screen interactivity

Stingr marks its launch with a new approach to second-screen interactivity Brie Clayton May 5, 2026 0 Comments Huge leap forward in revenues and engag...

05/05/2026

Shotoku Broadcast Systems Wins 2026 NAB Show Product of the Year Award

Shotoku Broadcast Systems Wins 2026 NAB Show Product of the Year Award Brie Clayton May 5, 2026 0 Comments Shotoku Broadcast Systems announced today tha...