NVIDIA Advances Robot Learning and Humanoid Development With New AI and Simulation Tools

06/11/2024

Robotics developers can greatly accelerate their work on AI-enabled robots, including humanoids, using new AI and simulation tools and workflows that NVIDIA revealed this week at the Conference on Robot Learning (CoRL) in Munich, Germany.

The lineup includes the general availability of the NVIDIA Isaac Lab robot learning framework; six new humanoid robot learning workflows for Project GR00T, an initiative to accelerate humanoid robot development; and new world-model development tools for video data curation and processing, including the NVIDIA Cosmos tokenizer and NVIDIA NeMo Curator for video processing.

The open-source Cosmos tokenizer gives robotics developers superior visual tokenization by breaking down images and videos into high-quality tokens with exceptionally high compression rates. It runs up to 12x faster than current tokenizers, while NeMo Curator curates video data up to 7x faster than unoptimized pipelines.

Also timed with CoRL, NVIDIA presented 23 papers and nine workshops related to robot learning and released training and workflow guides for developers. Further, Hugging Face and NVIDIA announced they're collaborating to accelerate open-source robotics research with LeRobot, NVIDIA Isaac Lab and NVIDIA Jetson for the developer community.

Accelerating Robot Development With Isaac Lab

NVIDIA Isaac Lab is an open-source robot learning framework built on NVIDIA Omniverse, a platform for developing OpenUSD applications for industrial digitalization and physical AI simulation.

Developers can use Isaac Lab to train robot policies at scale. This open-source, unified robot learning framework applies to any embodiment, from humanoids to quadrupeds to collaborative robots, and helps them handle increasingly complex movements and interactions.

Leading commercial robot makers, robotics application developers and robotics research entities around the world are adopting Isaac Lab, including 1X, Agility Robotics, The AI Institute, Berkeley Humanoid, Boston Dynamics, Field AI, Fourier, Galbot, Mentee Robotics, Skild AI, Swiss-Mile, Unitree Robotics and XPENG Robotics.

Project GR00T: Foundations for General-Purpose Humanoid Robots

Building advanced humanoids is extremely difficult, demanding multilayer technological and interdisciplinary approaches to make the robots perceive, move and learn skills effectively for human-robot and robot-environment interactions.

Project GR00T is an initiative to develop accelerated libraries, foundation models and data pipelines to accelerate the global humanoid robot developer ecosystem.

Six new Project GR00T workflows provide humanoid developers with blueprints to realize the most challenging humanoid robot capabilities. They include:

GR00T-Gen for building generative AI-powered, OpenUSD-based 3D environments

GR00T-Mimic for robot motion and trajectory generation

GR00T-Dexterity for robot dexterous manipulation

GR00T-Control for whole-body control

GR00T-Mobility for robot locomotion and navigation

GR00T-Perception for multimodal sensing

"Humanoid robots are the next wave of embodied AI," said Jim Fan, senior research manager of embodied AI at NVIDIA. "NVIDIA research and engineering teams are collaborating across the company and our developer ecosystem to build Project GR00T to help advance the progress and development of global humanoid robot developers."

New Development Tools for World Model Builders

Today, robot developers are building world models: AI representations of the world that can predict how objects and environments respond to a robot's actions. Building these world models is incredibly compute- and data-intensive, with models requiring thousands of hours of real-world, curated image or video data.

NVIDIA Cosmos tokenizers provide efficient, high-quality encoding and decoding to simplify the development of these world models. They set a new standard for minimizing distortion and temporal instability, enabling high-quality video and image reconstructions.

Providing high-quality compression and up to 12x faster visual reconstruction, the Cosmos tokenizer paves the path for scalable, robust and efficient development of generative applications across a broad spectrum of visual domains.

1X, a humanoid robot company, has updated the 1X World Model Challenge dataset to use the Cosmos tokenizer.

"NVIDIA Cosmos tokenizer achieves really high temporal and spatial compression of our data while still retaining visual fidelity," said Eric Jang, vice president of AI at 1X Technologies. "This allows us to train world models with long horizon video generation in an even more compute-efficient manner."

Other humanoid and general-purpose robot developers, including XPENG Robotics and Hillbot, are developing with the NVIDIA Cosmos tokenizer to manage high-resolution images and videos.

NeMo Curator now includes a video processing pipeline. This enables robot developers to improve their world-model accuracy by processing large-scale text, image and video data.

Curating video data poses challenges due to its massive size, requiring scalable pipelines and efficient orchestration for load balancing across GPUs. Additionally, models for filtering, captioning and embedding need optimization to maximize throughput.

NeMo Curator overcomes these challenges by streamlining data curation with automatic pipeline orchestration, reducing processing time significantly. It supports linear scaling across multi-node, multi-GPU systems, efficiently handling over 100 petabytes of data. This simplifies AI development, reduces costs and accelerates time to market.

Advancing the Robot Learning Community at CoRL

The nearly two dozen research papers the NVIDIA robotics team released with CoRL cover breakthroughs in integrating vision language models for improved environmental understanding and task execution, temporal robot navigati
LINK: https://blogs.nvidia.com/blog/robot-learning-humanoid-development/...