Sony Pixel Power calrec Sony

NVIDIA Advances Robot Learning and Humanoid Development With New AI and Simulation Tools

06/11/2024

www.1x.tech

Robotics developers can greatly accelerate their work on AI-enabled robots, including humanoids, using new AI and simulation tools and workflows that NVIDIA revealed this week at the Conference for Robot Learning (CoRL) in Munich, Germany.

The lineup includes the general availability of the NVIDIA Isaac Lab robot learning framework; six new humanoid robot learning workflows for Project GR00T, an initiative to accelerate humanoid robot development; and new world-model development tools for video data curation and processing, including the NVIDIA Cosmos tokenizer and NVIDIA NeMo Curator for video processing.

The open-source Cosmos tokenizer provides robotics developers superior visual tokenization by breaking down images and videos into high-quality tokens with exceptionally high compression rates. It runs up to 12x faster than current tokenizers, while NeMo Curator provides video processing curation up to 7x faster than unoptimized pipelines.

Also timed with CoRL, NVIDIA presented 23 papers and nine workshops related to robot learning and released training and workflow guides for developers. Further, Hugging Face and NVIDIA announced they're collaborating to accelerate open-source robotics research with LeRobot, NVIDIA Isaac Lab and NVIDIA Jetson for the developer community.

Accelerating Robot Development With Isaac Lab NVIDIA Isaac Lab is an open-source, robot learning framework built on NVIDIA Omniverse, a platform for developing OpenUSD applications for industrial digitalization and physical AI simulation.

Developers can use Isaac Lab to train robot policies at scale. This open-source unified robot learning framework applies to any embodiment - from humanoids to quadrupeds to collaborative robots - to handle increasingly complex movements and interactions.

Leading commercial robot makers, robotics application developers and robotics research entities around the world are adopting Isaac Lab, including 1X, Agility Robotics, The AI Institute, Berkeley Humanoid, Boston Dynamics, Field AI, Fourier, Galbot, Mentee Robotics, Skild AI, Swiss-Mile, Unitree Robotics and XPENG Robotics.

Project GR00T: Foundations for General-Purpose Humanoid Robots Building advanced humanoids is extremely difficult, demanding multilayer technological and interdisciplinary approaches to make the robots perceive, move and learn skills effectively for human-robot and robot-environment interactions.

Project GR00T is an initiative to develop accelerated libraries, foundation models and data pipelines to accelerate the global humanoid robot developer ecosystem.

Six new Project GR00T workflows provide humanoid developers with blueprints to realize the most challenging humanoid robot capabilities. They include:

GR00T-Gen for building generative AI-powered, OpenUSD-based 3D environments

GR00T-Mimic for robot motion and trajectory generation

GR00T-Dexterity for robot dexterous manipulation

GR00T-Control for whole-body control

GR00T-Mobility for robot locomotion and navigation

GR00T-Perception for multimodal sensing

Humanoid robots are the next wave of embodied AI, said Jim Fan, senior research manager of embodied AI at NVIDIA. NVIDIA research and engineering teams are collaborating across the company and our developer ecosystem to build Project GR00T to help advance the progress and development of global humanoid robot developers.

New Development Tools for World Model Builders Today, robot developers are building world models - AI representations of the world that can predict how objects and environments respond to a robot's actions. Building these world models is incredibly compute- and data-intensive, with models requiring thousands of hours of real-world, curated image or video data.

NVIDIA Cosmos tokenizers provide efficient, high-quality encoding and decoding to simplify the development of these world models. They set a new standard of minimal distortion and temporal instability, enabling high-quality video and image reconstructions.

Providing high-quality compression and up to 12x faster visual reconstruction, the Cosmos tokenizer paves the path for scalable, robust and efficient development of generative applications across a broad spectrum of visual domains.

1X, a humanoid robot company, has updated the 1X World Model Challenge dataset to use the Cosmos tokenizer.

NVIDIA Cosmos tokenizer achieves really high temporal and spatial compression of our data while still retaining visual fidelity, said Eric Jang, vice president of AI at 1X Technologies. This allows us to train world models with long horizon video generation in an even more compute-efficient manner.

Other humanoid and general-purpose robot developers, including XPENG Robotics and Hillbot, are developing with the NVIDIA Cosmos tokenizer to manage high-resolution images and videos.

NeMo Curator now includes a video processing pipeline. This enables robot developers to improve their world-model accuracy by processing large-scale text, image and video data.

Curating video data poses challenges due to its massive size, requiring scalable pipelines and efficient orchestration for load balancing across GPUs. Additionally, models for filtering, captioning and embedding need optimization to maximize throughput.

NeMo Curator overcomes these challenges by streamlining data curation with automatic pipeline orchestration, reducing processing time significantly. It supports linear scaling across multi-node, multi-GPU systems, efficiently handling over 100 petabytes of data. This simplifies AI development, reduces costs and accelerates time to market.

Advancing the Robot Learning Community at CoRL The nearly two dozen research papers the NVIDIA robotics team released with CoRL cover breakthroughs in integrating vision language models for improved environmental understanding and task execution, temporal robot navigati
LINK: https://blogs.nvidia.com/blog/robot-learning-humanoid-development/...
See more stories from nvidia

North America Stories

20/03/2026

MRMC Names CP Communications Its Official U.S. Rental, Sales Partner

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

FOR-A To Feature Software-Defined, AI-Driven Solutions At 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

2026 NAB Show Exhibitor Insight: Riedel Communications

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

DirecTV Files Suit to Block Nexstar/Tegna Deal

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

Fujifilm Announces Four New Broadcast Zoom Lenses

Share Copy link Facebook X Linkedin Bluesky Email...

20/03/2026

NAB 2026: Software-Defined, AI-Powered Workflow Tells the...

Real-time 9:16 AI-Generated Autocropping; Software-Defined Station in a Box; and Software Switcher with Unlimited Layering Are Among Show Highlights For the fi...

20/03/2026

Signiant Showcases New Content Innovations Driving Visibility, Access, and Action at NAB 2026

Signiant Showcases New Content Innovations Driving Visibility, Access, and Actio...

20/03/2026

Caffeine Relies on DaVinci Resolve Studio for End to End Post Workflow

Caffeine Relies on DaVinci Resolve Studio for End to End Post Workflow Brie Clayton March 19, 2026 0 Comments Blackmagic Cloud helps Mexican post faci...

19/03/2026

The Rise of Streaming, Particularly for Sports, Revives Loudness Issues

Live sports production increases complexity, with dynamic audio levels and an overall philosophy that encourages transient volume spikes Fourteen years ago, Am...

19/03/2026

Advanced Systems Group Names Peter Thordarson as Technical Account Executive

Advanced Systems Group, a technology and services provider for media creatives and content owners, announced the appointment of Peter Thordarson to the newly cr...

19/03/2026

SVG Students To Watch: Arya Taymuree, University of Washington

For this senior from the Bay Area, the speed and pressure of live sports production play right into her strengths In the live-sports-video industry, the future...

19/03/2026

Grass Valley Expands Partnership with University of Pittsburgh Athletics, Upgrading Production Infrastructure to SMPTE ST 2110 IP

Grass Valley has expanded its long-term partnership with University of Pittsburg...

19/03/2026

Audio-Technica Debuts ATV-SG1 and ATV-SG1LE On-Camera Shotgun Microphones

Audio-Technica has released the ATV-SG1 and ATV-SG1LE On-Camera Shotgun Microphones, designed for use with DSLR, mirrorless SLR, and other cameras. The ATV-SG1...

19/03/2026

NAB 2026: Harmonic Enhances XOS Advanced Media Processor to Streamline Next-Generation Broadcast Distribution

Harmonic (booth W2831) announces updates to its XOS Advanced Media Processor aim...

19/03/2026

DAZN and Top Rank Sign Multi-Year Rights Deal to Bring Marquee Events and Historic Archive to the Global Home of Boxing

DAZN and Top Rank have announced a multi-year partnership that will bring Top Ra...

19/03/2026

IHSE and Cyviz Announce Strategic Partnership

IHSE, a provider of KVM systems, has announced a partnership with Cyviz AS, a provider of technology solutions for collaboration and mission-critical operations...

19/03/2026

Net Insight appoints Larissa Grner-Meeus as Chief Product Officer (CPO)

Net Insight has appointed Larissa G rner-Meeus as Chief Product Officer. She joins the company's executive management team. G rner-Meeus holds a Dipl-Ing. ...

19/03/2026

Leader Appoints Rob Stanley as Regional Sales Manager UK & Northern Europe

Leader Electronics of Europe has appointed Rob Stanley as Regional Sales Manager for the UK and Northern Europe. In the role, he will manage key accounts and ha...

19/03/2026

FIFA and YouTube Team Up in FIFA World Cup 2026 Preferred Platform Agreement

FIFA has announced that YouTube will be a Preferred Platform for the FIFA World Cup 2026. Under the agreement, FIFA's Media Partners will be able to publis...

19/03/2026

Upgrade to NCAA March Madness Live App Expands Multi-Game Viewing, Enhances Second-Screen Experience

New features across mobile, connected devices, and automotive platforms undersco...

19/03/2026

PSSI Global Services Welcomes Ben Bradshaw as Director of Product and Network Development

PSSI Global Services has appointed Ben Bradshaw as Director of Product and Netwo...

19/03/2026

NAB 2026: Cobalt Digital to Unveil Additions to End-to-End IPMX and ST 2110 Ecosystem

Cobalt Digital has announced its NAB 2026 product lineup, which includes additio...

19/03/2026

Sportradar Releases Industry Outlook on the Future of U.S. Sports Viewing

Sportradar has released a new report, Innovation in Sports Media: The Next Era of Sports Viewing, examining how the sports viewing experience in the U.S. is evo...

19/03/2026

Matrox Video's ConvertIP Awarded in Rai Framework Agreement Supporting IP Modernization Strategy

Matrox Video has been awarded a three-year framework agreement to supply its Con...

19/03/2026

Controlled Chaos: Inside the Mighty Production Engine Behind the NCAA Men's Basketball Tournament's First Week

CBS Sports' Jason Cohen and TNT Sports' Chris Brown lead the charge on n...

19/03/2026

Loud and Fun Is the Goal for NCAA Tourney Audio

A1 Dave Grundtvig and his team deploy plenty of mics to capture the sounds and energy from the stands as well the court March Madness is a tournament in which ...

19/03/2026

Leader appoints Rob Stanley as Regional Sales Manager UK...

Test & measurement innovator, Leader Electronics of Europe, is pleased to announce the appointment of Rob Stanley as Regional Sales Manager - UK & Northern Euro...

19/03/2026

Accedo One and Magine Pro Officially Launch Leyra Deliver...

The recently announced joint venture between Accedo One and Magine Pro has been officially launched as Leyra. The new company will combine the two complementary...

19/03/2026

Lightware matrices are the go-to choice for signal manage...

Budapest, Hungary, March 2026 - Demand for traditional matrix switching remains strong across live events, rental and staging markets. With a reputation for rel...

19/03/2026

DPA Elevates 4097 Micro Shotgun With CORE Technology

DPA Microphones adds to its CORE microphone selection with the 4097 CORE Micro Shotgun, which delivers a new level of clarity, headroom and sonic transparency...

19/03/2026

Starfish highlights flexible TS Splicer releases and new...

Starfish Technologies will present the latest releases of its TS Splicer (Win) and TS Splicer (K8) at NAB Show 2026, together with a new Monitoring Dashboard de...

19/03/2026

TrueVisions Selects Bitmovin Observability

Bitmovin, a leading provider of video streaming solutions, has announced that TrueVisions NOW, a leading streaming platform in Thailand, and part of the TrueVis...

19/03/2026

Harmonic Enhances XOS Advanced Media Processor to Streaml...

Harmonic (NASDAQ: HLIT) today announced significant enhancements to its XOS Advanced Media Processor that lower the cost of broadcast distribution while enablin...

19/03/2026

Cobalt Digital to Unveil Additions to End to End IPMX and...

Cobalt Digital, the leading designer and manufacturer of award-winning signal processing products, and a founding partner in the openGear initiative has announ...

19/03/2026

Magewell Connecting Any Source Anywhere in Any Form Facto...

Magewell a developer of innovative, high-performance video I/O and IP workflow solutions will be at the 2026 NAB Show on booth C6113. In addition to several...

19/03/2026

Triveni Digital Expands NextGen TV Innovations and Suppor...

Triveni Digital, a trusted leader in ATSC 1.0 and ATSC 3.0 service delivery, data broadcasting and quality assurance solutions, today announced it will showcase...

19/03/2026

Imagine Communications to Showcase Purpose Led Innovation...

2026 NAB Show Exhibitor Preview April 19 22 Las Vegas Booth N.1328 Summary: At the 2026 NAB Show in Las Vegas, Imagine Communications will showcase the lat...

19/03/2026

Pebble demonstrates partnerships performance and future-r...

Broadcast playout leader highlights innovation and industry collaboration Pebble, the leading automation, content management and integrated channel specialist...

19/03/2026

Tuxera showcases stunning performance with Fusion SMB and...

Tuxera, a leading provider of quality-assured file systems and networking technologies, is highlighting remarkable advances in performance at NAB Show (booth N1...

19/03/2026

Intinor introduces enhanced SRT monitoring, HDR transport...

Intinor will showcase several new developments to the Direkt platform at NAB 2026, including enhanced SRT monitoring, expanded HDR transport capabilities and su...

19/03/2026

Caretta Research Finds Broadcasters Rebuild Live Operatio...

Broadcasters are rebuilding live operations workflows around IP delivery as satellite and dedicated fibre distribution decline, according to new research from C...

19/03/2026

Berklee Popular Music Institute's 24th Showcase Channels '80s Prom Nostalgia

Berklee Popular Music Institute's 24th Showcase Channels '80s Prom Nosta...

19/03/2026

Study: Creator Content Plays Growing Role in Streaming Habits

Share Copy link Facebook X Linkedin Bluesky Email...

19/03/2026

NAB Show: Studio Technologies to Debut New StudioComm System

Share Copy link Facebook X Linkedin Bluesky Email...

19/03/2026

Ateme Verified for YouTube Live

Share Copy link Facebook X Linkedin Bluesky Email...

19/03/2026

Roku Launches NCAA March Madness Zone

Share Copy link Facebook X Linkedin Bluesky Email...

19/03/2026

Harmonic Makes NextGen TV Upgrades to XOS Media Processor

Share Copy link Facebook X Linkedin Bluesky Email...

19/03/2026

FCC Removes More Drones from Covered List of Banned Products

Share Copy link Facebook X Linkedin Bluesky Email...

19/03/2026

How Chris Bolte Builds the Sound of PlayStation Games

How Chris Bolte Builds the Sound of PlayStation Games Chris Bolte '16 is a technical sound designer at Sucker Punch Productions, where he helps create the...

19/03/2026

The Highest Performing ISPs of 2025

Back to All News The Highest Performing ISPs of 2025 Product 19 March 2026 Global Link copied to clipboard Canada, South Korea, the United Kingdom, and th...