Sony Pixel Power calrec Sony

NVIDIA Advances Robot Learning and Humanoid Development With New AI and Simulation Tools

06/11/2024

www.1x.tech

Robotics developers can greatly accelerate their work on AI-enabled robots, including humanoids, using new AI and simulation tools and workflows that NVIDIA revealed this week at the Conference for Robot Learning (CoRL) in Munich, Germany.

The lineup includes the general availability of the NVIDIA Isaac Lab robot learning framework; six new humanoid robot learning workflows for Project GR00T, an initiative to accelerate humanoid robot development; and new world-model development tools for video data curation and processing, including the NVIDIA Cosmos tokenizer and NVIDIA NeMo Curator for video processing.

The open-source Cosmos tokenizer provides robotics developers superior visual tokenization by breaking down images and videos into high-quality tokens with exceptionally high compression rates. It runs up to 12x faster than current tokenizers, while NeMo Curator provides video processing curation up to 7x faster than unoptimized pipelines.

Also timed with CoRL, NVIDIA presented 23 papers and nine workshops related to robot learning and released training and workflow guides for developers. Further, Hugging Face and NVIDIA announced they're collaborating to accelerate open-source robotics research with LeRobot, NVIDIA Isaac Lab and NVIDIA Jetson for the developer community.

Accelerating Robot Development With Isaac Lab NVIDIA Isaac Lab is an open-source, robot learning framework built on NVIDIA Omniverse, a platform for developing OpenUSD applications for industrial digitalization and physical AI simulation.

Developers can use Isaac Lab to train robot policies at scale. This open-source unified robot learning framework applies to any embodiment - from humanoids to quadrupeds to collaborative robots - to handle increasingly complex movements and interactions.

Leading commercial robot makers, robotics application developers and robotics research entities around the world are adopting Isaac Lab, including 1X, Agility Robotics, The AI Institute, Berkeley Humanoid, Boston Dynamics, Field AI, Fourier, Galbot, Mentee Robotics, Skild AI, Swiss-Mile, Unitree Robotics and XPENG Robotics.

Project GR00T: Foundations for General-Purpose Humanoid Robots Building advanced humanoids is extremely difficult, demanding multilayer technological and interdisciplinary approaches to make the robots perceive, move and learn skills effectively for human-robot and robot-environment interactions.

Project GR00T is an initiative to develop accelerated libraries, foundation models and data pipelines to accelerate the global humanoid robot developer ecosystem.

Six new Project GR00T workflows provide humanoid developers with blueprints to realize the most challenging humanoid robot capabilities. They include:

GR00T-Gen for building generative AI-powered, OpenUSD-based 3D environments

GR00T-Mimic for robot motion and trajectory generation

GR00T-Dexterity for robot dexterous manipulation

GR00T-Control for whole-body control

GR00T-Mobility for robot locomotion and navigation

GR00T-Perception for multimodal sensing

Humanoid robots are the next wave of embodied AI, said Jim Fan, senior research manager of embodied AI at NVIDIA. NVIDIA research and engineering teams are collaborating across the company and our developer ecosystem to build Project GR00T to help advance the progress and development of global humanoid robot developers.

New Development Tools for World Model Builders Today, robot developers are building world models - AI representations of the world that can predict how objects and environments respond to a robot's actions. Building these world models is incredibly compute- and data-intensive, with models requiring thousands of hours of real-world, curated image or video data.

NVIDIA Cosmos tokenizers provide efficient, high-quality encoding and decoding to simplify the development of these world models. They set a new standard of minimal distortion and temporal instability, enabling high-quality video and image reconstructions.

Providing high-quality compression and up to 12x faster visual reconstruction, the Cosmos tokenizer paves the path for scalable, robust and efficient development of generative applications across a broad spectrum of visual domains.

1X, a humanoid robot company, has updated the 1X World Model Challenge dataset to use the Cosmos tokenizer.

NVIDIA Cosmos tokenizer achieves really high temporal and spatial compression of our data while still retaining visual fidelity, said Eric Jang, vice president of AI at 1X Technologies. This allows us to train world models with long horizon video generation in an even more compute-efficient manner.

Other humanoid and general-purpose robot developers, including XPENG Robotics and Hillbot, are developing with the NVIDIA Cosmos tokenizer to manage high-resolution images and videos.

NeMo Curator now includes a video processing pipeline. This enables robot developers to improve their world-model accuracy by processing large-scale text, image and video data.

Curating video data poses challenges due to its massive size, requiring scalable pipelines and efficient orchestration for load balancing across GPUs. Additionally, models for filtering, captioning and embedding need optimization to maximize throughput.

NeMo Curator overcomes these challenges by streamlining data curation with automatic pipeline orchestration, reducing processing time significantly. It supports linear scaling across multi-node, multi-GPU systems, efficiently handling over 100 petabytes of data. This simplifies AI development, reduces costs and accelerates time to market.

Advancing the Robot Learning Community at CoRL The nearly two dozen research papers the NVIDIA robotics team released with CoRL cover breakthroughs in integrating vision language models for improved environmental understanding and task execution, temporal robot navigati
LINK: https://blogs.nvidia.com/blog/robot-learning-humanoid-development/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

26/02/2026

AWS Launches New Tool for Vertical Video Conversion

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Broadpeak Launches Multiview For Live Sports at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Griffin Media Rolls Out Bitcentral Core News At KWTV, KOTV

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Adobes New Firefly QuickCut Gives Video Editors a Starting Point

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Disney Gains But YouTube Continues to Dominate Screentime

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

BCNEXXT Adds HLG-Based HDR To Vipe Platform

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

FCC Launches Inquiry Into Broadcast Sports Rights

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Avids New CPO Discusses AI, NAB Show and Newsroom Tech

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Samsung Taps Gracenote for AI-Powered Discovery

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Less Tools, More Visibility: TAG Video Systems at NAB 2026

Share Copy link Facebook X Linkedin Bluesky Email...

26/02/2026

Studio Technologies Dante-Based Solutions Power High-Prof...

With more than four decades of experience in radio broadcasting and live sports production, Daryl Doss, owner of Doss Technical Services and a contract engineer...

26/02/2026

BCNEXXT Deploys Live HLG-Based HDR Playout Within Vipe Pl...

BCNEXXT has deployed live HLG-based HDR playout capabilities within its Vipe platform, enabling broadcasters to integrate High Dynamic Range into live productio...

26/02/2026

TAG Video Systems at NAB 2026

TAG Video Systems (Booth W2323) will unveil new capabilities across its IP-native Realtime Media Platform at NAB 2026. New releases include visual service healt...

26/02/2026

IBC2026 unveils strategic partnership with EIT Culture an...

IBC today announced a new strategic partnership with EIT Culture & Creativity the institutional partnership for culture and creativity, supported by the Europ...

26/02/2026

Clear-Coms Arcadia Central Station and FreeSpeak Icon Bel...

Clear-Com kept the action on track at Red Bull Shay'iMoto, an adrenaline-fueled motorsport spinning event that transformed the streets of Durban, South Afr...

26/02/2026

Alcom Elevates Headend Video Service with Harmonic to Dri...

Harmonic (NASDAQ: HLIT) today announced that Alcom, a leading telco operator in Finland, is powering its next-generation white-label headend video service with ...

26/02/2026

Big Blue Marble named as Launch Partner for AWS Elemental...

Big Blue Marble, a provider of broadcast-grade, cloud-native video solutions for broadcasters, service providers, and content owners, today announced that it ha...

26/02/2026

Broadpeak launches Multiview solution to simplify multi-s...

New approach enables video service providers to deliver multiple live feeds on the same screen with lower costs and improved device compatibility Broadpeak, a ...

26/02/2026

Space42 Reports Full-Year 2025 Earnings and returns to Quarterly Growth

Final quarter revenues increase 7% year-on-year, with accelerating momentum in the second half Space Services grows revenues by 6% year-on-year and records hig...

25/02/2026

Record Global Audiences Announced as Olympic Baton Passes to French Alps 2030

With the Olympic Flag officially handed over to the organisers of the next Winter Games and the baton passed from Milano Cortina 2026 to French Alps 2030, the I...

25/02/2026

BBC Shrinks IBC Footprint as Remote Production Takes Center Stage

From a studio overlooking the Dolomites to workflows routed through Milan and into Salford, the BBC delivered a lean and mean operation for its Winter Games c...

25/02/2026

Making the Warner Bros. Discovery Sports Winter Olympics Production Work, From Monitoring to Transmission to Comms

Warner Bros. Discovery (WBD) Sports is managing a huge network of channels acros...

25/02/2026

How Warner Bros. Discovery Sports Used XR to Bring the Peaks of the Dolomites Into View

From its base in the northern Italian town of Cortina, Warner Bros. Discovery (W...

25/02/2026

New AWS Elemental Inference' Offers AI-Powered, Real-Time Vertical-Video Conversion

In addition to 16:9-to-9:16 intelligent cropping for live video, Inference autom...

25/02/2026

Netflix To Livestream Floyd Mayweather Jr. vs. Manny Pacquiao Rematch on Sept. 19 From Sphere

Longtime rivals Floyd Money Mayweather Jr. (50-0, 27 KOs) and Manny PacMan P...

25/02/2026

Portland Fire, Thorns Announce Landmark Broadcast Partnership with Gray Media's FOX 12 Plus

The WNBA's Portland Fire and NWSL's Portland Thorns announce a groundbre...

25/02/2026

NHL, Cosm Install C360 10.5K Capture Systems at the League's Arenas

Multi-angle coverage, on-demand access to ultra-high-resolution video are provided for replays and clips across multiple distribution channels The NHL and Cosm...

25/02/2026

SVG Sit-Down: Cosm's Devin Poolman and Evan Wimer on Installing 10.5K C360 Cameras at All 32 NHL Venues

The implementation standardizes an integrated workflow connecting ultra-high-res...

25/02/2026

Hangin' With the Hornets: Enjoy Basketball Goes Behind the Scenes at NBA Charlotte Franchise

Targeting a younger audience, creator-led network's Access Granted series hi...

25/02/2026

Orlando Magic Jump Into ST 2110 With New Production-Control Room at Kia Center

Alpha, the project's systems integrator, assisted in the workflow transformation Tipping off the second half of the 2025-26 home schedule against the Houst...

25/02/2026

OCVIBE and Global Digital Display Firm Daktronics Announce Technology Partnership

OCVIBE, the 100-acre mixed-use development transforming the area surrounding Hon...

25/02/2026

Level Up Your Playlists' Transitions With Smart Reorder

It's never been easier to customize your Spotify listening experience. Last year, we introduced more control over the way your playlist sounds, giving Premi...

25/02/2026

Who's Going to Lead Hip-Hop's Next Generation? Vote Now on Spotify

Hip-hop thrives on constant reinvention, with bold voices and fearless experimentation continually pushing the genre's boundaries. Every era brings new lead...

25/02/2026

Keeping America's Space Watchtower Sharp: US and Australia Work to Advance Critical Telescope Capacity

L3Harris technicians recently completed a major mirror refurbishment for the U.S...

25/02/2026

Nielsen Utilizes Scarborough To Introduce 200+ New, Advanced Audience Segments Via Nielsen ONE

This new offering helps solve for the need to move beyond traditional audience d...

25/02/2026

Samsung taps Gracenote to supercharge range of AI initiatives

Gold-standard Gracenote content metadata will power Samsung's LLM-enabled entertainment search discovery experiences and more NEW YORK February 25, 202...

25/02/2026

Afrobeats Icon Tiwa Savage Joins Forces with Berklee to Empower African Talent

Afrobeats Icon Tiwa Savage Joins Forces with Berklee to Empower African Talent In collaboration with Berklee Global, the Tiwa Savage Music Foundation will hos...

25/02/2026

Wowza Names Jon Corley as Chief Innovation Officer

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

Milan Cortina Winter Olympics U.S. Viewing Best Since 2014

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

When It Comes to the Upper C-Band, Wireless Carriers Want More

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

ASG Elevates Jody Boatwright to Chief Strategy Officer

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

Study: Premium Video Ads Outperform YouTube

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

TelevisaUnivision's ViX Streamer Achieves Profitability

Share Copy link Facebook X Linkedin Bluesky Email...

25/02/2026

Arch Platform Technologies and Wacom Announce Strategic I...

Arch Platform Technologies, a leading platform for creating and managing cloud workstation infrastructure, and Wacom, the world's leading manufacturer of in...

25/02/2026

Transform live video for mobile audiences with AWS Elemen...

Today, AWS is announcing AWS Elemental Inference, a fully managed AI service that automatically transforms and maximizes live and on-demand video broadcasts to ...