Sony Pixel Power calrec Sony

NVIDIA Research Wins CVPR Autonomous Grand Challenge for End-to-End Driving

17/06/2024

Making moves to accelerate self-driving car development, NVIDIA was today named an Autonomous Grand Challenge winner at the Computer Vision and Pattern Recognition (CVPR) conference, running this week in Seattle.

Building on last year's win in 3D Occupancy Prediction, NVIDIA Research topped the leaderboard this year in the End-to-End Driving at Scale category with its Hydra-MDP model, outperforming more than 400 entries worldwide.

This milestone shows the importance of generative AI in building applications for physical AI deployments in autonomous vehicle (AV) development. The technology can also be applied to industrial environments, healthcare, robotics and other areas.

The winning submission received CVPR's Innovation Award as well, recognizing NVIDIA's approach to improving any end-to-end driving model using learned open-loop proxy metrics.

In addition, NVIDIA announced NVIDIA Omniverse Cloud Sensor RTX, a set of microservices that enable physically accurate sensor simulation to accelerate the development of fully autonomous machines of every kind.

How End-to-End Driving Works The race to develop self-driving cars isn't a sprint but more a never-ending triathlon, with three distinct yet crucial parts operating simultaneously: AI training, simulation and autonomous driving. Each requires its own accelerated computing platform, and together, the full-stack systems purpose-built for these steps form a powerful triad that enables continuous development cycles, always improving in performance and safety.

To accomplish this, a model is first trained on an AI supercomputer such as NVIDIA DGX. It's then tested and validated in simulation - using the NVIDIA Omniverse platform and running on an NVIDIA OVX system - before entering the vehicle, where, lastly, the NVIDIA DRIVE AGX platform processes sensor data through the model in real time.

Building an autonomous system to navigate safely in the complex physical world is extremely challenging. The system needs to perceive and understand its surrounding environment holistically, then make correct, safe decisions in a fraction of a second. This requires human-like situational awareness to handle potentially dangerous or rare scenarios.

AV software development has traditionally been based on a modular approach, with separate components for object detection and tracking, trajectory prediction, and path planning and control.

End-to-end autonomous driving systems streamline this process using a unified model to take in sensor input and produce vehicle trajectories, helping avoid overcomplicated pipelines and providing a more holistic, data-driven approach to handle real-world scenarios.

Watch a video about the Hydra-MDP model, winner of the CVPR Autonomous Grand Challenge for End-to-End Driving:

Navigating the Grand Challenge This year's CVPR challenge asked participants to develop an end-to-end AV model, trained using the nuPlan dataset, to generate driving trajectory based on sensor data.

The models were submitted for testing inside the open-source NAVSIM simulator and were tasked with navigating thousands of scenarios they hadn't experienced yet. Model performance was scored based on metrics for safety, passenger comfort and deviation from the original recorded trajectory.

NVIDIA Research's winning end-to-end model ingests camera and lidar data, as well as the vehicle's trajectory history, to generate a safe, optimal vehicle path for five seconds post-sensor input.

The workflow NVIDIA researchers used to win the competition can be replicated in high-fidelity simulated environments with NVIDIA Omniverse. This means AV simulation developers can recreate the workflow in a physically accurate environment before testing their AVs in the real world. NVIDIA Omniverse Cloud Sensor RTX microservices will be available later this year. Sign up for early access.

In addition, NVIDIA ranked second for its submission to the CVPR Autonomous Grand Challenge for Driving with Language. NVIDIA's approach connects vision language models and autonomous driving systems, integrating the power of large language models to help make decisions and achieve generalizable, explainable driving behavior.

Learn More at CVPR More than 50 NVIDIA papers were accepted to this year's CVPR, on topics spanning automotive, healthcare, robotics and more. Over a dozen papers will cover NVIDIA automotive-related research, including:

Hydra-MDP: End-to-End Multimodal Planning With Multi-Target Hydra-Distillation

Winner of CVPR's End-to-End Driving at Scale challenge

Read the NVIDIA technical blog

Producing and Leveraging Online Map Uncertainty in Trajectory Prediction

CVPR best paper award finalist

Driving Everywhere With Large Language Model Policy Adaptation

See DRIVE Labs: LLM-Based Road Rules Guide Simplifies Driving

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

Improving Distant 3D Object Detection Using 2D Box Supervision

Dynamic LiDAR Resimulation Using Compositional Neural Fields

BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection

PARA-Drive: Parallelized Architecture for Real-Time Autonomous Driving

Sanja Fidler, vice president of AI research at NVIDIA, will speak on vision language models at the CVPR Workshop on Autonomous Driving.

Learn more about NVIDIA Research, a global team of hundreds of scientists and engineers focused on topics including AI, computer graphics, computer vision, self-driving cars and robotics.

See notice regarding software product information.
LINK: https://blogs.nvidia.com/blog/auto-research-cvpr-2024/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

16/06/2026

Perry Sook: Big Tech Poses 'Very Urgent Threat to Broadcast Stations

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

FIFA World Cup Delivers Record Ratings on Fox

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

AIMS Launches the Official IPMX Training Series Online

Free Program Supports IPMX Education from Foundational Concepts Through System and Network Design The Alliance for IP Media Solutions (AIMS) today announced t...

16/06/2026

Share your views on Screen Australia and the future of the industry

Share your views on Screen Australia and the future of the industry 15 June 2026 Your feedback matters. Following the instrumental insights provided in 2025,...

15/06/2026

University of South Carolina's Valerie Gerfin on Gamecock Productions' Growth, Upgrades at Williams-Brice Stadium

One of the more exciting internal video production divisions within a college at...

15/06/2026

Fox Corp. To Acquire Roku, Pairs Live Sports Powerhouse With Major CTV Platform

The deal valued at $22 Billion is expected to close in the first half of 2027...

15/06/2026

Golf Channel Mobile to Live Stream 2026 Arnold Palmer Cup Beginning July 13th

Golf Channel and the Arnold Palmer Cup have announced a partnership to livestream the 2026 Arnold Palmer Cup on Golf Channel Mobile and GolfChannel.com. The tou...

15/06/2026

TikTok and Panini Launch Digital Collectible Card Experience for FIFA World Cup 2026

TikTok and Panini have announced a partnership to bring a digital collectible ca...

15/06/2026

Cosm and Monster Energy Launch First Full-Dome Immersive Advertisement in Shared Reality Venues

Cosm and Monster Energy have announced the debut of the first full-dome immersiv...

15/06/2026

Fox Nation and Real American Freestyle Sign International Media Rights Deal

Real American Freestyle (RAF) and Fox Nation have announced an exclusive streaming agreement for three RAF international events, beginning with RAF Georgia on J...

15/06/2026

FanConnect and Extreme Networks Announce IPTV Integration for Large Venue Deployments

FanConnect has announced a partnership with Extreme Networks integrating FanConn...

15/06/2026

2026 Sundance Institute Ignite x Adobe Fellows Named

Ten Emerging Filmmakers Ages 18 to 25 Will Start Fellowship Year at Ignite Lab from June 14-19 LOS ANGELES, CA, June 15, 2026 - The nonprofit Sundance Institut...

15/06/2026

Rumble from UVI

Innovative three-band soft synth introduced UVI's latest synth takes an interesting approach to synthesis, offering a trio of synth engines that each op...

15/06/2026

Oram Awards 2026: Open call announcement

Applications now open for 2026 The Oram Awards have returned for 2026 to celebrate the unusual, unique and unfiltered creative worlds of women and gender-di...

15/06/2026

PSPaudioware release PSP Levelizer

New intelligent auto-fader plug-in revealed PSPaudioware's latest release offers automatic level adjustment and provides more detailed control than many...

15/06/2026

4.78M AUSSIES TUNE IN FOR SOCCEROOS WIN OVER TRKYE ON SBS

4.78M AUSSIES TUNE IN FOR SOCCEROOS WIN OVER T RK YE ON SBS 15 June, 2026 Media releases Match had a Total TV average audience of 3.035 million, with over ...

15/06/2026

SBS Head of Commissioning John Godfrey to depart after 18 years

SBS Head of Commissioning John Godfrey to depart after 18 years 15 June, 2026 Media releases SBS Head of Commissioning John Godfrey will depart the broadca...

15/06/2026

Greater Manchester Police installs Rohde & Schwarz security scanner for custody searches

Greater Manchester Police installs Rohde & Schwarz security scanner for custody ...

15/06/2026

The New Discovery Stack: AI, Metadata and Audience Intelligence

Insights from NAGRAVISION's latest industry webinar featuring One Hungary, Liberty Global and Media Press Group In this blog, Laura Rognoni explores the k...

15/06/2026

Clear-Com Introduces Avalon IP Intercom Platform

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

DoJ Approves Paramount Skydance, Warner Bros. Discovery Merger

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Clear-Com Introduces Avalon IP Station for Modern Communi...

Clear-Com has introduced Avalon , a purpose built 1RU IP intercom communication platform for modern networked production, designed to simplify and scale workfl...

15/06/2026

Fox Makes CTV Play with Roku Acquisition

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Gray Announces Plans to Expand Lansing, Mich. Broadcast HQ

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Richmond Flying Squirrels Raise the Bar for Live Baseball...

MiLB Club Deploys LDX 110 Cameras at CarMax Park to Deliver A New Standard in Engaging Fan Experience Grass Valley today announced that the Richmond Flying Sq...

15/06/2026

Detach from Direct-Attached: How Remote Editing with EVO Keeps Creative Teams Moving

Detach from Direct-Attached: How Remote Editing with EVO Keeps Creative Teams Mo...

15/06/2026

Techtel Completes Media Production Setup for a major AFL sporting organisation

Techtel Completes Media Production Setup for a major AFL sporting organisation Sports 15 June Written By Suzanne Costello (Sydney, Australia 15 June 2026)...

15/06/2026

Sky News takes viewers inside Minab in new film investigating primary school strike in Iran

Monday 15 June 2026 Sky News takes viewers inside Minab in new film investigati...

15/06/2026

Fox Corporation to Acquire Roku, Inc.

Fox Corporation to Acquire Roku, Inc. Combination Creates a Scaled Media and Technology Platform with Superior Reach, Engagement and Monetization Capability ...

14/06/2026

Detroit Drums from Iconic Instruments

Library captures 1960s R&B/pop drum sound Following on from their recent wave of plug-in effects, Iconic Instruments have just launched an all-new virtual d...

14/06/2026

HBO Comedy Rooster Shot with URSA Cine 17K 65

HBO Comedy Rooster Shot with URSA Cine 17K 65 Brie Clayton June 14, 2026 0 Comments Large format brings viewers intimately close to characters. Black...

13/06/2026

Rhythmic Filters for Devious Machines' Infiltrator

Latest expansion pack includes 252 presets Devious Machines have recently introduced another expansion for their powerful multi-effects plug-in, Infiltrator...

13/06/2026

MetaGrid Pro gains AI Builder

Create custom DAW/plug-in controllers using prompts MetaGrid have recently introduced an all-new AI Builder function to their touchscreen-based control surf...

13/06/2026

Spectrum Reach Taps Anoki AI for Contextual Intelligence

Share Copy link Facebook X Linkedin Bluesky Email...

13/06/2026

Google TV Launches Soccer Hub, New Voice Command Features

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

YES Network and Gotham Sports App to Air Seven Athletes Unlimited Softball League Games

YES Network and The Gotham Sports App will air seven Athletes Unlimited Softball...

12/06/2026

UFL to Feature FAST Innovation Suite at 2026 United Bowl

The United Football League will host its FAST Innovation Suite at the 2026 United Bowl presented by Credit One Bank on Saturday, June 13 at 3:00 p.m. ET at Audi...

12/06/2026

InfoComm 2026: PTZOptics and LayerJot to Demo AI-Driven Camera Control

PTZOptics and LayerJot will present live demonstrations at InfoComm 2026 showing how natural-language AI prompting, robotic camera control, and on-device comput...

12/06/2026

InfoComm 2026: MultiDyne to Debut VF-9100 Fiber Transport Platform and Crescendo Audio Monitor

MultiDyne Video and Fiber Optic Systems will exhibit at InfoComm 2026, featuring...

12/06/2026

Eurovision Services Deploys Ateme Software-Based Frame-Rate Conversion

Ateme has announced that Eurovision Services is using Ateme's software-based frame-rate conversion technology for international live event workflows. The de...

12/06/2026

Bitmovin, Simplestream, and Xperi Partner to Support OTT Services on TiVo OS

Bitmovin and Simplestream have announced a partnership with Xperi to simplify the launch of OTT streaming services on TiVo OS smart TVs and devices. The collabo...

12/06/2026

Net Insight Deploys Nimbra 520 and Nimbra Edge for Multinational Corporate Live Production Workflow

Net Insight has announced that a multinational technology company is deploying a...

12/06/2026

MLB Players Inc., Athletes First Announce Content Partnership

MLB Players Inc., the business arm of the MLB Players Association, has announced a partnership with Athletes First to develop and sell brand partnerships across...

12/06/2026

G&D and VuWall Announce CommandKeyboard-Advanced for Network-Independent Control Room Operations

Guntermann and Drunck (G&D) and VuWall have announced the CommandKeyboard-Advanc...

12/06/2026

Philadelphia Union and Comcast Deploy Smart Technology at Subaru Park and WSFS Bank Sportsplex

Comcast Smart Solutions announces a new smart technology deployment with Major L...

12/06/2026

Elevation Worship Completes First Leg of 2026 Tour Using SSL Live Consoles and New UMD192 Interface

Elevation Worship completed the initial leg of its Elevation Nights 2026 tour ...