
Making moves to accelerate self-driving car development, NVIDIA was today named an Autonomous Grand Challenge winner at the Computer Vision and Pattern Recognition (CVPR) conference, running this week in Seattle.
Building on last year's win in 3D Occupancy Prediction, NVIDIA Research topped the leaderboard this year in the End-to-End Driving at Scale category with its Hydra-MDP model, outperforming more than 400 entries worldwide.
This milestone shows the importance of generative AI in building applications for physical AI deployments in autonomous vehicle (AV) development. The technology can also be applied to industrial environments, healthcare, robotics and other areas.
The winning submission received CVPR's Innovation Award as well, recognizing NVIDIA's approach to improving any end-to-end driving model using learned open-loop proxy metrics.
In addition, NVIDIA announced NVIDIA Omniverse Cloud Sensor RTX, a set of microservices that enable physically accurate sensor simulation to accelerate the development of fully autonomous machines of every kind.
How End-to-End Driving Works
The race to develop self-driving cars isn't a sprint but more a never-ending triathlon, with three distinct yet crucial parts operating simultaneously: AI training, simulation and autonomous driving. Each requires its own accelerated computing platform, and together, the full-stack systems purpose-built for these steps form a powerful triad that enables continuous development cycles, always improving in performance and safety.
To accomplish this, a model is first trained on an AI supercomputer such as NVIDIA DGX. It's then tested and validated in simulation - using the NVIDIA Omniverse platform and running on an NVIDIA OVX system - before entering the vehicle, where, lastly, the NVIDIA DRIVE AGX platform processes sensor data through the model in real time.
Building an autonomous system to navigate safely in the complex physical world is extremely challenging. The system needs to perceive and understand its surrounding environment holistically, then make correct, safe decisions in a fraction of a second. This requires human-like situational awareness to handle potentially dangerous or rare scenarios.
AV software development has traditionally been based on a modular approach, with separate components for object detection and tracking, trajectory prediction, and path planning and control.
End-to-end autonomous driving systems streamline this process using a unified model to take in sensor input and produce vehicle trajectories, helping avoid overcomplicated pipelines and providing a more holistic, data-driven approach to handle real-world scenarios.
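The difference between the two approaches can be sketched as a difference in interfaces. The following is a minimal illustration only, not NVIDIA's Hydra-MDP implementation: `SensorFrame`, `modular_plan`, `end_to_end_plan` and `constant_velocity` are hypothetical names, and the constant-velocity stand-in (with an assumed 10 output waypoints) merely takes the place of a learned model so that the example runs.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Waypoint = Tuple[float, float]  # (x, y) position in metres; illustrative only


@dataclass
class SensorFrame:
    """Hypothetical container for one time step of inputs."""
    camera: List[float]      # stand-in for image data
    lidar: List[float]       # stand-in for point-cloud data
    history: List[Waypoint]  # recent ego positions, oldest first


def modular_plan(frame: SensorFrame,
                 detect: Callable, predict: Callable, plan: Callable) -> List[Waypoint]:
    """Modular approach: separate components chained by hand-designed interfaces."""
    objects = detect(frame)              # object detection and tracking
    futures = predict(objects)           # trajectory prediction for other agents
    return plan(frame.history, futures)  # path planning


def end_to_end_plan(frame: SensorFrame, model: Callable) -> List[Waypoint]:
    """End-to-end approach: one model maps sensor input directly to a trajectory."""
    return model(frame)


def constant_velocity(history: List[Waypoint], steps: int = 10) -> List[Waypoint]:
    """Toy planner (assumption): extrapolate the last observed displacement."""
    (x0, y0), (x1, y1) = history[-2], history[-1]
    dx, dy = x1 - x0, y1 - y0
    return [(x1 + dx * k, y1 + dy * k) for k in range(1, steps + 1)]


frame = SensorFrame(camera=[], lidar=[], history=[(0.0, 0.0), (1.0, 0.0)])

modular = modular_plan(
    frame,
    detect=lambda f: [],        # no objects detected in this toy frame
    predict=lambda objs: [],    # hence no predicted futures
    plan=lambda hist, futures: constant_velocity(hist),
)
e2e = end_to_end_plan(frame, model=lambda f: constant_velocity(f.history))
```

In the modular version, each stage sees only what the previous stage chose to pass along; in the end-to-end version, the single model has access to the full sensor frame when producing the trajectory.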
Navigating the Grand Challenge
This year's CVPR challenge asked participants to develop an end-to-end AV model, trained using the nuPlan dataset, to generate a driving trajectory based on sensor data.
The models were submitted for testing inside the open-source NAVSIM simulator and were tasked with navigating thousands of scenarios they hadn't experienced yet. Model performance was scored based on metrics for safety, passenger comfort and deviation from the original recorded trajectory.
NVIDIA Research's winning end-to-end model ingests camera and lidar data, as well as the vehicle's trajectory history, to generate a safe, optimal vehicle path for five seconds post-sensor input.
The workflow NVIDIA researchers used to win the competition can be replicated in high-fidelity simulated environments with NVIDIA Omniverse. This means AV simulation developers can recreate the workflow in a physically accurate environment before testing their AVs in the real world. NVIDIA Omniverse Cloud Sensor RTX microservices will be available later this year. Sign up for early access.
In addition, NVIDIA ranked second for its submission to the CVPR Autonomous Grand Challenge for Driving with Language. NVIDIA's approach connects vision language models and autonomous driving systems, integrating the power of large language models to help make decisions and achieve generalizable, explainable driving behavior.
Learn More at CVPR
More than 50 NVIDIA papers were accepted to this year's CVPR, on topics spanning automotive, healthcare, robotics and more. Over a dozen papers will cover NVIDIA automotive-related research, including:
Hydra-MDP: End-to-End Multimodal Planning With Multi-Target Hydra-Distillation (winner of CVPR's End-to-End Driving at Scale challenge; read the NVIDIA technical blog)
Producing and Leveraging Online Map Uncertainty in Trajectory Prediction (CVPR best paper award finalist)
Driving Everywhere With Large Language Model Policy Adaptation (see DRIVE Labs: LLM-Based Road Rules Guide Simplifies Driving)
Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
Improving Distant 3D Object Detection Using 2D Box Supervision
Dynamic LiDAR Resimulation Using Compositional Neural Fields
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection
PARA-Drive: Parallelized Architecture for Real-Time Autonomous Driving
Sanja Fidler, vice president of AI research at NVIDIA, will speak on vision language models at the CVPR Workshop on Autonomous Driving.
Learn more about NVIDIA Research, a global team of hundreds of scientists and engineers focused on topics including AI, computer graphics, computer vision, self-driving cars and robotics.