Sony Pixel Power calrec Sony

NVIDIA Advances Physical AI at CVPR With Largest Indoor Synthetic Dataset

17/06/2024

NVIDIA contributed the largest ever indoor synthetic dataset to the Computer Vision and Pattern Recognition (CVPR) conference's annual AI City Challenge - helping researchers and developers advance the development of solutions for smart cities and industrial automation.

The challenge, garnering over 700 teams from nearly 50 countries, tasks participants to develop AI models to enhance operational efficiency in physical settings, such as retail and warehouse environments, and intelligent traffic systems.

Teams tested their models on the datasets that were generated using NVIDIA Omniverse, a platform of application programming interfaces (APIs), software development kits (SDKs) and services that enable developers to build Universal Scene Description (OpenUSD)-based applications and workflows.

Creating and Simulating Digital Twins for Large Spaces In large indoor spaces like factories and warehouses, daily activities involve a steady stream of people, small vehicles and future autonomous robots. Developers need solutions that can observe and measure activities, optimize operational efficiency, and prioritize human safety in complex, large-scale settings.

Researchers are addressing that need with computer vision models that can perceive and understand the physical world. It can be used in applications like multi-camera tracking, in which a model tracks multiple entities within a given environment.

To ensure their accuracy, the models must be trained on large, ground-truth datasets for a variety of real-world scenarios. But collecting that data can be a challenging, time-consuming and costly process.

AI researchers are turning to physically based simulations - such as digital twins of the physical world - to enhance AI simulation and training. These virtual environments can help generate synthetic data used to train AI models. Simulation also provides a way to run a multitude of what-if scenarios in a safe environment while addressing privacy and AI bias issues.

Creating synthetic data is important for AI training because it offers a large, scalable, and expandable amount of data. Teams can generate a diverse set of training data by changing many parameters including lighting, object locations, textures and colors.

Building Synthetic Datasets for the AI City Challenge This year's AI City Challenge consists of five computer vision challenge tracks that span traffic management to worker safety.

NVIDIA contributed datasets for the first track, Multi-Camera Person Tracking, which saw the highest participation, with over 400 teams. The challenge used a benchmark and the largest synthetic dataset of its kind - comprising 212 hours of 1080p videos at 30 frames per second spanning 90 scenes across six virtual environments, including a warehouse, retail store and hospital.

Created in Omniverse, these scenes simulated nearly 1,000 cameras and featured around 2,500 digital human characters. It also provided a way for the researchers to generate data of the right size and fidelity to achieve the desired outcomes.

The benchmarks were created using Omniverse Replicator in NVIDIA Isaac Sim, a reference application that enables developers to design, simulate and train AI for robots, smart spaces or autonomous machines in physically based virtual environments built on NVIDIA Omniverse.

Omniverse Replicator, an SDK for building synthetic data generation pipelines, automated many manual tasks involved in generating quality synthetic data, including domain randomization, camera placement and calibration, character movement, and semantic labeling of data and ground-truth for benchmarking.

Ten institutions and organizations are collaborating with NVIDIA for the AI City Challenge:

Australian National University, Australia

Emirates Center for Mobility Research, UAE

Indian Institute of Technology Kanpur, India

Iowa State University, U.S.

Johns Hopkins University, U.S.

National Yung-Ming Chiao-Tung University, Taiwan

Santa Clara University, U.S.

The United Arab Emirates University, UAE

University at Albany - SUNY, U.S.

Woven by Toyota, Japan

Driving the Future of Generative Physical AI Researchers and companies around the world are developing infrastructure automation and robots powered by physical AI - which are models that can understand instructions and autonomously perform complex tasks in the real world.

Generative physical AI uses reinforcement learning in simulated environments, where it perceives the world using accurately simulated sensors, performs actions grounded by laws of physics, and receives feedback to reason about the next set of actions.

Developers can tap into developer SDKs and APIs, such as the NVIDIA Metropolis developer stack - which includes a multi-camera tracking reference workflow - to add enhanced perception capabilities for factories, warehouses and retail operations. And with the latest release of NVIDIA Isaac Sim, developers can supercharge robotics workflows by simulating and training AI-based robots in physically based virtual spaces before real-world deployment.

Researchers and developers are also combining high-fidelity, physics-based simulation with advanced AI to bridge the gap between simulated training and real-world application. This helps ensure that synthetic training environments closely mimic real-world conditions for more seamless robot deployment.

NVIDIA is taking the accuracy and scale of simulations further with the recently announced NVIDIA Omniverse Cloud Sensor RTX, a set of microservices that enable physically accurate sensor simulation to accelerate the development of fully autonomous machines.

This technology will allow autonomous systems, whether a factory, vehicle or robot, to gather essential data to effectively perceive, navigate and interact with the real world. Using these microservices, developers can run large-scale te
LINK: https://blogs.nvidia.com/blog/ai-city-challenge-omniverse-cvpr/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

21/04/2026

Nielsen data shows Australian outdoor and sport retailers are changing how they advertise to win over outdoor enthusiasts

Advertising strategies shift as competition grows for a large, active and qualit...

21/04/2026

ATSC Celebrates 3.0's Global Expansion

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Cinematic Feel Makes Survivor' Built to Last

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Live Event Technology Expands Fan Engagement

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

MS NOW Uses Community to Build Up Its Brand

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Why Broadcast Is Well-Positioned to Safeguard Freedom of Speech

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

AWS Demos AI Tools to Deliver Vertical Video

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Video Podcasting Leaps in Popularity

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Audio Systems Get Boost From Cloud and AI

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration with Streamcake

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration w...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse...

21/04/2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026 Melanie Ciotti April 21, 2026 0 Comments Unlimited processing, no cloud depe...

21/04/2026

IBC appoints Tim Banham as Chief Commercial Officer to dr...

London, 21 April 2026 IBC today announced the appointment of Tim Banham as its first Chief Commercial Officer (CCO), a newly created role that reflects the or...

21/04/2026

Motion Design Tools - April 2026

Motion Design Tools - April 2026 Roland Kahlenberg April 21, 2026 0 Comments Within 2 days, Maxon and Canva announced pro-level motion design apps - A...

21/04/2026

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and XR

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and...

21/04/2026

Diversified Appoints Tyler Affolter Chief Revenue Officer

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

TV Azteca to Bring Dolby Atmos to Free-To-Air TV in Mexico

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Maxon Announces Free Tools and Mobile Expansion of ZBrush...

Cinema 4D brings professional 3D workflows to iPad. The return of Autograph now free for individual users. ZBrush expands to Windows on Arm. See it all at NAB...

21/04/2026

Bitfocus improves availability, security and user managem...

Software version 1.6 extends enterprise functionality to place Buttons at the heart of media operations at any scale Bitfocus, the Norwegian software develope...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2...

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows Compact, multi-function stan...

21/04/2026

Applications open for 2026 AISF and Screen Australia Writer/Director Virtual Sessions

Applications open for 2026 AISF and Screen Australia Writer/Director Virtual Ses...

21/04/2026

Comscore Continues Building Momentum in Local Measurement Through New Agreements with More than 15 Clients

Comscore Continues Building Momentum in Local Measurement Through New Agreements...

21/04/2026

Cultivating creativity: Super Garden is back for another season

Summer is nearly here and Super Garden is returning to our screens to spark some gardening inspiration. The new series kicks off on Thursday 23 April at 7pm on ...

20/04/2026

Live From NAB 2026: Sonys Hugo Gaggioni Highlights HDR Advances, Software-Defined Workflows

At the 2026 NAB Show, Sony is showcasing a broad slate of innovations across liv...

20/04/2026

Live From NAB 2026: Fujinons Stosh Durbacz on Expanding the 4K Broadcast Lens Lineup With New Portable Zooms, 94x Box Lens

Fujifilm is sharpening its focus on core broadcast production with a new wave of...

20/04/2026

Live From NAB 2026: Rock-It Sports' John Walberg on Powering Logistics, Shipping for the 2026 FIFA Men's World Cup

This upcoming summer in North America is going to be a busy one. The 2026 FIFA M...

20/04/2026

NAB 2026: Glookast outlines product updates including Media Producer UX, connectors and Premiere Pro panel

Glookast (Booth W1661) announced a series of product updates at NAB Show 2026, c...

20/04/2026

NAB 2026: Matrox Video and Amagi collaborate on cloud-based broadcast workflows using ORIGIN framework

Matrox Video and Amagi announced a collaboration to integrate the Matrox ORIGIN ...

20/04/2026

NAB 2026: Riedel SimplyLive supports expanded centralised VAR system for Argentina football league

Riedel Communications (Booth C4908) announced that the Asociaci n del F tbol Arg...

20/04/2026

NAB 2026: Ikegami introduces VFE-P07D OLED viewfinder with integrated LCD monitor

Ikegami (Booth C3819) announced the VFE-P07D monocular OLED viewfinder at NAB Sh...

20/04/2026

NAB 2026: IABM rebrands as IAMT and launches AI discovery platform and global alliance

International Association of MediaTech (IAMT), formerly known as IABM, announced...

20/04/2026

NAB 2026: Harmonic supports DIRECTV DTH platform upgrade with VOS Media Software

Harmonic (Booth W2831) announced that DIRECTV is updating its US direct-to-home (DTH) video platform using Harmonic's VOS Media Software. The deployment is...

20/04/2026

NAB 2026: Wasabi Technologies acquires Seagate Lyve Cloud business

Wasabi Technologies announced that it has acquired the Lyve Cloud business from Seagate Technology. As part of the agreement, Seagate received equity in Wasabi ...

20/04/2026

NAB 2026: EVS introduces Choreon robotics orchestration platform for unified production control

EVS (Booth N1841) has launched Choreon, a robotics controller for media producti...

20/04/2026

SportsTechBuzz at NAB 2026, Day 2: Live Reports From the Show Floor in Vegas

The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...

20/04/2026

NAB 2026: Skyline Communications launches DataMiner packages on Grass Valley AMPP App Store

Skyline Communications announced the availability of its DataMiner xOps platform...

20/04/2026

NAB 2026: SNS launches Outpost, Trio and AI Suite for connected post-production workflows

Studio Network Solutions (Booth N1129) introduced a set of new products at NAB S...

20/04/2026

NAB 2026: Dell Technologies and NVIDIA present AI data platform for media workflows

Dell Technologies is showcasing its Dell AI Data Platform with NVIDIA at NAB Sho...

20/04/2026

NAB 2026: Blackmagic Design Announces Fairlight Live Software Audio Mixer

Blackmagic Design has announced Fairlight Live, a software-based live audio mixer with SMPTE 2110 support and spatial audio mixing. A public beta is available n...

20/04/2026

Live From NAB 2026: Imagine Comms Jimbo Haneklau Talks Prismon, Hybrid IP/SDI Workflows, and Cloud Playout

At the 2026 NAB Show in Las Vegas, Imagine Communications VP of Sales, Sports an...

20/04/2026

Live From NAB 2026: LiveUs Phillip Broaddus on LU900Q Launch, Nexus Cloud Platform, and REMI Growth

At the 2026 NAB Show in Las Vegas, LiveU Senior Director of Sales, Sports Philli...

20/04/2026

3 New Ways to Dive Deeper Into the Music You Love

A song that perfectly captures a moment is magic. But when you uncover the story behind it, who made it, what inspired it, and the meaning woven into the lyrics...

20/04/2026

Deity Microphones announce the PR-4

Ultra-compact 32-bit recorder set for launch Deity Microphones will soon be launching a new 32-bit six-track recorder that's been designed with producti...