
NVIDIA Advances Physical AI at CVPR With Largest Indoor Synthetic Dataset

17/06/2024

NVIDIA contributed the largest ever indoor synthetic dataset to the Computer Vision and Pattern Recognition (CVPR) conference's annual AI City Challenge - helping researchers and developers advance the development of solutions for smart cities and industrial automation.

The challenge, which drew over 700 teams from nearly 50 countries, tasks participants with developing AI models that enhance operational efficiency in physical settings, such as retail and warehouse environments, and in intelligent traffic systems.

Teams tested their models on the datasets that were generated using NVIDIA Omniverse, a platform of application programming interfaces (APIs), software development kits (SDKs) and services that enable developers to build Universal Scene Description (OpenUSD)-based applications and workflows.

Creating and Simulating Digital Twins for Large Spaces

In large indoor spaces like factories and warehouses, daily activities involve a steady stream of people, small vehicles and future autonomous robots. Developers need solutions that can observe and measure activities, optimize operational efficiency, and prioritize human safety in complex, large-scale settings.

Researchers are addressing that need with computer vision models that can perceive and understand the physical world. These models can be used in applications like multi-camera tracking, in which a model tracks multiple entities within a given environment.

To ensure their accuracy, the models must be trained on large, ground-truth datasets for a variety of real-world scenarios. But collecting that data can be a challenging, time-consuming and costly process.

AI researchers are turning to physically based simulations - such as digital twins of the physical world - to enhance AI simulation and training. These virtual environments can help generate synthetic data used to train AI models. Simulation also provides a way to run a multitude of what-if scenarios in a safe environment while addressing privacy and AI bias issues.

Creating synthetic data is important for AI training because it offers a large, scalable, and expandable amount of data. Teams can generate a diverse set of training data by changing many parameters including lighting, object locations, textures and colors.
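The parameter variation described above can be sketched in a few lines of Python. This is a minimal illustration only, not the Omniverse Replicator API: the `SceneParams` fields, the value ranges and the texture names are all assumptions chosen for the example.

```python
import random
from dataclasses import dataclass


@dataclass
class SceneParams:
    """One randomized scene configuration (all fields are illustrative)."""
    light_intensity: float                # arbitrary units, assumed range
    object_position: tuple[float, float]  # x, y on the floor plane, in metres
    texture: str
    color: tuple[int, int, int]           # RGB, each channel 0-255


# Hypothetical texture names used only for this sketch.
TEXTURES = ["concrete", "wood", "metal", "tile"]


def sample_scene(rng: random.Random) -> SceneParams:
    """Draw one configuration by sampling every parameter independently."""
    return SceneParams(
        light_intensity=rng.uniform(200.0, 1500.0),
        object_position=(rng.uniform(0.0, 50.0), rng.uniform(0.0, 30.0)),
        texture=rng.choice(TEXTURES),
        color=(rng.randrange(256), rng.randrange(256), rng.randrange(256)),
    )


def generate_scenes(count: int, seed: int = 0) -> list[SceneParams]:
    """Generate `count` configurations reproducibly from a seed."""
    rng = random.Random(seed)
    return [sample_scene(rng) for _ in range(count)]
```

Calling `generate_scenes(1000, seed=42)` would yield 1,000 distinct configurations that a renderer could turn into labeled images; fixing the seed keeps a generated dataset reproducible.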

Building Synthetic Datasets for the AI City Challenge

This year's AI City Challenge consists of five computer vision challenge tracks that span traffic management to worker safety.

NVIDIA contributed datasets for the first track, Multi-Camera Person Tracking, which saw the highest participation, with over 400 teams. The challenge used a benchmark and the largest synthetic dataset of its kind - comprising 212 hours of 1080p videos at 30 frames per second spanning 90 scenes across six virtual environments, including a warehouse, retail store and hospital.
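To give a sense of scale, the figures above can be converted into a frame count. The sketch below assumes the 212 hours is the total video duration summed across all cameras and that every video runs at a constant 30 frames per second; the article does not state either explicitly.

```python
HOURS_OF_VIDEO = 212      # from the article
FRAMES_PER_SECOND = 30    # from the article
SECONDS_PER_HOUR = 3600


def total_frames(hours: int, fps: int) -> int:
    """Number of frames in `hours` of video recorded at a constant `fps`."""
    return hours * SECONDS_PER_HOUR * fps


frames = total_frames(HOURS_OF_VIDEO, FRAMES_PER_SECOND)
print(f"{frames:,} frames")  # 22,896,000 frames
```

Under those assumptions the dataset holds roughly 22.9 million 1080p frames.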

Created in Omniverse, these scenes simulated nearly 1,000 cameras and featured around 2,500 digital human characters. Generating the scenes in simulation also gave researchers a way to produce data of the right size and fidelity to achieve the desired outcomes.

The benchmarks were created using Omniverse Replicator in NVIDIA Isaac Sim, a reference application that enables developers to design, simulate and train AI for robots, smart spaces or autonomous machines in physically based virtual environments built on NVIDIA Omniverse.

Omniverse Replicator, an SDK for building synthetic data generation pipelines, automated many manual tasks involved in generating quality synthetic data, including domain randomization, camera placement and calibration, character movement, and semantic labeling of data and ground-truth for benchmarking.

Ten institutions and organizations are collaborating with NVIDIA for the AI City Challenge:

Australian National University, Australia

Emirates Center for Mobility Research, UAE

Indian Institute of Technology Kanpur, India

Iowa State University, U.S.

Johns Hopkins University, U.S.

National Yang Ming Chiao Tung University, Taiwan

Santa Clara University, U.S.

The United Arab Emirates University, UAE

University at Albany - SUNY, U.S.

Woven by Toyota, Japan

Driving the Future of Generative Physical AI

Researchers and companies around the world are developing infrastructure automation and robots powered by physical AI - models that can understand instructions and autonomously perform complex tasks in the real world.

Generative physical AI uses reinforcement learning in simulated environments, where it perceives the world using accurately simulated sensors, performs actions grounded by laws of physics, and receives feedback to reason about the next set of actions.
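The perceive, act and receive-feedback loop described above can be illustrated with a toy example. The sketch below is tabular Q-learning in a five-cell corridor, a deliberately tiny stand-in rather than NVIDIA's method: the environment, reward and hyperparameters are all assumptions, and the "sensor" is simply the agent's cell index.

```python
import random

N_STATES = 5        # corridor cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)  # index 0 = step left, index 1 = step right


def step(state: int, action_index: int) -> tuple[int, float, bool]:
    """Simulated environment: apply the action, return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q_row: list[float], rng: random.Random, epsilon: float) -> int:
    """Epsilon-greedy choice; ties between equal values are broken at random."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    best = max(q_row)
    return rng.choice([i for i, value in enumerate(q_row) if value == best])


def train(episodes: int = 500, alpha: float = 0.5, gamma: float = 0.9,
          epsilon: float = 0.2, seed: int = 0) -> list[list[float]]:
    """Learn action values from simulated experience."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(200):  # step cap so every episode terminates
            action = choose_action(q[state], rng, epsilon)       # act
            next_state, reward, done = step(state, action)       # perceive + feedback
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])  # learn
            state = next_state
            if done:
                break
    return q


def greedy_policy(q: list[list[float]]) -> list[int]:
    """Best action index for each non-goal cell."""
    return [max(range(len(ACTIONS)), key=lambda i: row[i]) for row in q[:-1]]
```

After `q = train()`, `greedy_policy(q)` returns the action index the agent prefers in each non-goal cell. A physical AI system replaces the corridor with a physics simulator, the cell index with simulated sensor data, and the table with a neural network, but the loop keeps the same shape.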

Developers can tap into developer SDKs and APIs, such as the NVIDIA Metropolis developer stack - which includes a multi-camera tracking reference workflow - to add enhanced perception capabilities for factories, warehouses and retail operations. And with the latest release of NVIDIA Isaac Sim, developers can supercharge robotics workflows by simulating and training AI-based robots in physically based virtual spaces before real-world deployment.

Researchers and developers are also combining high-fidelity, physics-based simulation with advanced AI to bridge the gap between simulated training and real-world application. This helps ensure that synthetic training environments closely mimic real-world conditions for more seamless robot deployment.

NVIDIA is taking the accuracy and scale of simulations further with the recently announced NVIDIA Omniverse Cloud Sensor RTX, a set of microservices that enable physically accurate sensor simulation to accelerate the development of fully autonomous machines.

This technology will allow autonomous systems, whether a factory, vehicle or robot, to gather essential data to effectively perceive, navigate and interact with the real world. Using these microservices, developers can run large-scale te
LINK: https://blogs.nvidia.com/blog/ai-city-challenge-omniverse-cvpr/...