Sony Pixel Power calrec Sony

NVIDIA Advances Physical AI at CVPR With Largest Indoor Synthetic Dataset

17/06/2024

NVIDIA contributed the largest ever indoor synthetic dataset to the Computer Vision and Pattern Recognition (CVPR) conference's annual AI City Challenge - helping researchers and developers advance the development of solutions for smart cities and industrial automation.

The challenge, garnering over 700 teams from nearly 50 countries, tasks participants to develop AI models to enhance operational efficiency in physical settings, such as retail and warehouse environments, and intelligent traffic systems.

Teams tested their models on the datasets that were generated using NVIDIA Omniverse, a platform of application programming interfaces (APIs), software development kits (SDKs) and services that enable developers to build Universal Scene Description (OpenUSD)-based applications and workflows.

Creating and Simulating Digital Twins for Large Spaces In large indoor spaces like factories and warehouses, daily activities involve a steady stream of people, small vehicles and future autonomous robots. Developers need solutions that can observe and measure activities, optimize operational efficiency, and prioritize human safety in complex, large-scale settings.

Researchers are addressing that need with computer vision models that can perceive and understand the physical world. It can be used in applications like multi-camera tracking, in which a model tracks multiple entities within a given environment.

To ensure their accuracy, the models must be trained on large, ground-truth datasets for a variety of real-world scenarios. But collecting that data can be a challenging, time-consuming and costly process.

AI researchers are turning to physically based simulations - such as digital twins of the physical world - to enhance AI simulation and training. These virtual environments can help generate synthetic data used to train AI models. Simulation also provides a way to run a multitude of what-if scenarios in a safe environment while addressing privacy and AI bias issues.

Creating synthetic data is important for AI training because it offers a large, scalable, and expandable amount of data. Teams can generate a diverse set of training data by changing many parameters including lighting, object locations, textures and colors.

Building Synthetic Datasets for the AI City Challenge This year's AI City Challenge consists of five computer vision challenge tracks that span traffic management to worker safety.

NVIDIA contributed datasets for the first track, Multi-Camera Person Tracking, which saw the highest participation, with over 400 teams. The challenge used a benchmark and the largest synthetic dataset of its kind - comprising 212 hours of 1080p videos at 30 frames per second spanning 90 scenes across six virtual environments, including a warehouse, retail store and hospital.

Created in Omniverse, these scenes simulated nearly 1,000 cameras and featured around 2,500 digital human characters. It also provided a way for the researchers to generate data of the right size and fidelity to achieve the desired outcomes.

The benchmarks were created using Omniverse Replicator in NVIDIA Isaac Sim, a reference application that enables developers to design, simulate and train AI for robots, smart spaces or autonomous machines in physically based virtual environments built on NVIDIA Omniverse.

Omniverse Replicator, an SDK for building synthetic data generation pipelines, automated many manual tasks involved in generating quality synthetic data, including domain randomization, camera placement and calibration, character movement, and semantic labeling of data and ground-truth for benchmarking.

Ten institutions and organizations are collaborating with NVIDIA for the AI City Challenge:

Australian National University, Australia

Emirates Center for Mobility Research, UAE

Indian Institute of Technology Kanpur, India

Iowa State University, U.S.

Johns Hopkins University, U.S.

National Yung-Ming Chiao-Tung University, Taiwan

Santa Clara University, U.S.

The United Arab Emirates University, UAE

University at Albany - SUNY, U.S.

Woven by Toyota, Japan

Driving the Future of Generative Physical AI Researchers and companies around the world are developing infrastructure automation and robots powered by physical AI - which are models that can understand instructions and autonomously perform complex tasks in the real world.

Generative physical AI uses reinforcement learning in simulated environments, where it perceives the world using accurately simulated sensors, performs actions grounded by laws of physics, and receives feedback to reason about the next set of actions.

Developers can tap into developer SDKs and APIs, such as the NVIDIA Metropolis developer stack - which includes a multi-camera tracking reference workflow - to add enhanced perception capabilities for factories, warehouses and retail operations. And with the latest release of NVIDIA Isaac Sim, developers can supercharge robotics workflows by simulating and training AI-based robots in physically based virtual spaces before real-world deployment.

Researchers and developers are also combining high-fidelity, physics-based simulation with advanced AI to bridge the gap between simulated training and real-world application. This helps ensure that synthetic training environments closely mimic real-world conditions for more seamless robot deployment.

NVIDIA is taking the accuracy and scale of simulations further with the recently announced NVIDIA Omniverse Cloud Sensor RTX, a set of microservices that enable physically accurate sensor simulation to accelerate the development of fully autonomous machines.

This technology will allow autonomous systems, whether a factory, vehicle or robot, to gather essential data to effectively perceive, navigate and interact with the real world. Using these microservices, developers can run large-scale te
LINK: https://blogs.nvidia.com/blog/ai-city-challenge-omniverse-cvpr/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

17/06/2026

Spectrum Awards $1.1 Million in Digital Education Grants

Share Copy link Facebook X Linkedin Bluesky Email...

17/06/2026

XR Sports Alliance Adds New Members

Share Copy link Facebook X Linkedin Bluesky Email...

17/06/2026

AIMS Launches Free Online IPMX Training Series

Share Copy link Facebook X Linkedin Bluesky Email...

17/06/2026

Kiloview Partners with SFM to Expand AV-over-IP Solutions...

Montr al, Quebec, June 11, 2026 Kiloview, a leading provider of AV-over-IP and NDI -based video transmission solutions, today announced a distribution partner...

17/06/2026

Kiloview Launches U4 IP Video Dock Bringing Professional...

Changsha, China, June 15, 2026 Kiloview officially announced the launch of U4 IP Video Dock, a compact IP video decoder and output dock designed to bring prof...

17/06/2026

June 16, 2026

Calibr-Skaggs awarded $5.1M by NIH to develop long-acting hepatitis B virus therapy A new program aims to replace a daily HBV drug with once-monthly or even qua...

16/06/2026

Thomson launches new learning App

Thomson's highly regarded expert-led online learning courses are now easier to access on the go via our new App. Available now on Google Play Store, the J...

16/06/2026

Neumann MT 48 Receives Major Firmware 2.0 Update

Neumann.Berlin has released firmware version 2.0 for the MT 48 audio interface, adding plugin compatibility, expanded Dante networking options, broadcast encode...

16/06/2026

TVNewsCheck Opens Nominations for 2027 Women in Technology Awards

TVNewsCheck has announced that nominations are now open for its 2027 Women in Technology Awards, to be presented at NAB Show 2027 on Tuesday, April 6 in the Med...

16/06/2026

Clear-Com Introduces Avalon IP Intercom Platform

Clear-Com has announced Avalon, a 1RU IP intercom platform for broadcast, live events, and production environments. Designed for IP-only workflows, Avalon suppo...

16/06/2026

SNS EVO Enables Remote and Distributed Video Editing Workflows

SNS has published a guide to remote video editing workflows using its EVO shared storage platform and companion tools, covering use cases ranging from home edit...

16/06/2026

Richmond Flying Squirrels Deploy Grass Valley LDX 110 Cameras at CarMax Park

Grass Valley has announced that the Richmond Flying Squirrels, a Minor League Baseball affiliate of the San Francisco Giants, have deployed five Grass Valley LD...

16/06/2026

AIMS Launches Free Official IPMX Training Series Online

The Alliance for IP Media Solutions (AIMS) has announced the launch of the Official IPMX Training Series, a free online program covering the design, configurati...

16/06/2026

Swerve Womens Sports Announces Distribution Deals with Fubo, Plex, Amazon Fire TV, and Anoki AI

Swerve TV has announced distribution agreements with Fubo, Plex, Amazon Fire TV,...

16/06/2026

ATP and TikTok Expand Global Content Partnership

ATP and TikTok have announced an expansion of their global content partnership, extending the ATP's TikTok hub powered by TikTok GamePlan to cover all nine ...

16/06/2026

FOX Sports Turns Los Angeles Pico Lot Into Its FIFA World Cup Production Nerve Center

Network's LA facility serves as the heart of a sprawling operation built to ...

16/06/2026

300+ Records a Day, 150 TB Daily, and a Relentless Content Avalanche: Inside FOX Sports' World Cup Media Engine

At Pico, the network's media-management team is supporting a flood of HBS fe...

16/06/2026

NHL Games Leaving CBC in Canada as Sublicense With Rogers Sportsnet Ends

The NHL will no longer air on CBC after the pulic broadcasters and national rights-holder Rogers Sportsnet were unable to come to agreement. After a successfu...

16/06/2026

SVG New Sponsor Spotlight: Virtual Eye's Ben Taylor on Making Live Sports More Valuable and Entertaining Through Data-Driven Graphics

As live sports broadcasters continue to seek new ways to make complex action mor...

16/06/2026

Thats BRISK, Baby! FOX Sports' Broadcast Remote IP Studio Kits Bring World Cup Fan Energy Back to Pico

Built with the 2026 FIFA World Cup in mind, these small but mighty IP-based tran...

16/06/2026

Rumble three-band soft synth by UVI

Boasts individual synths for each band UVI's latest synth takes an interesting approach to synthesis, offering a trio of synth engines that each operate...

16/06/2026

PSP Levelizer: auto level adjustment plug-in from PSPaudioware

New intelligent auto-fader plug-in unveiled PSPaudioware's latest release offers automatic level adjustment and provides more detailed control than many...

16/06/2026

The Crow Hill Company launch Crystal Pads

New performance-focused library announced Crystal Pads is the latest addition to The Crow Hill Company's ever-growing product range, and according to th...

16/06/2026

GForce launch official Prophet-5 soft synth

Developed in partnership with Sequential In recent years, GForce Software have branched into official emulations of classic hardware synths, delivering a ha...

16/06/2026

DT 30 IE: New in-ears from beyerdynamic

Designed specifically for live performance monitoring beyerdynamic's latest announcement sees the company introduce an affordable in-ear monitoring syst...

16/06/2026

Cherry Audio recreate the Ensoniq ESQ-1

Official emulation celebrates iconic synth's 40th anniversary Cherry Audio have just introduced Ensoniq ESQ-1, an official recreation of the 1986 polyph...

16/06/2026

Australians place growing trust in SBS News

Australians place growing trust in SBS News 16 June, 2026 Media releases SBS has been recognised as one of Australia's most trusted news providers, ran...

16/06/2026

Rohde & Schwarz achieves highest number of GCF validated 3GPP NR NTN test cases for RF, RRM and PCT domains

Rohde & Schwarz achieves highest number of GCF validated 3GPP NR NTN test cases ...

16/06/2026

Hitachi and PESA Announce Strategic Partnership to Drive Growth in Poland's Rail Market

Bydgoszcz to Become a Local Centre of Excellence for Advanced Rail Technologies....

16/06/2026

Chyron Unveils Chyron Weather 2.4

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

Historic Zhuque-3 Reusable Rocket Test Mission Captured with URSA Cine Immersive

Historic Zhuque-3 Reusable Rocket Test Mission Captured with URSA Cine Immersive Brie Clayton June 16, 2026 0 Comments Apple Immersive Video puts view...

16/06/2026

SMPTE Plans ST 2110 Education Summer Programs

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

Rise Awards Returns for 2026 to Celebrate Excellence in B...

Rise WIB, the award-winning advocacy group championing gender diversity and career progression across the broadcast and media technology industry, today announc...

16/06/2026

Limecraft Expands its Media Production Platform with Team...

Limecraft today announced the availability of Limecraft 2026.4, the fourth of eight planned platform releases this year. The update introduces Team-Based Access...

16/06/2026

Perry Sook: Big Tech Poses 'Very Urgent Threat to Broadcast Stations

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

FIFA World Cup Delivers Record Ratings on Fox

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

AIMS Launches the Official IPMX Training Series Online

Free Program Supports IPMX Education from Foundational Concepts Through System and Network Design The Alliance for IP Media Solutions (AIMS) today announced t...

16/06/2026

Share your views on Screen Australia and the future of the industry

Share your views on Screen Australia and the future of the industry 15 June 2026 Your feedback matters. Following the instrumental insights provided in 2025,...

16/06/2026

HPE AI Factory With NVIDIA Expands for the Era of Agents

Enterprises are moving agentic AI from proof of concept to production - and the next generation of AI factories are built for the era of agents. At HPE Discove...

16/06/2026

Coherent Breaks Ground on Expanded Texas Facility, Scaling AI's Optical Backbone

AI runs at the speed of light. More and more, that light is made in Texas. Cohe...

16/06/2026

Techtel Supports T-Motion RCCP-2A Controller Upgrade for Major Australian Broadcaster

Techtel Supports T-Motion RCCP-2A Controller Upgrade for Major Australian Broadc...

16/06/2026

Record audiences tune in for opening weekend of ICC Womens T20 World Cup 2026 on Sky Sports

Tuesday 16 June 2026 Record audiences tune in for opening weekend of ICC Women&...

16/06/2026

Fastest, Largest, Strongest: NVIDIA Blackwell Sweeps MLPerf Training 6.0

Every breakthrough AI model starts the same way: with a training run. The infrastructure running those training jobs shapes everything: how fast teams can itera...

15/06/2026

University of South Carolina's Valerie Gerfin on Gamecock Productions' Growth, Upgrades at Williams-Brice Stadium

One of the more exciting internal video production divisions within a college at...

15/06/2026

Fox Corp. To Acquire Roku, Pairs Live Sports Powerhouse With Major CTV Platform

The deal valued at $22 Billion is expected to close in the first half of 2027...