Sony Pixel Power calrec Sony

NVIDIA Advances Physical AI at CVPR With Largest Indoor Synthetic Dataset

17/06/2024

NVIDIA contributed the largest ever indoor synthetic dataset to the Computer Vision and Pattern Recognition (CVPR) conference's annual AI City Challenge - helping researchers and developers advance the development of solutions for smart cities and industrial automation.

The challenge, garnering over 700 teams from nearly 50 countries, tasks participants to develop AI models to enhance operational efficiency in physical settings, such as retail and warehouse environments, and intelligent traffic systems.

Teams tested their models on the datasets that were generated using NVIDIA Omniverse, a platform of application programming interfaces (APIs), software development kits (SDKs) and services that enable developers to build Universal Scene Description (OpenUSD)-based applications and workflows.

Creating and Simulating Digital Twins for Large Spaces In large indoor spaces like factories and warehouses, daily activities involve a steady stream of people, small vehicles and future autonomous robots. Developers need solutions that can observe and measure activities, optimize operational efficiency, and prioritize human safety in complex, large-scale settings.

Researchers are addressing that need with computer vision models that can perceive and understand the physical world. It can be used in applications like multi-camera tracking, in which a model tracks multiple entities within a given environment.

To ensure their accuracy, the models must be trained on large, ground-truth datasets for a variety of real-world scenarios. But collecting that data can be a challenging, time-consuming and costly process.

AI researchers are turning to physically based simulations - such as digital twins of the physical world - to enhance AI simulation and training. These virtual environments can help generate synthetic data used to train AI models. Simulation also provides a way to run a multitude of what-if scenarios in a safe environment while addressing privacy and AI bias issues.

Creating synthetic data is important for AI training because it offers a large, scalable, and expandable amount of data. Teams can generate a diverse set of training data by changing many parameters including lighting, object locations, textures and colors.

Building Synthetic Datasets for the AI City Challenge This year's AI City Challenge consists of five computer vision challenge tracks that span traffic management to worker safety.

NVIDIA contributed datasets for the first track, Multi-Camera Person Tracking, which saw the highest participation, with over 400 teams. The challenge used a benchmark and the largest synthetic dataset of its kind - comprising 212 hours of 1080p videos at 30 frames per second spanning 90 scenes across six virtual environments, including a warehouse, retail store and hospital.

Created in Omniverse, these scenes simulated nearly 1,000 cameras and featured around 2,500 digital human characters. It also provided a way for the researchers to generate data of the right size and fidelity to achieve the desired outcomes.

The benchmarks were created using Omniverse Replicator in NVIDIA Isaac Sim, a reference application that enables developers to design, simulate and train AI for robots, smart spaces or autonomous machines in physically based virtual environments built on NVIDIA Omniverse.

Omniverse Replicator, an SDK for building synthetic data generation pipelines, automated many manual tasks involved in generating quality synthetic data, including domain randomization, camera placement and calibration, character movement, and semantic labeling of data and ground-truth for benchmarking.

Ten institutions and organizations are collaborating with NVIDIA for the AI City Challenge:

Australian National University, Australia

Emirates Center for Mobility Research, UAE

Indian Institute of Technology Kanpur, India

Iowa State University, U.S.

Johns Hopkins University, U.S.

National Yung-Ming Chiao-Tung University, Taiwan

Santa Clara University, U.S.

The United Arab Emirates University, UAE

University at Albany - SUNY, U.S.

Woven by Toyota, Japan

Driving the Future of Generative Physical AI Researchers and companies around the world are developing infrastructure automation and robots powered by physical AI - which are models that can understand instructions and autonomously perform complex tasks in the real world.

Generative physical AI uses reinforcement learning in simulated environments, where it perceives the world using accurately simulated sensors, performs actions grounded by laws of physics, and receives feedback to reason about the next set of actions.

Developers can tap into developer SDKs and APIs, such as the NVIDIA Metropolis developer stack - which includes a multi-camera tracking reference workflow - to add enhanced perception capabilities for factories, warehouses and retail operations. And with the latest release of NVIDIA Isaac Sim, developers can supercharge robotics workflows by simulating and training AI-based robots in physically based virtual spaces before real-world deployment.

Researchers and developers are also combining high-fidelity, physics-based simulation with advanced AI to bridge the gap between simulated training and real-world application. This helps ensure that synthetic training environments closely mimic real-world conditions for more seamless robot deployment.

NVIDIA is taking the accuracy and scale of simulations further with the recently announced NVIDIA Omniverse Cloud Sensor RTX, a set of microservices that enable physically accurate sensor simulation to accelerate the development of fully autonomous machines.

This technology will allow autonomous systems, whether a factory, vehicle or robot, to gather essential data to effectively perceive, navigate and interact with the real world. Using these microservices, developers can run large-scale te
LINK: https://blogs.nvidia.com/blog/ai-city-challenge-omniverse-cvpr/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

07/04/2026

NAB 2026: Ikegami USA To Launch Two New Viewfinders, Refine UHD Cameras

Ikegami USA will launch a refinement to the UHK-X700 and UHK-X750 3-CMOS -in. UHD cameras in the UNICAM XE series plus two new 7-in. viewfinders at NAB 2026 in...

07/04/2026

NAB 2026: NAB Leadership Foundation to Host Technology Students Career Mixer

The NAB Leadership Foundation and NAB PILOT will host a career mixer for technology students at NAB Show 2026 on April 18, 5-6:30 p.m., North Hall, Room N225/22...

07/04/2026

Survey: AVoIP Adoption Accelerating, With Interoperability and Security as Key Drivers

Audinate Group Limited and Futuresource Consulting have published results from a...

07/04/2026

NAB 2026: Eluvio Announces Commercial Availability of Content Fabric Bucharest Release

Eluvio has announced the commercial availability of its Content Fabric Bucharest...

07/04/2026

NAB 2026: Eluvio Introduces Inline AI Video Intelligence and Updated EVIE

Eluvio has unveiled a new architecture for video AI and an updated Eluvio Video Intelligence Editor (EVIE) ahead of NAB Show 2026. Eluvio AI runs analysis and i...

07/04/2026

Professional Fighters League Partners With Sky New Zealand for Exclusive Broadcast Rights

The Professional Fighters League (PFL) has announced a deal with Sky New Zealand...

07/04/2026

NBA, Enjoy Basketball' To Produce Live Game Altcast, Enjoy the NBA Trivia Show as Part of New Multi-Platform Collab

The NBA and Enjoy Basketball, the digital media company co-founded by YouTube cr...

07/04/2026

Beyond Golf: PGA of America, Ko-Mar Productions Offer Clients a Customizable Production Studio

The 4,000-sq.-ft. space in Frisco, TX, has produced live and packaged programmin...

07/04/2026

A New POV: RefCam's Rise in the Bundesliga Signals a Potential New Era for Soccer Broadcasts

What began as a referee training tool is evolving into a powerful production ass...

07/04/2026

NAB 2026: Manifold Technologies to Join NEP Platform as Deployable Application

Manifold Technologies will announce at NAB Show 2026 (Booth C.1808) that its manifold CLOUD platform will be available as a deployable application within NEP Pl...

07/04/2026

Shaquille O'Neal and TNT Sports to Launch DUNKMAN Professional Dunk League in Summer 2026

Shaquille O'Neal, Authentic Brands Group, and TNT Sports, in partnership wit...

07/04/2026

NAB 2026: Telos Alliance Unveils Omnia XII Audio Processor

Telos Alliance will debut the Omnia XII, a new FM/HD/DAB audio processor, at NAB Show 2026 in Las Vegas. Built on a 2RU hardware platform, Omnia XII features a...

07/04/2026

NAB 2026: AWS to Showcase AI and Cloud Media Technologies

Amazon Web Services (AWS) will exhibit at NAB Show 2026 (April 18-22, Las Vegas Convention Center, Booth W1701), with demonstrations, speaking sessions, and int...

07/04/2026

NAB 2026: Synamedia to Demonstrate AI by Quortex

Synamedia will demonstrate AI by Quortex at NAB Show 2026, a framework that applies AI capabilities to video workflows on demand rather than continuously. The s...

07/04/2026

NAB 2026: TVNewsCheck Announces 2026 Women in Technology Award Honorees

TVNewsCheck will present its 15th annual Women in Technology Awards on Tuesday, April 21, at 5 p.m. PT at NAB Show, in the Media and Entertainment Theater (W146...

07/04/2026

NAB 2026: Akta to Showcase AI Video Platform

Akta will demonstrate its AI video platform at NAB Show 2026, highlighting new capabilities in media processing and vertical video formatting alongside its exis...

07/04/2026

ESPN Expands to Disney+ in Europe and Select Asia-Pacific Markets

ESPN and Disney have announced the launch of ESPN on Disney in Europe and select Asia-Pacific markets, bringing the offering to 53 countries and territories a...

07/04/2026

Announcing the 2026 Sundance Institute Native Lab Fellows

LOS ANGELES, CA, April 7, 2026 - The nonprofit Sundance Institute announced today the fellows selected for the 2026 Native Lab, the signature initiative of the ...

07/04/2026

Prompted Playlist Levels Up to Include Podcasts, Helping You Explore More Interests and Curiosities

Starting today, Prompted Playlist is expanding beyond music to now include podca...

07/04/2026

VSL release Synchron Harpsichord (Blanchet)

Launched alongside piano promotion The latest instalment in VSL's Synchron Series line-up captures the sound of a faithful copy of a Fran ois tienne Bl...

07/04/2026

Roland announce SPD-SX Pro Version 2.0

Flagship sampling pad gets an update Roland's flagship sampling pad has just received a major update that kits it out with an array of new features and ...

07/04/2026

Sound Devices unveil the Astral Mini Plus

Popular compact wireless system upgraded Sound Devices have recently introduced a new and improved version of their compact wireless transmitter bodypacks, ...

07/04/2026

YEP November 2025 Newsletter

The November 2025 YEP Newsletter highlights a recent YEP Coffee Chat, offering members the chance to connect with industry professionals in an informal setting ...

07/04/2026

YEP December 2025 Newsletter

The December 2025 YEP Newsletter includes a Spotlight on Emily Vail, showcasing her career journey and work in the industry, alongside Mentorship Reflections by...

07/04/2026

US Space Force Selects L3Harris to Strengthen America's Defense with Advanced Space Surveillance

Ground-Based Electro-Optical Deep Space Surveillance (GEODSS) telescope operated...

07/04/2026

Haivision Unveils Makito ONE Live Video Contribution Platform

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Neutrik To Unveil TRUE1 Data Connector Series At 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Chris Welcker Deployed Full DPA Arsenal to Record Live Mu...

Catgut Sound Owner and Production Sound Mixer Chris Welcker, CAS, has built a career at the intersection of music and film. A former musician and composer, Welc...

07/04/2026

SDVI Launches Next Generation Rally Platform to Give Medi...

SDVI Corporation today announced the next generation of its Rally media supply chain management platform, introducing a redesigned orchestration engine that rep...

07/04/2026

Avid to Debut Avid Content Core on AWS at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Frequency Launches Smarter Ways to Operate Streaming Chan...

Frequency, the engine behind many of the world's leading streaming television channels, at NAB 2026 will be launching new Studio services to help content ow...

07/04/2026

Ikegami to Introduce Expanded Range of Broadcast Producti...

Ikegami USA has chosen NAB 2026 in Las Vegas as the launch platform for new additions to its range of broadcast-quality television production equipment. These w...

07/04/2026

Kiloview Advancing Broadcast IP Workflows with a Smarter...

April 19, 2026, Las Vegas Kiloview, an innovative provider of AV-over-IP technologies, will showcase its latest broadcast IP solutions at NAB 2026, presenting...

07/04/2026

Bitmovin Adds Support for SGAI in its Playback Products t...

Bitmovin has announced support for Server-Guided Ad Insertion (SGAI) across its playback products using HLS interstitials, enabling more advanced ad-supported s...

07/04/2026

Synamedia unveils AI by Quortex - a just-in-time AI-plugi...

Synamedia is unveiling AI by Quortex at The NAB show, a just-in-time AI plugin framework that applies intelligence only when needed across video processing, dis...

07/04/2026

Cuez Brings Four New Innovations to NAB 2026 From Story-C...

Cuez will showcase four additions to its cloud-based newsroom, rundown and automation platform at NAB Show 2026 (April 18 22, Las Vegas, Booth N1867): Cuez ...

07/04/2026

Barix Extends Transport Options for Multi-Engine IP Encoder

Barix Extends Transport Options for Multi-Engine IP Encoder Brie Clayton April 7, 2026 0 Comments New for NAB, Barix adds SRT and RIST support to Mult...

07/04/2026

Elite Media Technologies Selects Interra Systems' BATON File-Based QC Solution

Elite Media Technologies Selects Interra Systems' BATON File-Based QC Soluti...

07/04/2026

Tightrope Media Systems to Debut Cablecast LiveBridge for Simultaneous Streaming at NAB 2026

Tightrope Media Systems to Debut Cablecast LiveBridge for Simultaneous Streaming...

07/04/2026

Cuez Brings Four New Innovations to NAB 2026: From Story-Centric Newsroom to Open AI Agent Framework

Cuez Brings Four New Innovations to NAB 2026: From Story-Centric Newsroom to Ope...

07/04/2026

ASG Names Andrea Cummis VP of Systems Engineering

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

KTVJ Completes Major Signal Upgrade

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Hearst's WDSU to Air Million Dollar Rodeo Competition

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Grass Valley Launches Future Playmakers Program

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Saranyu Technologies Launches MATCH - Multi-View Sports S...

Designed for synchronized multi-stream playback, low-latency delivery, and real-time analytics, MATCH introduces a unified viewing experience for sports broadca...