Sony Pixel Power calrec Sony

NVIDIA Advances Physical AI at CVPR With Largest Indoor Synthetic Dataset

17/06/2024

NVIDIA contributed the largest ever indoor synthetic dataset to the Computer Vision and Pattern Recognition (CVPR) conference's annual AI City Challenge - helping researchers and developers advance the development of solutions for smart cities and industrial automation.

The challenge, garnering over 700 teams from nearly 50 countries, tasks participants to develop AI models to enhance operational efficiency in physical settings, such as retail and warehouse environments, and intelligent traffic systems.

Teams tested their models on the datasets that were generated using NVIDIA Omniverse, a platform of application programming interfaces (APIs), software development kits (SDKs) and services that enable developers to build Universal Scene Description (OpenUSD)-based applications and workflows.

Creating and Simulating Digital Twins for Large Spaces In large indoor spaces like factories and warehouses, daily activities involve a steady stream of people, small vehicles and future autonomous robots. Developers need solutions that can observe and measure activities, optimize operational efficiency, and prioritize human safety in complex, large-scale settings.

Researchers are addressing that need with computer vision models that can perceive and understand the physical world. It can be used in applications like multi-camera tracking, in which a model tracks multiple entities within a given environment.

To ensure their accuracy, the models must be trained on large, ground-truth datasets for a variety of real-world scenarios. But collecting that data can be a challenging, time-consuming and costly process.

AI researchers are turning to physically based simulations - such as digital twins of the physical world - to enhance AI simulation and training. These virtual environments can help generate synthetic data used to train AI models. Simulation also provides a way to run a multitude of what-if scenarios in a safe environment while addressing privacy and AI bias issues.

Creating synthetic data is important for AI training because it offers a large, scalable, and expandable amount of data. Teams can generate a diverse set of training data by changing many parameters including lighting, object locations, textures and colors.

Building Synthetic Datasets for the AI City Challenge This year's AI City Challenge consists of five computer vision challenge tracks that span traffic management to worker safety.

NVIDIA contributed datasets for the first track, Multi-Camera Person Tracking, which saw the highest participation, with over 400 teams. The challenge used a benchmark and the largest synthetic dataset of its kind - comprising 212 hours of 1080p videos at 30 frames per second spanning 90 scenes across six virtual environments, including a warehouse, retail store and hospital.

Created in Omniverse, these scenes simulated nearly 1,000 cameras and featured around 2,500 digital human characters. It also provided a way for the researchers to generate data of the right size and fidelity to achieve the desired outcomes.

The benchmarks were created using Omniverse Replicator in NVIDIA Isaac Sim, a reference application that enables developers to design, simulate and train AI for robots, smart spaces or autonomous machines in physically based virtual environments built on NVIDIA Omniverse.

Omniverse Replicator, an SDK for building synthetic data generation pipelines, automated many manual tasks involved in generating quality synthetic data, including domain randomization, camera placement and calibration, character movement, and semantic labeling of data and ground-truth for benchmarking.

Ten institutions and organizations are collaborating with NVIDIA for the AI City Challenge:

Australian National University, Australia

Emirates Center for Mobility Research, UAE

Indian Institute of Technology Kanpur, India

Iowa State University, U.S.

Johns Hopkins University, U.S.

National Yung-Ming Chiao-Tung University, Taiwan

Santa Clara University, U.S.

The United Arab Emirates University, UAE

University at Albany - SUNY, U.S.

Woven by Toyota, Japan

Driving the Future of Generative Physical AI Researchers and companies around the world are developing infrastructure automation and robots powered by physical AI - which are models that can understand instructions and autonomously perform complex tasks in the real world.

Generative physical AI uses reinforcement learning in simulated environments, where it perceives the world using accurately simulated sensors, performs actions grounded by laws of physics, and receives feedback to reason about the next set of actions.

Developers can tap into developer SDKs and APIs, such as the NVIDIA Metropolis developer stack - which includes a multi-camera tracking reference workflow - to add enhanced perception capabilities for factories, warehouses and retail operations. And with the latest release of NVIDIA Isaac Sim, developers can supercharge robotics workflows by simulating and training AI-based robots in physically based virtual spaces before real-world deployment.

Researchers and developers are also combining high-fidelity, physics-based simulation with advanced AI to bridge the gap between simulated training and real-world application. This helps ensure that synthetic training environments closely mimic real-world conditions for more seamless robot deployment.

NVIDIA is taking the accuracy and scale of simulations further with the recently announced NVIDIA Omniverse Cloud Sensor RTX, a set of microservices that enable physically accurate sensor simulation to accelerate the development of fully autonomous machines.

This technology will allow autonomous systems, whether a factory, vehicle or robot, to gather essential data to effectively perceive, navigate and interact with the real world. Using these microservices, developers can run large-scale te
LINK: https://blogs.nvidia.com/blog/ai-city-challenge-omniverse-cvpr/...
See more stories from nvidia

North America Stories

22/04/2026

Bolin Demos New PTZ Cameras and Controller at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

22/04/2026

Anchor Audio Launches Beacon 3

Share Copy link Facebook X Linkedin Bluesky Email...

22/04/2026

FCC Grants WSWB TV License Transfer to Sinclair

Share Copy link Facebook X Linkedin Bluesky Email...

22/04/2026

Telemundo Puerto Rico Streaming Channel Launches On Prime Video

Share Copy link Facebook X Linkedin Bluesky Email...

22/04/2026

Chyron Announces PRIME Translate

Share Copy link Facebook X Linkedin Bluesky Email...

22/04/2026

TV Tech Announces Winners of Best of Show Awards at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Live From NAB 2026: BitFires Colin Bonzey on Growing Spark Platform for Live Cloud-Based Productions

Cloud-based production isnt going anywhere, and BitFire is doubling down by prov...

21/04/2026

Live From NAB 2026: AWSs Jason Dvorkin, Regina Rossi on Driving Innovation With Al-Based Workflows

The topic of artificial intelligence has a stranglehold on the sports-video-prod...

21/04/2026

Live From NAB 2026: T-Mobile for Business' Jason Schnellbacher on Enhancing 5G for Sports Fans, Broadcasters

5G is still a hot topic in live event production, and this workflow continues to...

21/04/2026

Live From NAB 2026: Appears Ed McGivern on Fox Sports Deal, New XM Platform, and VX Software Debut

At the 2026 NAB Show, Ed McGivern, GM and President of Appear US, discusses the ...

21/04/2026

NAB 2026: Studio Network Solutions launches on-premise AI suite for media production workflows

Studio Network Solutions (SNS) has announced an on-premise AI suite designed for...

21/04/2026

NAB 2026: Suite Studios integrates file-streaming technology into Frame.io Drive

Suite Studios has integrated its file-streaming technology into the newly announced Frame.io Drive, a desktop application from Adobe company Frame.io. The colla...

21/04/2026

NAB 2026: Net Insight integrates InSync FrameFormer into Nimbra Edge for media processing

Net Insight has integrated InSync Technology's FrameFormer into the Nimbra E...

21/04/2026

NAB 2026: Fox Sports selects Appear X Platform for live production infrastructure

Fox Sports has selected Appear as a technology partner to support the next phase...

21/04/2026

NAB 2026: Diversified appoints Tyler Affolter as Chief Revenue Officer

Diversified has appointed Tyler Affolter as Chief Revenue Officer (CRO) to lead the company's commercial organisation. The appointment follows the firm'...

21/04/2026

NAB 2026: Layercake integrates Bitmovin into Streamcake platform for end-to-end media orchestration

Layercake has formalised the integration of Bitmovin's video streaming infra...

21/04/2026

NAB 2026: International Judo Federation extends global content distribution partnership with SES

The International Judo Federation (IJF) has extended its distribution partnershi...

21/04/2026

NAB 2026: Glookast integrates Cinnafilm Tachyon plugin to enable GPU-accelerated video processing

Glookast has launched the Cinnafilm Tachyon plugin for its Media Producer and Me...

21/04/2026

NAB 2026: Cadena Tres selects Eutelsat for television signal distribution in Mexico

Eutelsat has entered into an agreement with Cadena Tres, a division of Grupo Ima...

21/04/2026

NAB 2026: Dolby and TV Azteca deploy Dolby Atmos for free-to-air broadcast

Dolby Laboratories and TV Azteca have partnered to introduce Dolby Atmos immersive audio to free-to-air television broadcasts. The implementation utilises the A...

21/04/2026

Verizon and FOX Entertainment leverage 5G and AI for remote production of Extracted

FOX Entertainment partnered with Verizon to overcome significant production hurd...

21/04/2026

NAB 2026: Osprey Video to showcase expanded IP infrastructure and orchestration at NAB Show 2026

Osprey Video has announced its technology showcase for the NAB Show 2026, highli...

21/04/2026

NAB 2026: Riedel introduces IP-based production updates including multiviewer, commentary control and audio connectivity solutions

Riedel Communications (Booth C4908) introduced a range of new solutions at NAB S...

21/04/2026

SportsTechBuzz at NAB 2026, Day 3: Live Reports From the Show Floor in Vegas

The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...

21/04/2026

NAB 2026: Blackmagic Design Announces URSA Cine Immersive 100G and URSA Cine Live Encoder

Blackmagic Design has announced the URSA Cine Immersive 100G, an immersive cinem...

21/04/2026

Live From NAB 2026: Clark Wire & Cables David McCarthy Showcases New Connectivity, Enclosure Solutions for Modern Broadcast Workflows at NAB Show 2026

Clark Wire & Cable is continuing its evolution from cable supplier to full-scale solutions partner for broadcast and live production. At the 2026 NAB Show, we s...

21/04/2026

Ricky Sensitively Portrays a Post-Incarceration Coming-of-Age

Rashad Frett attends the 2025 Sundance Film Festival premiere of Ricky at Eccles Theatre on January 24, 2025, in Park City, UT. (Photo by George Pimentel/Shut...

21/04/2026

MAS and Lockheed Martin Announce F-35 Sustainment Partnership in Quebec

MAS and Lockheed Martin partner to establish an F-35 depot in Canada, enabling in-country sustainment and creating high-skilled aerospace jobs....

21/04/2026

Nielsen data shows Australian outdoor and sport retailers are changing how they advertise to win over outdoor enthusiasts

Advertising strategies shift as competition grows for a large, active and qualit...

21/04/2026

ATSC Celebrates 3.0's Global Expansion

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Cinematic Feel Makes Survivor' Built to Last

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Live Event Technology Expands Fan Engagement

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

MS NOW Uses Community to Build Up Its Brand

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Why Broadcast Is Well-Positioned to Safeguard Freedom of Speech

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

AWS Demos AI Tools to Deliver Vertical Video

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Video Podcasting Leaps in Popularity

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Audio Systems Get Boost From Cloud and AI

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration with Streamcake

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration w...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse...

21/04/2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026 Melanie Ciotti April 21, 2026 0 Comments Unlimited processing, no cloud depe...

21/04/2026

IBC appoints Tim Banham as Chief Commercial Officer to dr...

London, 21 April 2026 IBC today announced the appointment of Tim Banham as its first Chief Commercial Officer (CCO), a newly created role that reflects the or...

21/04/2026

Motion Design Tools - April 2026

Motion Design Tools - April 2026 Roland Kahlenberg April 21, 2026 0 Comments Within 2 days, Maxon and Canva announced pro-level motion design apps - A...

21/04/2026

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and XR

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and...

21/04/2026

Diversified Appoints Tyler Affolter Chief Revenue Officer

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

TV Azteca to Bring Dolby Atmos to Free-To-Air TV in Mexico

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Maxon Announces Free Tools and Mobile Expansion of ZBrush...

Cinema 4D brings professional 3D workflows to iPad. The return of Autograph now free for individual users. ZBrush expands to Windows on Arm. See it all at NAB...

21/04/2026

Bitfocus improves availability, security and user managem...

Software version 1.6 extends enterprise functionality to place Buttons at the heart of media operations at any scale Bitfocus, the Norwegian software develope...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2...

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows Compact, multi-function stan...

21/04/2026

Tribeca Festival 2026 Announces Television And Podcast Lineup

April 21st, 2026 Press Materials Available Here TRIBECA FESTIVAL 2026 ANNOUNCES TELEVISION AND PODCAST LINEUP Tribeca Television Spotlights the 50th Season o...

20/04/2026

Live From NAB 2026: Sonys Hugo Gaggioni Highlights HDR Advances, Software-Defined Workflows

At the 2026 NAB Show, Sony is showcasing a broad slate of innovations across liv...