
AI models are advancing at a rapid rate and scale.
But what might they lack that (most) humans don't? Common sense: an understanding, developed through real-world experiences, that birds can't fly backwards, mirrors are reflective and ice melts into water.
While such principles seem obvious to humans, they must be taught to AI models tasked with accurately answering complex questions and navigating unpredictable physical environments, such as industrial warehouses or roads.
NVIDIA is tackling this challenge by developing a set of tests to coach AI models on the limitations of the physical world. In other words, to teach AI common sense.
These tests are used to develop reasoning models such as NVIDIA Cosmos Reason, an open reasoning vision language model (VLM) used for physical AI applications that are proficient in generating temporally grounded responses. Cosmos Reason just topped the physical reasoning leaderboard on Hugging Face.
Cosmos Reason is unique compared with previous VLMs as it's designed to accelerate physical AI development for fields such as robotics, autonomous vehicles and smart spaces. The model can infer and reason through unprecedented scenarios using physical common-sense knowledge.
For models to understand complex environments - including industrial spaces and laboratories - they must start small. For example, in the test depicted below, the Cosmos Reason model is tasked with answering a multiple-choice question about the relative motion in the video:
https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_DrivingExample.mp4
Example from Cosmos Reason evaluation dataset
What Does Reasoning Look Like for an AI Model? To develop their reasoning capabilities, NVIDIA models are being taught physical common sense about the real world via reinforcement learning.
For example, robots don't intuitively know which way is left, right, up or down. They're taught these spatial-temporal limitations through training. AI-powered robots used in safety testing, such as vehicle crash testing, must be taught to be aware of how their physical forms interact with their surroundings.
Without embedding common sense into the training of these robots, issues can arise in deployment.
Without basic knowledge about the physical world, a robot may fall down or accidentally break something, causing danger to the surrounding people and environment, said Yin Cui, a Cosmos Reason research scientist at NVIDIA.
Distilling human common sense about the physical world into models is how NVIDIA is bringing about the next generation of AI.
Enter the NVIDIA data factory team: a group of global analysts who come from various backgrounds - including bioengineering, business and linguistics. They're working to develop, analyze and compile hundreds of thousands of data units that will be used to train generative AI models on how to reason.
The Data Curation Process One of the NVIDIA data factory team's projects focuses on the development of world foundation models for physical AI applications. These virtual environments create deep learning neural networks that are safer and more effective for training reasoning models, based on simulated domains.
It all starts with an NVIDIA annotation group that creates question-and-answer pairs based on video data. These videos are all from the real world and can include any type of footage, whether depicting chickens walking around in their coop or cars driving on a rural road.
For example, an annotator might ask about the video below: The person uses which hand to cut the spaghetti?
https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_SpaghettiExample.mp4
Example from Cosmos Reason evaluation dataset
The annotators then come up with four multiple choice answers labeled A, B, C and D. The model is fed the data and has to reason and choose the correct answer.
We're basically coming up with a test for the model, said Cui. All of our questions are multiple choice, like what students would see on a school exam.
These question-and-answer pairs are then quality checked by NVIDIA analysts, such as Michelle Li.
Li has a background in public health and data analytics, which allows her to look at the broader purpose of the data she analyzes.
For physical AI, we have a specific goal of wanting to train models on understanding the physical world, which helps me think about the bigger picture when I'm looking at the Q&A pairs and the types of questions that are being presented, Li said. I ask myself, do the Q&A pairs that I'm looking at align with our objectives for the guidelines that we have for the project?
After this, the data is reviewed by the data factory leads of the project, who make sure it's up to quality standards and ready to be sent to the Cosmos Reason research team. The scientists then feed the hundred thousands of data units - in this case the Q&A pairs - to the model, training it with reinforcement learning on the bounds and limitations of the physical world.
What Are the Applications of Reasoning AI? Reasoning models are exceptional because they can make sense of their temporal space as well as predict outcomes. They can analyze a situation, come up with a thought web of probable outcomes and infer the most likely scenario.
Simply put, reasoning AI demonstrates humanlike thinking. It shows its work, giving the user insight into the logic behind its responses.
Users can ask these models to analyze a video such as of two cars driving on a road. When asked a question like, What would happen if the cars were driving toward each other on the same lane? the model can reason and determine the most probable outcome of the proposed scenario - for example, a car crash.
We're building a pioneering reasoning model focused on physical AI, said Tsung-Yi Lin, a princ
Most recent headlines
11/12/2025
Dalet, a leading provider of cloud-native, end-to-end media workflow solutions, ...
06/12/2025
In a live broadcast from the Reagan National Defense Forum, L3Harris Chair and CEO Christopher Kubasik joined Morgan Brennan on CNBCs Closing Bell: Overtime. Ku...
06/12/2025
FORT LAUDERDALE, Fla. A new survey from Pixitmedia by Datacore revealed a major shift in the Media & Entertainment industry in media archiving, with 85% of resp...
06/12/2025
HACKENSACK, N.J. LiveU has announced that the national public broadcaster Czech Television has completed one of the largest LiveU live production deployments fo...
06/12/2025
NEW YORK The National Academy of Television Arts and Sciences (NATAS) presented the Excellence in Production Technology Emmy Award to NASA+ and Dr. Tom Leight...
05/12/2025
2025 Sports Broadcasting Hall of Fame: Curt Gowdy Jr. - Master Storyteller, Nati...
05/12/2025
SVG Sit-Down: Veritone's Sean King on the Power of Mining Video, Audio DataThe company's Data Refinery offers users total control and governance over da...
05/12/2025
Platinum White Paper: Inside the Nashville Predators' Unified, Flexible, Sca...
05/12/2025
Netflix Reaches Agreement To Acquire Warner Bros. Following Planned WBD SplitThe deal does not include WBDs sports assets like TNT Sports (US, UK, LatAm), Euros...
05/12/2025
FOX Sports Returns to Indianapolis for Primetime Broadcast of Big Ten Championsh...
05/12/2025
SVG Summit 2025 Preview: Digital Engagement & Monetization Workshop Tackles the ...
05/12/2025
Atlanta United Lights Up New Emory Healthcare Studio With First Live Broadcast f...
05/12/2025
As Messi Takes the Pitch, MLS, Apple, NEP Roll Out Largest MLS Cup Production Ev...
05/12/2025
ESPN Enters College Football's Most Intense Month With Elevated Workflows fo...
05/12/2025
It's about that time! Awards season is in full swing, and the Film Independe...
05/12/2025
Every year, Spotify Wrapped offers a personalized look back at the audio that defined your year. It's a snapshot of your listening habits, designed to tell ...
05/12/2025
In 2025, Spotify's EQUAL, GLOW, and RADAR programs celebrated women, LGBTQIA , and emerging artists who turned moments into milestones. From breaking record...
05/12/2025
In our latest blog, we explain how Wi-Fi 7 rollouts can drive consumer loyalty with value-add services such as consumer cybersecurity. We also explore how this ...
05/12/2025
LOS ANGELES Netflix announced it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....
05/12/2025
NEW YORK Nielsens Gracenote has launched Gracenote Content Connect, a new ad platform that provides agencies, brands, supply-side platforms (SSPs) and demand-si...
05/12/2025
NEW YORK In an most important update to the workings of deal-based programmatic advertising, IAB Tech Lab has released version 1.0 of its Deals API for public c...
05/12/2025
NEW YORK Pass the turkey. Pass the stuffing. Pass the cranberry sauce. All are common requests of Americans celebrating Thanksgiving Day with family and f...
05/12/2025
NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...
05/12/2025
HOLLYWOOD, Calif. Netflix announced today that it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....
05/12/2025
NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...
05/12/2025
WASHINGTON The Federal Communications Commission has approved AT&T's $1.02 billion acquisition of spectrum from UScellular in a decision that was issued sho...
05/12/2025
The Best Coldplay Songs: 21 Tracks That Shoot for the Stars From Yellow to Viva La Vida, Fix You to Paradise, this playlist goes back to the start.
December ...
05/12/2025
Zafris Lecture Series Brings Nabil Ayers to Berklee The 32nd annual James G. Zafris Distinguished Lecture series was held on Thursday, November 13, with guest...
05/12/2025
Introducing New Perks to Help You Get Even More from LinkedIn Premium Published on Dec 5, 2025 Categories: Company News, Product News
LinkedIn Corporate Co...
05/12/2025
Friday 5 December 2025
A new Game of Thrones Tale: Official trailer for Sky Exc...
05/12/2025
Back to All News
Don Lee, Lee Jin-uk, and Lalisa Manobal to Star in Netflix Act...
05/12/2025
Tis the season of giving once again and this year we've taken our Give Back Fridays' concept and turned it on its head.
In the autumn we were approach...
05/12/2025
Brayden Gogis doesn't remember a time when he wasn't completely fixated on games in all forms. In preschool, when they asked us to dress up as what we ...
05/12/2025
The Grinch steals the spotlight as the theme for The Late Late Toy Show 2025
Tune in tonight at 9:35pm on RT One and worldwide on RT Player
#LateLateToyShow...
05/12/2025
RT Announces New Presenters of Flagship News Programmes
New RT Six One News co-presenter Tommy Meskill
Sarah McInerney & Justin McCarthy join Morning Ir...
04/12/2025
ToolsOnAir Blackmagic Design HyperDeck Event Presets for just:in mac pro 2025 & ...
04/12/2025
ToolsOnAir AJA Ki Pro Event Presets for just:in mac pro 2025 & just:in linux
More Details:Starting with version 5.5, both just:in mac pro and just:in linux sol...
04/12/2025
Wangu Kanuri from Kenya and Godwin Asediba from Ghana are two of this years finalists for Thomsons Young Journalist of the Year Award. The pair are runners-up i...
04/12/2025
SVG Sit-Down: ProximaVision's Claudio Lisman on Why Tethered Drones Could Be...
04/12/2025
SVG Campus Shot Callers: Imry Halevi, Senior Associate Director of Athletics, Co...
04/12/2025
Platinum White Paper: LiveU Lightweight Sports Production: A Step Change in Spor...
04/12/2025
London to Riyadh: DAZN brings the boxing glamour to new production levels for Be...
04/12/2025
Analysis: Paramount bets on the battering ram' with Champions League play By Callum McCarthy, Editor-at-Large
Tuesday, December 2, 2025 - 10:12
Print ...
04/12/2025
Space City Home Network Launches SCHN DTC App for Astros and RocketsThe Rockets and Astros were previously the lone NBA and MLB teams without a DTC appBy Jason...
04/12/2025
SVG Summit 2025 Preview: Content Workflows Workshop Spotlights Evolution of Spor...
04/12/2025
New Sponsor Spotlight: Geotech's Patrick Wambold On the Unreal Engine Revolu...
04/12/2025
Curt Gowdy Jr. - Master Storyteller, Nationally and RegionallyBy Jason Dachman, Editorial Director, U.S.
Thursday, December 4, 2025 - 1:52 pm
Print This Sto...
04/12/2025
(L-R) Rebecca Lichtenfeld, Mohammadreza Eyni, Sara Khaki, and Judith Helfand att...
04/12/2025
SBS launches Future Frames initiative to support emerging First Nations video ed...