
AI models are advancing at a rapid rate and scale.
But what might they lack that (most) humans don't? Common sense: an understanding, developed through real-world experiences, that birds can't fly backwards, mirrors are reflective and ice melts into water.
While such principles seem obvious to humans, they must be taught to AI models tasked with accurately answering complex questions and navigating unpredictable physical environments, such as industrial warehouses or roads.
NVIDIA is tackling this challenge by developing a set of tests to coach AI models on the limitations of the physical world. In other words, to teach AI common sense.
These tests are used to develop reasoning models such as NVIDIA Cosmos Reason, an open reasoning vision language model (VLM) used for physical AI applications that are proficient in generating temporally grounded responses. Cosmos Reason just topped the physical reasoning leaderboard on Hugging Face.
Cosmos Reason is unique compared with previous VLMs as it's designed to accelerate physical AI development for fields such as robotics, autonomous vehicles and smart spaces. The model can infer and reason through unprecedented scenarios using physical common-sense knowledge.
For models to understand complex environments - including industrial spaces and laboratories - they must start small. For example, in the test depicted below, the Cosmos Reason model is tasked with answering a multiple-choice question about the relative motion in the video:
https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_DrivingExample.mp4
Example from Cosmos Reason evaluation dataset
What Does Reasoning Look Like for an AI Model? To develop their reasoning capabilities, NVIDIA models are being taught physical common sense about the real world via reinforcement learning.
For example, robots don't intuitively know which way is left, right, up or down. They're taught these spatial-temporal limitations through training. AI-powered robots used in safety testing, such as vehicle crash testing, must be taught to be aware of how their physical forms interact with their surroundings.
Without embedding common sense into the training of these robots, issues can arise in deployment.
Without basic knowledge about the physical world, a robot may fall down or accidentally break something, causing danger to the surrounding people and environment, said Yin Cui, a Cosmos Reason research scientist at NVIDIA.
Distilling human common sense about the physical world into models is how NVIDIA is bringing about the next generation of AI.
Enter the NVIDIA data factory team: a group of global analysts who come from various backgrounds - including bioengineering, business and linguistics. They're working to develop, analyze and compile hundreds of thousands of data units that will be used to train generative AI models on how to reason.
The Data Curation Process One of the NVIDIA data factory team's projects focuses on the development of world foundation models for physical AI applications. These virtual environments create deep learning neural networks that are safer and more effective for training reasoning models, based on simulated domains.
It all starts with an NVIDIA annotation group that creates question-and-answer pairs based on video data. These videos are all from the real world and can include any type of footage, whether depicting chickens walking around in their coop or cars driving on a rural road.
For example, an annotator might ask about the video below: The person uses which hand to cut the spaghetti?
https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_SpaghettiExample.mp4
Example from Cosmos Reason evaluation dataset
The annotators then come up with four multiple choice answers labeled A, B, C and D. The model is fed the data and has to reason and choose the correct answer.
We're basically coming up with a test for the model, said Cui. All of our questions are multiple choice, like what students would see on a school exam.
These question-and-answer pairs are then quality checked by NVIDIA analysts, such as Michelle Li.
Li has a background in public health and data analytics, which allows her to look at the broader purpose of the data she analyzes.
For physical AI, we have a specific goal of wanting to train models on understanding the physical world, which helps me think about the bigger picture when I'm looking at the Q&A pairs and the types of questions that are being presented, Li said. I ask myself, do the Q&A pairs that I'm looking at align with our objectives for the guidelines that we have for the project?
After this, the data is reviewed by the data factory leads of the project, who make sure it's up to quality standards and ready to be sent to the Cosmos Reason research team. The scientists then feed the hundred thousands of data units - in this case the Q&A pairs - to the model, training it with reinforcement learning on the bounds and limitations of the physical world.
What Are the Applications of Reasoning AI? Reasoning models are exceptional because they can make sense of their temporal space as well as predict outcomes. They can analyze a situation, come up with a thought web of probable outcomes and infer the most likely scenario.
Simply put, reasoning AI demonstrates humanlike thinking. It shows its work, giving the user insight into the logic behind its responses.
Users can ask these models to analyze a video such as of two cars driving on a road. When asked a question like, What would happen if the cars were driving toward each other on the same lane? the model can reason and determine the most probable outcome of the proposed scenario - for example, a car crash.
We're building a pioneering reasoning model focused on physical AI, said Tsung-Yi Lin, a princ
Most recent headlines
06/10/2025
France T l visions, France's leading broadcaster, has received the 2025 EBU ...
04/09/2025
Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...
28/08/2025
By Kristin Feeley, Director, Documentary Film & Artist Programs
If you want to tell untold stories, if you want to give voice to the voiceless, you've got ...
28/08/2025
Directed by Steven Bognar and Julia Reichert, Sundance Institute-supported Amer...
28/08/2025
Corridos have been a cornerstone of M sica Mexicana for generations, telling stories rooted in everyday life. Now, a new chapter is taking shape: motivational c...
28/08/2025
Los corridos han sido un pilar de la M sica Mexicana durante generaciones, contando historias enraizadas en la vida cotidiana. Ahora, un nuevo cap tulo est tom...
28/08/2025
Earlier this month, we promised our Verano Forever party would bring the heat, a...
28/08/2025
L3Harris will provide the Polish F-16V fleet with the Viper Shield electronic warfare system as part of an upgrade program....
28/08/2025
Bilbao, August 26, 2025 - AgileTV, an international television and video technol...
28/08/2025
Ken Wilkinson is an Emmy Awards nominated New York audio engineer who specialises in production sound mixing for film, commercial, episodic and documentary work...
28/08/2025
NEW YORK FuboTV today announced that it will launch Fubo Sports, a skinny bundle that focuses on sports with a subscription price of $56 monthly....
28/08/2025
NEVADA City, Calif. At IBC2025, Sept. 12-15 at the RAI Amsterdam, Telestream will debut its new Global Ingest strategy, introducing a next-generation ingest arc...
28/08/2025
Dr. Rhoda Bernard Releases Groundbreaking Debut Book on Accessible Arts Educatio...
28/08/2025
TAG Video Systems, the leader in software-based IP end-to-end workflow monitoring, deep probing, and real-time visualization, has named Oliver Gappa as Sales Di...
28/08/2025
AI-based voice enhancement will be among a series of innovations making their IBC 2025 debut on the DHD stand B46 in Hall 8 at the RAI Amsterdam Convention Cent...
28/08/2025
Telef nica Servicios Audiovisuales (TSA), the leading system integrator and service provider in the media sector in Spain, with the support of Appear, the globa...
28/08/2025
To fully immerse sailing fans in the world's biggest offshore yacht race, production company, Optical Media turned to LiveU's On-site Production solutio...
28/08/2025
Working with Calrec on its most recent overhaul, radio and television broadcaster, WNED has migrated to a fully IP infrastructure with multiple Type R consoles,...
28/08/2025
Cleeng, the Subscriber Retention Management (SRM ) inventor, has unveiled Cleeng Pro, the first-ever direct-to-consumer (D2C) subscription management platform t...
28/08/2025
Zixi, the industry leader in live broadcast-quality video over IP, today announced that French media distribution platform OKAST has selected Zixi to enable rel...
28/08/2025
Solution offers a streamlined, speaker-free architecture to optimize integration with premium external loudspeakers and advanced loudness metering
Nixer Pro Au...
28/08/2025
Cinegy, the premier provider of software-defined television technology, has announced a strategic partnership with Vision One Touch Film Production Services L.L...
28/08/2025
Telestream, a global leader in media workflow technologies, will debut its new Global Ingest strategy at IBC2025, introducing a next-generation ingest architect...
28/08/2025
Tier 1 operator selects Broadpeak to power high-performance, unified CDN solution across Norway, Sweden and Finland
Broadpeak, a leader in streaming and moneti...
28/08/2025
Leading video software provider, Synamedia, today announced that 24 Frames Digital, one of India's leading live event streaming service providers, has chose...
28/08/2025
Meet VisualOn at IBC2025: See What's Next in AI-Powered Video Streaming Join VisualOn at IBC2025 and discover how our AI-driven Optimizer and advanced media...
28/08/2025
IBC stand 5.F81
Wowza to Reveal Next-Gen Video Streaming Innovations at IBC 2025
Amsterdam, August 28, 2025 Wowza, a leader in video streaming infrastructur...
28/08/2025
VIDA, the secure cloud-native media asset management platform, is launching at IBC Show 2025 Media Factory, a drag-and-drop workflow automation engine designed ...
28/08/2025
WASHINGTON The Federal Communications Commission has admitted that it inadvertently removed some rules relating to NextGen TV/ATSC 3.0 and has moved to correct ...
28/08/2025
WASHINGTON The National Association of Broadcasters (NAB) has announced the launch of the NextGen TV News Technology Lab, a three-year initiative designed to he...
28/08/2025
BRASILIA Brazilian President Luiz In cio Lula da Silva has signed an official presidential decree establishing DTV+ (TV 3.0) incorporating many parts of the ATS...
28/08/2025
With the FCC's spectrum auction authority back in hand, lines in the sand are being drawn for the potential reallocation of the Upper C-Band for 5G mobile b...
28/08/2025
As the use of generative AI becomes more common in media operations and production, Netflix has laid out detailed guidelines for their use and provided guidance...
28/08/2025
Fox and YouTube TV have agreed to a short-term extension in their carriage talks as they try to replace the existing agreement that expired on August 27 at 5 pm...
28/08/2025
Boston Conservatory at Berklee Announces Center Stage Performances for 2025-2026...
28/08/2025
28 Aug 2025
VEON and Kyivstar to Host Investor Meeting on August 28, 2025 New York, August 28, 2025 - VEON Ltd. (Nasdaq: VEON), a global digital operator and K...
28/08/2025
UKTV has acquired the UK premiere of Art Detectives, the compelling new crime an...
28/08/2025
The new oil is sports': Saudis share masterplan to boost esports on global stage By Adrian Pennington
Tuesday, August 26, 2025 - 08:49
Print This Story...
28/08/2025
Esports World Cup 2025: Team Falcons Defend Title as Broadcast Production Ramps ...
28/08/2025
Report: Mobile-first content has overtaken the big screen as the way fans watch ...
28/08/2025
New Era: A Pro's Guide to ESPN's Expanded College Football Playoff Graph...
28/08/2025
College Football Kickoff 2025: The CW Heads Into ACC, Pac-12 Schedule With New P...
28/08/2025
Live From US Open 2025: Tennis Channel Returns With Onsite Studio Show New set has a prime site; production is handled remotely By Ken Kerschbaumer, Editorial ...
28/08/2025
NBC Sports Drives Its Coverage of NASCAR Cup Series Into Postseason Key technical upgrades for 2025 include modified Peacock Pit Box By Mark J Burns, SVG Contr...
28/08/2025
College Football Kickoff 2025: TNT Sports Launches First Big 12 Campaign With On...
28/08/2025
College Football Kickoff 2025: ESPN Gears Up for 1,000+ Games With Massive Opera...
28/08/2025
F&F's GTX21, the Company's Largest Production Unit, Hits the Road for CB...
28/08/2025
MSPO 2025: True spectrum dominance in all domains - powered by Rohde & Schwarz Rohde & Schwarz will showcase its solutions and capabilities supporting multi-d...
28/08/2025
Back to All News
Genie, Make a Wish' Teaser Hints at a Magical Showdown Be...
28/08/2025
Back to All News
Stuck on Repeat: Same Day with Someone' Trailer Unveils H...