Sony Pixel Power calrec Sony

How Do You Teach an AI Model to Reason? With Humans

27/08/2025

AI models are advancing at a rapid rate and scale.

But what might they lack that (most) humans don't? Common sense: an understanding, developed through real-world experiences, that birds can't fly backwards, mirrors are reflective and ice melts into water.

While such principles seem obvious to humans, they must be taught to AI models tasked with accurately answering complex questions and navigating unpredictable physical environments, such as industrial warehouses or roads.

NVIDIA is tackling this challenge by developing a set of tests to coach AI models on the limitations of the physical world. In other words, to teach AI common sense.

These tests are used to develop reasoning models such as NVIDIA Cosmos Reason, an open reasoning vision language model (VLM) used for physical AI applications that are proficient in generating temporally grounded responses. Cosmos Reason just topped the physical reasoning leaderboard on Hugging Face.

Cosmos Reason is unique compared with previous VLMs as it's designed to accelerate physical AI development for fields such as robotics, autonomous vehicles and smart spaces. The model can infer and reason through unprecedented scenarios using physical common-sense knowledge.

For models to understand complex environments - including industrial spaces and laboratories - they must start small. For example, in the test depicted below, the Cosmos Reason model is tasked with answering a multiple-choice question about the relative motion in the video:

https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_DrivingExample.mp4

Example from Cosmos Reason evaluation dataset

What Does Reasoning Look Like for an AI Model? To develop their reasoning capabilities, NVIDIA models are being taught physical common sense about the real world via reinforcement learning.

For example, robots don't intuitively know which way is left, right, up or down. They're taught these spatial-temporal limitations through training. AI-powered robots used in safety testing, such as vehicle crash testing, must be taught to be aware of how their physical forms interact with their surroundings.

Without embedding common sense into the training of these robots, issues can arise in deployment.

Without basic knowledge about the physical world, a robot may fall down or accidentally break something, causing danger to the surrounding people and environment, said Yin Cui, a Cosmos Reason research scientist at NVIDIA.

Distilling human common sense about the physical world into models is how NVIDIA is bringing about the next generation of AI.

Enter the NVIDIA data factory team: a group of global analysts who come from various backgrounds - including bioengineering, business and linguistics. They're working to develop, analyze and compile hundreds of thousands of data units that will be used to train generative AI models on how to reason.

The Data Curation Process One of the NVIDIA data factory team's projects focuses on the development of world foundation models for physical AI applications. These virtual environments create deep learning neural networks that are safer and more effective for training reasoning models, based on simulated domains.

It all starts with an NVIDIA annotation group that creates question-and-answer pairs based on video data. These videos are all from the real world and can include any type of footage, whether depicting chickens walking around in their coop or cars driving on a rural road.

For example, an annotator might ask about the video below: The person uses which hand to cut the spaghetti?

https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_SpaghettiExample.mp4

Example from Cosmos Reason evaluation dataset

The annotators then come up with four multiple choice answers labeled A, B, C and D. The model is fed the data and has to reason and choose the correct answer.

We're basically coming up with a test for the model, said Cui. All of our questions are multiple choice, like what students would see on a school exam.

These question-and-answer pairs are then quality checked by NVIDIA analysts, such as Michelle Li.

Li has a background in public health and data analytics, which allows her to look at the broader purpose of the data she analyzes.

For physical AI, we have a specific goal of wanting to train models on understanding the physical world, which helps me think about the bigger picture when I'm looking at the Q&A pairs and the types of questions that are being presented, Li said. I ask myself, do the Q&A pairs that I'm looking at align with our objectives for the guidelines that we have for the project?

After this, the data is reviewed by the data factory leads of the project, who make sure it's up to quality standards and ready to be sent to the Cosmos Reason research team. The scientists then feed the hundred thousands of data units - in this case the Q&A pairs - to the model, training it with reinforcement learning on the bounds and limitations of the physical world.

What Are the Applications of Reasoning AI? Reasoning models are exceptional because they can make sense of their temporal space as well as predict outcomes. They can analyze a situation, come up with a thought web of probable outcomes and infer the most likely scenario.

Simply put, reasoning AI demonstrates humanlike thinking. It shows its work, giving the user insight into the logic behind its responses.

Users can ask these models to analyze a video such as of two cars driving on a road. When asked a question like, What would happen if the cars were driving toward each other on the same lane? the model can reason and determine the most probable outcome of the proposed scenario - for example, a car crash.

We're building a pioneering reasoning model focused on physical AI, said Tsung-Yi Lin, a princ
LINK: https://blogs.nvidia.com/blog/ai-reasoning-cosmos/...
See more stories from nvidia

Most recent headlines

06/12/2025

L3Harris Chair and CEO Appears on CNBC at Reagan National Defense Forum

In a live broadcast from the Reagan National Defense Forum, L3Harris Chair and CEO Christopher Kubasik joined Morgan Brennan on CNBCs Closing Bell: Overtime. Ku...

06/12/2025

Survey: M&E Embraces Horizontally Integrated Media Archiving Approach

FORT LAUDERDALE, Fla. A new survey from Pixitmedia by Datacore revealed a major shift in the Media & Entertainment industry in media archiving, with 85% of resp...

06/12/2025

Czech TV Deploys LiveU Solutions in 10 OB Vans

HACKENSACK, N.J. LiveU has announced that the national public broadcaster Czech Television has completed one of the largest LiveU live production deployments fo...

06/12/2025

NATAS Celebrates 76th Technology & Engineering Emmy Award Honorees

NEW YORK The National Academy of Television Arts and Sciences (NATAS) presented the Excellence in Production Technology Emmy Award to NASA+ and Dr. Tom Leight...

05/12/2025

2025 Sports Broadcasting Hall of Fame: Curt Gowdy Jr. - Master Storyteller, Nationally and Regionally

2025 Sports Broadcasting Hall of Fame: Curt Gowdy Jr. - Master Storyteller, Nati...

05/12/2025

SVG Sit-Down: Veritone's Sean King on the Power of Mining Video, Audio Data

SVG Sit-Down: Veritone's Sean King on the Power of Mining Video, Audio DataThe company's Data Refinery offers users total control and governance over da...

05/12/2025

Platinum White Paper: Inside the Nashville Predators' Unified, Flexible, Scalable Production System with Ross Video

Platinum White Paper: Inside the Nashville Predators' Unified, Flexible, Sca...

05/12/2025

Netflix Reaches Agreement To Acquire Warner Bros. Following Planned WBD Split

Netflix Reaches Agreement To Acquire Warner Bros. Following Planned WBD SplitThe deal does not include WBDs sports assets like TNT Sports (US, UK, LatAm), Euros...

05/12/2025

FOX Sports Returns to Indianapolis for Primetime Broadcast of Big Ten Championship

FOX Sports Returns to Indianapolis for Primetime Broadcast of Big Ten Championsh...

05/12/2025

SVG Summit 2025 Preview: Digital Engagement & Monetization Workshop Tackles the Future of the Viewer Experience

SVG Summit 2025 Preview: Digital Engagement & Monetization Workshop Tackles the ...

05/12/2025

Atlanta United Lights Up New Emory Healthcare Studio With First Live Broadcast for World Cup Draw

Atlanta United Lights Up New Emory Healthcare Studio With First Live Broadcast f...

05/12/2025

As Messi Takes the Pitch, MLS, Apple, NEP Roll Out Largest MLS Cup Production Ever

As Messi Takes the Pitch, MLS, Apple, NEP Roll Out Largest MLS Cup Production Ev...

05/12/2025

ESPN Enters College Football's Most Intense Month With Elevated Workflows for Championship Week

ESPN Enters College Football's Most Intense Month With Elevated Workflows fo...

05/12/2025

Sorry, Baby, Peter Hujar's Day, Among Sundance Institute-Supported Projects Nominated for 2026 Film Independent Spirit Awards

It's about that time! Awards season is in full swing, and the Film Independe...

05/12/2025

Surprised by Your 2025 Wrapped? Here's a Look at How the Data Comes to Life

Every year, Spotify Wrapped offers a personalized look back at the audio that defined your year. It's a snapshot of your listening habits, designed to tell ...

05/12/2025

Celebrating Spotify's GLOW, RADAR, and EQUAL Artists of 2025

In 2025, Spotify's EQUAL, GLOW, and RADAR programs celebrated women, LGBTQIA , and emerging artists who turned moments into milestones. From breaking record...

05/12/2025

Wi-Fi 7 - Go Beyond Speed to Deliver Security, Trust and Value

In our latest blog, we explain how Wi-Fi 7 rollouts can drive consumer loyalty with value-add services such as consumer cybersecurity. We also explore how this ...

05/12/2025

Netflix to Acquire Warner Bros. in Deal Worth $82.7 Billon

LOS ANGELES Netflix announced it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....

05/12/2025

Gracenote Launches New CTV Ad Platform

NEW YORK Nielsens Gracenote has launched Gracenote Content Connect, a new ad platform that provides agencies, brands, supply-side platforms (SSPs) and demand-si...

05/12/2025

IAB Tech Lab Releases Deals API

NEW YORK In an most important update to the workings of deal-based programmatic advertising, IAB Tech Lab has released version 1.0 of its Deals API for public c...

05/12/2025

Nielsen: NFL Thanksgiving Games Score Big Audiences

NEW YORK Pass the turkey. Pass the stuffing. Pass the cranberry sauce. All are common requests of Americans celebrating Thanksgiving Day with family and f...

05/12/2025

Iris Cloud-Connected Camera Control Platform Now Available

NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...

05/12/2025

Netflix to Acquire Warner Bros. in Deal Worth $82.7B

HOLLYWOOD, Calif. Netflix announced today that it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....

05/12/2025

Iris Cloud-Connected Camera Control Platform Is Now Available

NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...

05/12/2025

FCC Approves AT&T's $1 Billion Acquisition of UScellular Spectrum

WASHINGTON The Federal Communications Commission has approved AT&T's $1.02 billion acquisition of spectrum from UScellular in a decision that was issued sho...

05/12/2025

The Best Coldplay Songs: 21 Tracks That Shoot for the Stars

The Best Coldplay Songs: 21 Tracks That Shoot for the Stars From Yellow to Viva La Vida, Fix You to Paradise, this playlist goes back to the start. December ...

05/12/2025

Zafris Lecture Series Brings Nabil Ayers to Berklee

Zafris Lecture Series Brings Nabil Ayers to Berklee The 32nd annual James G. Zafris Distinguished Lecture series was held on Thursday, November 13, with guest...

05/12/2025

Introducing New Perks to Help You Get Even More from...

Introducing New Perks to Help You Get Even More from LinkedIn Premium Published on Dec 5, 2025 Categories: Company News, Product News LinkedIn Corporate Co...

05/12/2025

A new Game of Thrones Tale: Official trailer for Sky Exclusive series A Knight of the Seven Kingdoms lands today

Friday 5 December 2025 A new Game of Thrones Tale: Official trailer for Sky Exc...

05/12/2025

Don Lee, Lee Jin-uk, and Lalisa Manobal to Star in Netflix Action Thriller 'TYGO'

Back to All News Don Lee, Lee Jin-uk, and Lalisa Manobal to Star in Netflix Act...

05/12/2025

Give Back Fridays with a twist is back for 2025!

Tis the season of giving once again and this year we've taken our Give Back Fridays' concept and turned it on its head. In the autumn we were approach...

05/12/2025

2025-11-06

Brayden Gogis doesn't remember a time when he wasn't completely fixated on games in all forms. In preschool, when they asked us to dress up as what we ...

05/12/2025

The Grinch steals the spotlight as the theme for The Late Late Toy Show 2025

The Grinch steals the spotlight as the theme for The Late Late Toy Show 2025 Tune in tonight at 9:35pm on RT One and worldwide on RT Player #LateLateToyShow...

05/12/2025

RT Announces New Presenters of Flagship News Programmes

RT Announces New Presenters of Flagship News Programmes New RT Six One News co-presenter Tommy Meskill Sarah McInerney & Justin McCarthy join Morning Ir...

04/12/2025

ToolsOnAir Blackmagic Design HyperDeck Event Presets for just:in mac pro 2025 & just:in linux

ToolsOnAir Blackmagic Design HyperDeck Event Presets for just:in mac pro 2025 & ...

04/12/2025

ToolsOnAir AJA Ki Pro Event Presets for just:in mac pro 2025 & just:in linux

ToolsOnAir AJA Ki Pro Event Presets for just:in mac pro 2025 & just:in linux More Details:Starting with version 5.5, both just:in mac pro and just:in linux sol...

04/12/2025

Young Journalist finalists looking to the future

Wangu Kanuri from Kenya and Godwin Asediba from Ghana are two of this years finalists for Thomsons Young Journalist of the Year Award. The pair are runners-up i...

04/12/2025

SVG Sit-Down: ProximaVision's Claudio Lisman on Why Tethered Drones Could Be a Game-Changer for Live Sports Production

SVG Sit-Down: ProximaVision's Claudio Lisman on Why Tethered Drones Could Be...

04/12/2025

SVG Campus Shot Callers: Imry Halevi, Senior Associate Director of Athletics, Content & Strategic Communications, Harvard University

SVG Campus Shot Callers: Imry Halevi, Senior Associate Director of Athletics, Co...

04/12/2025

Platinum White Paper: LiveU Lightweight Sports Production: A Step Change in Sports Storytelling

Platinum White Paper: LiveU Lightweight Sports Production: A Step Change in Spor...

04/12/2025

London to Riyadh: DAZN Brings the Boxing Glamour to New Production Levels for Benavidez v Yarde in Saudi Arabia

London to Riyadh: DAZN brings the boxing glamour to new production levels for Be...

04/12/2025

Analysis: Paramount Bets on the Battering Ram' with Champions League Play

Analysis: Paramount bets on the battering ram' with Champions League play By Callum McCarthy, Editor-at-Large Tuesday, December 2, 2025 - 10:12 Print ...

04/12/2025

Space City Home Network Launches SCHN+ DTC App for Astros and Rockets

Space City Home Network Launches SCHN DTC App for Astros and RocketsThe Rockets and Astros were previously the lone NBA and MLB teams without a DTC appBy Jason...

04/12/2025

SVG Summit 2025 Preview: Content Workflows Workshop Spotlights Evolution of Sports Media Supply Chain

SVG Summit 2025 Preview: Content Workflows Workshop Spotlights Evolution of Spor...

04/12/2025

New Sponsor Spotlight: Geotech's Patrick Wambold On the Unreal Engine Revolution Taking Place in Sports Broadcasting

New Sponsor Spotlight: Geotech's Patrick Wambold On the Unreal Engine Revolu...

04/12/2025

Curt Gowdy Jr. - Master Storyteller, Nationally and Regionally

Curt Gowdy Jr. - Master Storyteller, Nationally and RegionallyBy Jason Dachman, Editorial Director, U.S. Thursday, December 4, 2025 - 1:52 pm Print This Sto...

04/12/2025

Cutting Through Rocks ( ) Shows the Difference That One Person Can Make for Change

(L-R) Rebecca Lichtenfeld, Mohammadreza Eyni, Sara Khaki, and Judith Helfand att...

04/12/2025

SBS launches Future Frames initiative to support emerging First Nations video editing talent

SBS launches Future Frames initiative to support emerging First Nations video ed...