Sony Pixel Power calrec Sony

How Do You Teach an AI Model to Reason? With Humans

27/08/2025

AI models are advancing at a rapid rate and scale.

But what might they lack that (most) humans don't? Common sense: an understanding, developed through real-world experiences, that birds can't fly backwards, mirrors are reflective and ice melts into water.

While such principles seem obvious to humans, they must be taught to AI models tasked with accurately answering complex questions and navigating unpredictable physical environments, such as industrial warehouses or roads.

NVIDIA is tackling this challenge by developing a set of tests to coach AI models on the limitations of the physical world. In other words, to teach AI common sense.

These tests are used to develop reasoning models such as NVIDIA Cosmos Reason, an open reasoning vision language model (VLM) used for physical AI applications that are proficient in generating temporally grounded responses. Cosmos Reason just topped the physical reasoning leaderboard on Hugging Face.

Cosmos Reason is unique compared with previous VLMs as it's designed to accelerate physical AI development for fields such as robotics, autonomous vehicles and smart spaces. The model can infer and reason through unprecedented scenarios using physical common-sense knowledge.

For models to understand complex environments - including industrial spaces and laboratories - they must start small. For example, in the test depicted below, the Cosmos Reason model is tasked with answering a multiple-choice question about the relative motion in the video:

https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_DrivingExample.mp4

Example from Cosmos Reason evaluation dataset

What Does Reasoning Look Like for an AI Model? To develop their reasoning capabilities, NVIDIA models are being taught physical common sense about the real world via reinforcement learning.

For example, robots don't intuitively know which way is left, right, up or down. They're taught these spatial-temporal limitations through training. AI-powered robots used in safety testing, such as vehicle crash testing, must be taught to be aware of how their physical forms interact with their surroundings.

Without embedding common sense into the training of these robots, issues can arise in deployment.

Without basic knowledge about the physical world, a robot may fall down or accidentally break something, causing danger to the surrounding people and environment, said Yin Cui, a Cosmos Reason research scientist at NVIDIA.

Distilling human common sense about the physical world into models is how NVIDIA is bringing about the next generation of AI.

Enter the NVIDIA data factory team: a group of global analysts who come from various backgrounds - including bioengineering, business and linguistics. They're working to develop, analyze and compile hundreds of thousands of data units that will be used to train generative AI models on how to reason.

The Data Curation Process One of the NVIDIA data factory team's projects focuses on the development of world foundation models for physical AI applications. These virtual environments create deep learning neural networks that are safer and more effective for training reasoning models, based on simulated domains.

It all starts with an NVIDIA annotation group that creates question-and-answer pairs based on video data. These videos are all from the real world and can include any type of footage, whether depicting chickens walking around in their coop or cars driving on a rural road.

For example, an annotator might ask about the video below: The person uses which hand to cut the spaghetti?

https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_SpaghettiExample.mp4

Example from Cosmos Reason evaluation dataset

The annotators then come up with four multiple choice answers labeled A, B, C and D. The model is fed the data and has to reason and choose the correct answer.

We're basically coming up with a test for the model, said Cui. All of our questions are multiple choice, like what students would see on a school exam.

These question-and-answer pairs are then quality checked by NVIDIA analysts, such as Michelle Li.

Li has a background in public health and data analytics, which allows her to look at the broader purpose of the data she analyzes.

For physical AI, we have a specific goal of wanting to train models on understanding the physical world, which helps me think about the bigger picture when I'm looking at the Q&A pairs and the types of questions that are being presented, Li said. I ask myself, do the Q&A pairs that I'm looking at align with our objectives for the guidelines that we have for the project?

After this, the data is reviewed by the data factory leads of the project, who make sure it's up to quality standards and ready to be sent to the Cosmos Reason research team. The scientists then feed the hundred thousands of data units - in this case the Q&A pairs - to the model, training it with reinforcement learning on the bounds and limitations of the physical world.

What Are the Applications of Reasoning AI? Reasoning models are exceptional because they can make sense of their temporal space as well as predict outcomes. They can analyze a situation, come up with a thought web of probable outcomes and infer the most likely scenario.

Simply put, reasoning AI demonstrates humanlike thinking. It shows its work, giving the user insight into the logic behind its responses.

Users can ask these models to analyze a video such as of two cars driving on a road. When asked a question like, What would happen if the cars were driving toward each other on the same lane? the model can reason and determine the most probable outcome of the proposed scenario - for example, a car crash.

We're building a pioneering reasoning model focused on physical AI, said Tsung-Yi Lin, a princ
LINK: https://blogs.nvidia.com/blog/ai-reasoning-cosmos/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

28/02/2026

FOX Sports Waves Green Flag on 2026 IndyCar Season With Driver's Eye, Heads Up Display, Live Drones

With two features seen in Formula 1 coverage, the broadcaster aims to bring view...

28/02/2026

Secretary of War Pete Hegseth Visits L3Harris Solid Rocket Motor Site

Secretary of War Pete Hegseth addresses a crowd of approximately 1,500 L3Harris employees in Camden, Arkansas, as part of his Arsenal of Freedom tour....

28/02/2026

2026 NAB Show Expands Sports Summit to Four Days

Share Copy link Facebook X Linkedin Bluesky Email...

28/02/2026

Three TV5Monde FAST Channels Launch on Plex

Share Copy link Facebook X Linkedin Bluesky Email...

28/02/2026

Nielsen Introduces 200+ New Advanced Audience Segments

Share Copy link Facebook X Linkedin Bluesky Email...

28/02/2026

SBE Launches Annual Membership Drive

Share Copy link Facebook X Linkedin Bluesky Email...

28/02/2026

Gray's Fox 12 Inks Deal for Portland Fire And Portland Thorns Games

Share Copy link Facebook X Linkedin Bluesky Email...

28/02/2026

Berklee Presents Mambo Mania: Eguie Castrillo and the Berklee All-Stars Big Band's Tribute to Eddie Palmieri

Berklee Presents Mambo Mania: Eguie Castrillo and the Berklee All-Stars Big Band...

28/02/2026

Berklee Announces Two New Summer Programs in Los Angeles

Berklee Announces Two New Summer Programs in Los Angeles The Berklee Music Business Program and Electronic Music Production and Sound Design Workshop bring imme...

28/02/2026

NVIDIA and Partners Show That Software-Defined AI-RAN Is the Next Wireless Generation

AI-RAN is moving from lab to field, showing that a software-defined approach is ...

28/02/2026

NVIDIA Advances Autonomous Networks With Agentic AI Blueprints and Telco Reasoning Models

Autonomous networks - intelligent, self-managing telecommunications operations -...

28/02/2026

Final Trailer for BEASTARS Final Season Part 2' Roars Toward the Series' Emotional Conclusion

Back to All News Final Trailer for BEASTARS Final Season Part 2' Roars Tow...

28/02/2026

February 27, 2026

New way to intentionally discover molecular glues could expand drug discovery Scripps Research scientists and colleagues show how drugs that eliminate certain d...

27/02/2026

Scripps Appoints Oliver Gray as Vice President, Network Sports and Client Partnerships

The E.W. Scripps Company names Oliver Gray as Vice President, Network Sports and...

27/02/2026

Gotham Sports App Now Available for Purchase Through Prime Video

The Gotham Sports App, the exclusive direct-to-consumer streaming home of MSG Networks and the YES Network, is now available for purchase through Prime Video fo...

27/02/2026

ESPN, Horizon League Extend Long-Standing Rights Agreement

ESPN and the Horizon League announce a new multi-year, multi-platform media rights agreement, continuing a 38-year collaboration that began with the 1988 Midwes...

27/02/2026

NETGEAR to Showcase Expanded Broadcast Portfolio at 2026 NAB Show

At the 2026 NAB Show in Las Vegas, NETGEAR will highlight its new switch models and major updates to its Engage Controller software. The company's network d...

27/02/2026

Teatro alla Scala Elevates Backstage Communication with Riedel's Bolero Wireless Intercom System

Riedel Communications announces that Fondazione Teatro alla Scala has deployed a...

27/02/2026

Lyuno Leverages Dante AV for Synchronization of Audio and Video Content

Lyuno specializes in media localization, including translation, dubbing, subtitling, and voice-over services for a wide array of entertainment content. The comp...

27/02/2026

Chyron Releases Weather 2.3 with New Data and Automation Features

Chyron Weather 2.3, the latest edition of Chyron's weather visualization suite for broadcasters and meteorologists, recently launched. The release includes...

27/02/2026

Telestream Advances Production-Ready AI Across its Product Portfolio

Telestream, which concentrates in media workflow technologies, announces expanded practical AI enhancements across its Vantage, Vantage Cloud, EDC, Stanza, and ...

27/02/2026

Horizon Sports & Experiences, TOGETHXR Launch Joint Venture to Power Future of Women's Sports

Horizon Sports & Experiences (HS&E), a global sports marketing, media, and live ...

27/02/2026

NBC Reunites Bob Costas, Doug Collins and 1990s NBA Legends for Spurs-Sixers on March 3

Legendary sports broadcasters Bob Costas, Doug Collins, Mike Czar of the Telest...

27/02/2026

IndyCar Season Opener: A Look into the Productions' Fast-Track IP Transformation

Beginning on March 1st, IndyCar will be kicking off their 31st season on the str...

27/02/2026

SVG GameDay, Ep. 5: Philadelphia Eagles' Alessandra Lane Behind the Birds in South Philly

In-venue and creative video staffers at the professional and collegiate level ha...

27/02/2026

Ratings Roundup: NBC Sports Takes Gold With Milano Cortina 2026 Viewership Hitting All Time Highs

Ratings Roundup is a rundown of recent rating news and is derived from press rel...

27/02/2026

SMX League Partners with Owl AI as Official AI Trailblazer to Revolutionize Action Sports Technology

Owl AI a pioneer in artificial intelligence for professional sports, announces a...

27/02/2026

Formula 1 Re-Ups Broadcast Deals With beIN SPORTS in Asia, ESPN in Latin America/Caribbean,

With over 447 million fans in APAC, Formula 1 and beIN will continue to innovate...

27/02/2026

YES Network Spotlights Brooklyn Youth With Kid Reporter' Broadcast Initiative

12-year-old Noelle Taylor will be the Kid Reporter when the Brooklyn Nets host t...

27/02/2026

ESPN Explores New POVs With CapCam, EarCam for NBA All-Star Celebrity Game, College Softball

Entire CapCam system - including camera unit, RF transmitter, and battery - is h...

27/02/2026

Gorillaz Invites Fans Into Its World With Exclusive Spotify Experience and London Mural Quest

Since its inception, Gorillaz has been known for blending art with genre-bending...

27/02/2026

Find Your Next Great Listen With Spotify's New Audiobook Charts

This week, Spotify introduced Audiobook Charts for the U.S. and U.K. The charts make it easy to discover your next favorite book by showing what's popular a...

27/02/2026

Rohde & Schwarz and Viasat to collaborate on NB-NTN IoT test plan for connectivity via satellite

Rohde & Schwarz and Viasat to collaborate on NB-NTN IoT test plan for connectivi...

27/02/2026

Designing AI based features in the MSC

In media technology, big features often steal the spotlight - AI integrations, cloud transformations, automation frameworks. But for the people who use these to...

27/02/2026

Has Video outgrown your DAM?

Digital Asset Management systems sit at the heart of most marcoms operations. They centralise content, organise it, and make it discoverable. Integrated with th...

27/02/2026

NAB

The AI Wild West comes to NAB 2026 and Blue Lucy is bringing the Sheriff The AI Wild West is here, and media organisations are feeling the heat. On Booth W23...

27/02/2026

32.6 Million Watch 2026 State of the Union Address

NEW YORK - February 26, 2026 - An estimated 32.6 million people watched President Donald J. Trump deliver the 2026 State of the Union address on Tuesday, Februa...

27/02/2026

Atomos Introduces Ninja RAW HDR Monitor-Recorder At CP+ Trade Show

Share Copy link Facebook X Linkedin Bluesky Email...

27/02/2026

Scripps to Reacquire 23 ION Stations

Share Copy link Facebook X Linkedin Bluesky Email...

27/02/2026

Survey: 88% of Sports Execs Bullish on Industry's Prospects

Share Copy link Facebook X Linkedin Bluesky Email...

27/02/2026

Chyron Releases Weather 2.3 With New Data and Automation Features

Share Copy link Facebook X Linkedin Bluesky Email...

27/02/2026

Samba: Essentials, Utilities Tally Most Ad Impressions in 2nd-Half 2025

Share Copy link Facebook X Linkedin Bluesky Email...

27/02/2026

TVB: Linear TV Still King

Share Copy link Facebook X Linkedin Bluesky Email...

27/02/2026

AJA U-TAP Helps Stary Technologies with Courtroom Evidenc...

Video is one of the lawyer's most powerful storytelling tools in civil litigation today, whether used to transport jurors to an incident scene or challenge ...

27/02/2026

Foundry releases Nuke 17-0

Creative software developer Foundry today released Nuke 17.0, the latest version of its powerful compositing tool for visual effects and animation. Marking one ...