Sony Pixel Power calrec Sony

How Do You Teach an AI Model to Reason? With Humans

27/08/2025

AI models are advancing at a rapid rate and scale.

But what might they lack that (most) humans don't? Common sense: an understanding, developed through real-world experiences, that birds can't fly backwards, mirrors are reflective and ice melts into water.

While such principles seem obvious to humans, they must be taught to AI models tasked with accurately answering complex questions and navigating unpredictable physical environments, such as industrial warehouses or roads.

NVIDIA is tackling this challenge by developing a set of tests to coach AI models on the limitations of the physical world. In other words, to teach AI common sense.

These tests are used to develop reasoning models such as NVIDIA Cosmos Reason, an open reasoning vision language model (VLM) used for physical AI applications that are proficient in generating temporally grounded responses. Cosmos Reason just topped the physical reasoning leaderboard on Hugging Face.

Cosmos Reason is unique compared with previous VLMs as it's designed to accelerate physical AI development for fields such as robotics, autonomous vehicles and smart spaces. The model can infer and reason through unprecedented scenarios using physical common-sense knowledge.

For models to understand complex environments - including industrial spaces and laboratories - they must start small. For example, in the test depicted below, the Cosmos Reason model is tasked with answering a multiple-choice question about the relative motion in the video:

https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_DrivingExample.mp4

Example from Cosmos Reason evaluation dataset

What Does Reasoning Look Like for an AI Model? To develop their reasoning capabilities, NVIDIA models are being taught physical common sense about the real world via reinforcement learning.

For example, robots don't intuitively know which way is left, right, up or down. They're taught these spatial-temporal limitations through training. AI-powered robots used in safety testing, such as vehicle crash testing, must be taught to be aware of how their physical forms interact with their surroundings.

Without embedding common sense into the training of these robots, issues can arise in deployment.

Without basic knowledge about the physical world, a robot may fall down or accidentally break something, causing danger to the surrounding people and environment, said Yin Cui, a Cosmos Reason research scientist at NVIDIA.

Distilling human common sense about the physical world into models is how NVIDIA is bringing about the next generation of AI.

Enter the NVIDIA data factory team: a group of global analysts who come from various backgrounds - including bioengineering, business and linguistics. They're working to develop, analyze and compile hundreds of thousands of data units that will be used to train generative AI models on how to reason.

The Data Curation Process One of the NVIDIA data factory team's projects focuses on the development of world foundation models for physical AI applications. These virtual environments create deep learning neural networks that are safer and more effective for training reasoning models, based on simulated domains.

It all starts with an NVIDIA annotation group that creates question-and-answer pairs based on video data. These videos are all from the real world and can include any type of footage, whether depicting chickens walking around in their coop or cars driving on a rural road.

For example, an annotator might ask about the video below: The person uses which hand to cut the spaghetti?

https://blogs.nvidia.com/wp-content/uploads/2025/08/ModelReasoning_SpaghettiExample.mp4

Example from Cosmos Reason evaluation dataset

The annotators then come up with four multiple choice answers labeled A, B, C and D. The model is fed the data and has to reason and choose the correct answer.

We're basically coming up with a test for the model, said Cui. All of our questions are multiple choice, like what students would see on a school exam.

These question-and-answer pairs are then quality checked by NVIDIA analysts, such as Michelle Li.

Li has a background in public health and data analytics, which allows her to look at the broader purpose of the data she analyzes.

For physical AI, we have a specific goal of wanting to train models on understanding the physical world, which helps me think about the bigger picture when I'm looking at the Q&A pairs and the types of questions that are being presented, Li said. I ask myself, do the Q&A pairs that I'm looking at align with our objectives for the guidelines that we have for the project?

After this, the data is reviewed by the data factory leads of the project, who make sure it's up to quality standards and ready to be sent to the Cosmos Reason research team. The scientists then feed the hundred thousands of data units - in this case the Q&A pairs - to the model, training it with reinforcement learning on the bounds and limitations of the physical world.

What Are the Applications of Reasoning AI? Reasoning models are exceptional because they can make sense of their temporal space as well as predict outcomes. They can analyze a situation, come up with a thought web of probable outcomes and infer the most likely scenario.

Simply put, reasoning AI demonstrates humanlike thinking. It shows its work, giving the user insight into the logic behind its responses.

Users can ask these models to analyze a video such as of two cars driving on a road. When asked a question like, What would happen if the cars were driving toward each other on the same lane? the model can reason and determine the most probable outcome of the proposed scenario - for example, a car crash.

We're building a pioneering reasoning model focused on physical AI, said Tsung-Yi Lin, a princ
LINK: https://blogs.nvidia.com/blog/ai-reasoning-cosmos/...
See more stories from nvidia

Most recent headlines

06/10/2025

France Tlvisions Wins Prestigious 2025 EBU Technology & Innovation Award in Groundbreaking Collaboration with Dalet

France T l visions, France's leading broadcaster, has received the 2025 EBU ...

04/09/2025

Monumental Sports & Entertainment and Dalet Win Prestigious 2025 NAB Show Project of the Year Award

Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...

28/08/2025

Meet the 2025 Sundance Institute Documentary Edit Residency Artists

By Kristin Feeley, Director, Documentary Film & Artist Programs If you want to tell untold stories, if you want to give voice to the voiceless, you've got ...

28/08/2025

Watch These 9 Sundance Institute-Supported Documentaries That Spotlight Workers' Rights

Directed by Steven Bognar and Julia Reichert, Sundance Institute-supported Amer...

28/08/2025

Motivational Corridos: The New Sound of Resilience in Msica Mexicana

Corridos have been a cornerstone of M sica Mexicana for generations, telling stories rooted in everyday life. Now, a new chapter is taking shape: motivational c...

28/08/2025

Corridos Motivadores: El Nuevo Sonido de la Resiliencia en Mxico

Los corridos han sido un pilar de la M sica Mexicana durante generaciones, contando historias enraizadas en la vida cotidiana. Ahora, un nuevo cap tulo est tom...

28/08/2025

Verano Forever Brings Myke Towers, Bele, Elena Rose, and More to Miami for an Unforgettable Latin Summer Celebration

Earlier this month, we promised our Verano Forever party would bring the heat, a...

28/08/2025

Poland Selects L3Harris Electronic Warfare System for F-16 Fleet

L3Harris will provide the Polish F-16V fleet with the Viper Shield electronic warfare system as part of an upgrade program....

28/08/2025

AgileTV consolidates its technological leadership with the development of Lowi TV in Spain

Bilbao, August 26, 2025 - AgileTV, an international television and video technol...

28/08/2025

Craft Interview: Ken Wilkinson, Audio Engineer

Ken Wilkinson is an Emmy Awards nominated New York audio engineer who specialises in production sound mixing for film, commercial, episodic and documentary work...

28/08/2025

Fubo to Launch Fubo Sports Skinny Bundle for $56 Per Month

NEW YORK FuboTV today announced that it will launch Fubo Sports, a skinny bundle that focuses on sports with a subscription price of $56 monthly....

28/08/2025

Telestream to Launch 'Global Ingest Workflow at IBC2025

NEVADA City, Calif. At IBC2025, Sept. 12-15 at the RAI Amsterdam, Telestream will debut its new Global Ingest strategy, introducing a next-generation ingest arc...

28/08/2025

Dr. Rhoda Bernard Releases Groundbreaking Debut Book on Accessible Arts Education

Dr. Rhoda Bernard Releases Groundbreaking Debut Book on Accessible Arts Educatio...

28/08/2025

TAG Strengthens Regional Presence with Appointment of Oli...

TAG Video Systems, the leader in software-based IP end-to-end workflow monitoring, deep probing, and real-time visualization, has named Oliver Gappa as Sales Di...

28/08/2025

DHD to Demonstrate AI-Based Voice Enhancement at IBC 2025

AI-based voice enhancement will be among a series of innovations making their IBC 2025 debut on the DHD stand B46 in Hall 8 at the RAI Amsterdam Convention Cent...

28/08/2025

Telefonica Servicios Audiovisuales Hit the Back of the Ne...

Telef nica Servicios Audiovisuales (TSA), the leading system integrator and service provider in the media sector in Spain, with the support of Appear, the globa...

28/08/2025

Optical Media Anchors LiveU IQ into its On site Productio...

To fully immerse sailing fans in the world's biggest offshore yacht race, production company, Optical Media turned to LiveU's On-site Production solutio...

28/08/2025

WNED Adopts Calrec Type R console to weather any storm an...

Working with Calrec on its most recent overhaul, radio and television broadcaster, WNED has migrated to a fully IP infrastructure with multiple Type R consoles,...

28/08/2025

Cleeng unveils first ever free D2C subscription platform...

Cleeng, the Subscriber Retention Management (SRM ) inventor, has unveiled Cleeng Pro, the first-ever direct-to-consumer (D2C) subscription management platform t...

28/08/2025

Zixi and OKAST Partner to Power Scalable Global FAST Chan...

Zixi, the industry leader in live broadcast-quality video over IP, today announced that French media distribution platform OKAST has selected Zixi to enable rel...

28/08/2025

Nixer to unveil CV1 AoIP monitoring tool to address evolv...

Solution offers a streamlined, speaker-free architecture to optimize integration with premium external loudspeakers and advanced loudness metering Nixer Pro Au...

28/08/2025

Cinegy Announces Strategic Partnership with One Touch Pro...

Cinegy, the premier provider of software-defined television technology, has announced a strategic partnership with Vision One Touch Film Production Services L.L...

28/08/2025

Telestream Global Ingest Workflow Powered by Vantage Open...

Telestream, a global leader in media workflow technologies, will debut its new Global Ingest strategy at IBC2025, introducing a next-generation ingest architect...

28/08/2025

Telenor partners with Broadpeak for multi-country content...

Tier 1 operator selects Broadpeak to power high-performance, unified CDN solution across Norway, Sweden and Finland Broadpeak, a leader in streaming and moneti...

28/08/2025

24 Frames Digital goes live with Synamedia Quortex Play f...

Leading video software provider, Synamedia, today announced that 24 Frames Digital, one of India's leading live event streaming service providers, has chose...

28/08/2025

VisualOn at IBC 2025 - Whats Next in AI Powered Video Str...

Meet VisualOn at IBC2025: See What's Next in AI-Powered Video Streaming Join VisualOn at IBC2025 and discover how our AI-driven Optimizer and advanced media...

28/08/2025

Wowza to Reveal Next-Gen Video Streaming Innovations at I...

IBC stand 5.F81 Wowza to Reveal Next-Gen Video Streaming Innovations at IBC 2025 Amsterdam, August 28, 2025 Wowza, a leader in video streaming infrastructur...

28/08/2025

VIDA Launches Media Factory Agentic AI-Powered Workflow A...

VIDA, the secure cloud-native media asset management platform, is launching at IBC Show 2025 Media Factory, a drag-and-drop workflow automation engine designed ...

28/08/2025

FCC Restores Accidentally Deleted ATSC 3.0 Rules

WASHINGTON The Federal Communications Commission has admitted that it inadvertently removed some rules relating to NextGen TV/ATSC 3.0 and has moved to correct ...

28/08/2025

NAB Launches NextGen TV News Technology Lab to Advance Local Journalism

WASHINGTON The National Association of Broadcasters (NAB) has announced the launch of the NextGen TV News Technology Lab, a three-year initiative designed to he...

28/08/2025

Brazil Makes It Official: New DTV+ Standard Leverages ATSC 3.0 Tech

BRASILIA Brazilian President Luiz In cio Lula da Silva has signed an official presidential decree establishing DTV+ (TV 3.0) incorporating many parts of the ATS...

28/08/2025

Airwaves Battle Brews Over Upper C-Band at FCC

With the FCC's spectrum auction authority back in hand, lines in the sand are being drawn for the potential reallocation of the Upper C-Band for 5G mobile b...

28/08/2025

Netflix Lays Out Guidance for Using Generative AI in Content Production

As the use of generative AI becomes more common in media operations and production, Netflix has laid out detailed guidelines for their use and provided guidance...

28/08/2025

Fox, YouTube TV Agree on Short Extension in Carriage Talks

Fox and YouTube TV have agreed to a short-term extension in their carriage talks as they try to replace the existing agreement that expired on August 27 at 5 pm...

28/08/2025

Boston Conservatory at Berklee Announces Center Stage Performances for 2025-2026 Season

Boston Conservatory at Berklee Announces Center Stage Performances for 2025-2026...

28/08/2025

VEON and Kyivstar to Host Investor Meeting on August 28, 2025

28 Aug 2025 VEON and Kyivstar to Host Investor Meeting on August 28, 2025 New York, August 28, 2025 - VEON Ltd. (Nasdaq: VEON), a global digital operator and K...

28/08/2025

UKTV acquires UK premiere of Acorn TV breakout hit Art Detectives in deal with Dynamic Television

UKTV has acquired the UK premiere of Art Detectives, the compelling new crime an...

28/08/2025

Esports World Cup 2025: Saudis Share Masterplan to Boost Esports on Global Stage

The new oil is sports': Saudis share masterplan to boost esports on global stage By Adrian Pennington Tuesday, August 26, 2025 - 08:49 Print This Story...

28/08/2025

Esports World Cup 2025: Team Falcons Defend Title as Broadcast Production Ramps Up the Game

Esports World Cup 2025: Team Falcons Defend Title as Broadcast Production Ramps ...

28/08/2025

Report: Mobile-First Content has Overtaken the Big Screen as the Way Fans Watch Sports

Report: Mobile-first content has overtaken the big screen as the way fans watch ...

28/08/2025

New Era: A Pro's Guide to ESPN's Expanded College Football Playoff Graphics Package

New Era: A Pro's Guide to ESPN's Expanded College Football Playoff Graph...

28/08/2025

College Football Kickoff 2025: The CW Heads Into ACC, Pac-12 Schedule With New Pregame CW Football Saturday Countdown'

College Football Kickoff 2025: The CW Heads Into ACC, Pac-12 Schedule With New P...

28/08/2025

Live From US Open 2025: Tennis Channel Returns With Onsite Studio Show

Live From US Open 2025: Tennis Channel Returns With Onsite Studio Show New set has a prime site; production is handled remotely By Ken Kerschbaumer, Editorial ...

28/08/2025

NBC Sports Drives Its Coverage of NASCAR Cup Series Into Postseason

NBC Sports Drives Its Coverage of NASCAR Cup Series Into Postseason Key technical upgrades for 2025 include modified Peacock Pit Box By Mark J Burns, SVG Contr...

28/08/2025

College Football Kickoff 2025: TNT Sports Launches First Big 12 Campaign With Onsite Crews, NEP ND6, 16 Cameras

College Football Kickoff 2025: TNT Sports Launches First Big 12 Campaign With On...

28/08/2025

College Football Kickoff 2025: ESPN Gears Up for 1,000+ Games With Massive Operational Footprint

College Football Kickoff 2025: ESPN Gears Up for 1,000+ Games With Massive Opera...

28/08/2025

F&F's GTX21, the Company's Largest Production Unit, Hits the Road for CBS College Football

F&F's GTX21, the Company's Largest Production Unit, Hits the Road for CB...

28/08/2025

MSPO 2025: True spectrum dominance in all domains - powered by Rohde & Schwarz

MSPO 2025: True spectrum dominance in all domains - powered by Rohde & Schwarz Rohde & Schwarz will showcase its solutions and capabilities supporting multi-d...

28/08/2025

Genie, Make a Wish' Teaser Hints at a Magical Showdown Between Kim Woo-bin and Suzy

Back to All News Genie, Make a Wish' Teaser Hints at a Magical Showdown Be...

28/08/2025

Stuck on Repeat: Same Day with Someone' Trailer Unveils Heartfelt Journey of Embracing Your Worst Days

Back to All News Stuck on Repeat: Same Day with Someone' Trailer Unveils H...