Sony Pixel Power calrec Sony

AI On: 3 Ways to Bring Agentic AI to Computer Vision Applications

13/11/2025

Editor's note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copilots. The series also highlights the NVIDIA software and hardware powering advanced AI agents, which form the foundation of AI query engines that gather insights and perform tasks to transform everyday experiences and reshape industries.

Today's computer vision systems excel at identifying what happens in physical spaces and processes, but lack the abilities to explain the details of a scene and why they matter, as well as reason about what might happen next.

Agentic intelligence powered by vision language models (VLMs) can help bridge this gap, giving teams quick, easy access to key insights and analyses that connect text descriptors with spatial-temporal information and billions of visual data points captured by their systems every day.

Three approaches organizations can use to boost their legacy computer vision systems with agentic intelligence are to:

Apply dense captioning for searchable visual content.

Augment system alerts with detailed context.

Use AI reasoning to summarize information from complex scenarios and answer questions.

Making Visual Content Searchable With Dense Captions Traditional convolutional neural network (CNN)-powered video search tools are constrained by limited training, context and semantics, making gleaning insights manual, tedious and time-consuming. CNNs are tuned to perform specific visual tasks, like spotting an anomaly, and lack the multimodal ability to translate what they see into text.

Businesses can embed VLMs directly into their existing applications to generate highly detailed captions of images and videos. These captions turn unstructured content into rich, searchable metadata, enabling visual search that's far more flexible - not constrained by file names or basic tags.

For example, automated vehicle-inspection system UVeye processes over 700 million high-resolution images each month to build one of the world's largest vehicle and component datasets. By applying VLMs, UVeye converts this visual data into structured condition reports, detecting subtle defects, modifications or foreign objects with exceptional accuracy and reliability for search.

VLM-powered visual understanding adds essential context, ensuring transparent, consistent insights for compliance, safety and quality control. UVeye detects 96% of defects compared with 24% using manual methods, enabling early intervention to reduce downtime and control maintenance costs.

https://blogs.nvidia.com/wp-content/uploads/2025/11/UVeye-video-1.mp4

Relo Metrics, a provider of AI-powered sports marketing measurement, helps brands quantify the value of their media investments and optimize their spending. By combining VLMs with computer vision, Relo Metrics moves beyond basic logo detection to capture context - like a courtside banner shown during a game-winning shot - and translate it into real-time monetary value.

This contextual-insight capability highlights when and how logos appear, especially in high-impact moments, giving marketers a clearer view of return on investment and ways to optimize strategy. For example, Stanley Black & Decker, including its Dewalt brand, previously relied on end-of-season reports to evaluate sponsor asset performance, limiting timely decision-making. Using Relo Metrics for real-time insights, Stanley Black & Decker adjusted signage positioning and saved $1.3 million in potentially lost sponsor media value.

Augmenting Computer Vision System Alerts With VLM Reasoning CNN-based computer vision systems often generate binary detection alerts such as yes or no, and true or false. Without the reasoning power of VLMs, that can mean false positives and missed details - leading to costly mistakes in safety and security, as well as lost business intelligence.Rather than replacing these CNN-based computer vision systems entirely, VLMs can easily augment these systems as an intelligent add-on. With a VLM layered on top of CNN-based computer vision systems, detection alerts are not only flagged but reviewed with contextual understanding - explaining where, how and why the incident occurred.

For smarter city traffic management, Linker Vision uses VLMs to verify critical city alerts, such as traffic accidents, flooding, or falling poles and trees from storms. This reduces false positives and adds vital context to each event to improve real-time municipal response.

https://blogs.nvidia.com/wp-content/uploads/2025/11/Updated-VLM-1-1.mp4

Linker Vision's architecture for agentic AI involves automating event analysis from over 50,000 diverse smart city camera streams to enable cross-department remediation - coordinating actions across teams like traffic control, utilities and first responders when incidents occur. The ability to query across all camera streams simultaneously enables systems to quickly and automatically turn observations into insights and trigger recommendations for next best actions.

Automatic Analysis of Complex Scenarios With Agentic AI Agentic AI systems can process, reason and answer complex queries across video streams and modalities - such as audio, text, video and sensor data. This is possible by combining VLMs with reasoning models, large language models (LLMs), retrieval-augmented generation (RAG), computer vision and speech transcription.

Basic integration of a VLM into an existing computer vision pipeline is helpful in verifying short video clips of key moments. However this approach is limited by how many visual tokens a single model can process at once, resulting in surface-level answers without context over longer time periods and external knowledge.

In contrast, whole architectures built on agentic AI enable scalable, accurate processing of lengthy and multichanne
LINK: https://blogs.nvidia.com/blog/ways-to-bring-agentic-ai-to-computer-vis...
See more stories from nvidia

North America Stories

10/03/2026

Harvey Arnold, Bert Goldman to Be Honored at the 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

10/03/2026

Senators Urge FCC to Preserve Citizens Broadband Radio Service

Share Copy link Facebook X Linkedin Bluesky Email...

10/03/2026

SCTE TechExpo26 Issues Call for Content, Technical Papers

Share Copy link Facebook X Linkedin Bluesky Email...

10/03/2026

Zefr Receives MRC Accreditation

Share Copy link Facebook X Linkedin Bluesky Email...

10/03/2026

Study: Overloaded Sports Fans Fed Up with Fragmented Viewing Options

Share Copy link Facebook X Linkedin Bluesky Email...

10/03/2026

NVIDIA and Thinking Machines Lab Announce Long-Term Gigawatt-Scale Strategic Partnership

NVIDIA and Thinking Machines Lab announced today a multiyear strategic partnersh...

09/03/2026

Foos Gone Wild, Combate Global Launch New Televised MMA Fight Series

Foos Gone Wild and Combate Global have teamed up to create a twist on combat sports competition, announcing the launch of a special amateur Mixed Martial Arts (...

09/03/2026

Harmonic Accelerates Streaming and Broadcast Transformations

At the 2026 NAB Show, Harmonic will introduce significant enhancements to its video appliances and SaaS solutions, highlighted by a next-generation media server...

09/03/2026

ESPN Delivers Most-Watched MLB Spring Training Game in 10 years with Team USA vs. San Francisco Giants

ESPN's March 3 spring training matchup between Team USA and the San Francisc...

09/03/2026

Most Valuable Promotions Launches Women's Boxing Platform, Signs Multi-Year Deal with ESPN

Most Valuable Promotions (MVP) announces the launch of MVPW, a new global platfo...

09/03/2026

Behind The Mic: CBS Sports and TNT Sports Share NCAA Division 1 Mens's Basketball Tournament Commentators

Behind The Mic provides a roundup of recent news regarding on-air talent, includ...

09/03/2026

SVG All-Stars: Jenna McKeon, Senior Director, Remote Technical Operations, CBS Sports

From Super Bowl compounds to Final Four setups, the Hofstra graduate helps coord...

09/03/2026

NBC's Paralympic Effort Embraces Cloud for Signal Transport

Stamford plays a key role, but a small team in Cortina and Milan powers local presence and mixed-zone coverage...

09/03/2026

Save the Date: SVG's New Cloud & Content Workflows Summit in NYC on July 28

The event brings together SVG's previous Cloud Production and Content Management Forums into a single, comprehensive day of programming...

09/03/2026

2,000-Year-Old Arena Hosts Paralympics Opening Ceremony, Olympics Closing Ceremony

Updated Mar 9, 2026 Live surround sound has been a part of the plan for Roman a...

09/03/2026

Judge Rules VOA's Kari Lake Has Acted Unlawfully'

Share Copy link Facebook X Linkedin Bluesky Email...

09/03/2026

Utah Scientific Adds Three Companies To Technology Partner Program

Share Copy link Facebook X Linkedin Bluesky Email...

09/03/2026

Broadpeak Showcases Premium Live Streaming Advanced Monet...

Broadpeak, a leader in streaming and monetization at scale, will showcase its latest innovations for broadcasters and streaming platforms at NAB Show 2026 (boot...

09/03/2026

'The Predator of Seville' premieres on Netflix on 27 March

Back to All News The Predator of Seville premieres on Netflix on 27 March Entertainment 09 March 2026 GlobalSpain Link copied to clipboard Download the im...

09/03/2026

Netflix Debuts the Trailer for 'Love is Blind: Sweden' Season 3

Back to All News Netflix Debuts the Trailer for Love is Blind: Sweden Season 3 Entertainment 09 March 2026 GlobalSweden Link copied to clipboard That wait...

09/03/2026

How AI Is Driving Revenue, Cutting Costs and Boosting Productivity for Every Industry in 2026

AI is everywhere and accelerating everything - becoming essential infrastructure...

09/03/2026

ABB Robotics Taps NVIDIA Omniverse to Deliver IndustrialGrade Physical AI at Scale

ABB Robotics and NVIDIA today announced a breakthrough partnership that brings i...

07/03/2026

NAB Show: Tedial to Showcase Solutions for Future of Media Operations

Share Copy link Facebook X Linkedin Bluesky Email...

06/03/2026

TNT Sports Acquires Exclusive U.S. English Language Broadcast Rights to FIBA Men's and Women's Tournaments

TNT Sports and the International Basketball Federation (FIBA) have reached a mul...

06/03/2026

OffBall to Partner with TOGETHXR Across Commercial Strategy and Operations

OffBall and TOGETHXR, two influential young media companies in sports, announce a strategic and operational partnership in a shared push to scale and create inn...

06/03/2026

InfoComm 2026 Names Shure as Exclusive Headline Partner, Showcasing Audio and Innovation Across Key Activations and Stages

InfoComm 2026, a destination for AV, IT, broadcast, and AI-driven systems, annou...

06/03/2026

LTN, MediaKind Partner to Deliver Integrated Reliable IP Transport and Edge Processing

LTN and MediaKind announce a strategic partnership to integrate MediaKind's ...

06/03/2026

X Games Brings First-Ever Summer Championship Event to New Orleans in July

X Games and the Greater New Orleans Sports Foundation (GNOSF) announce that New Orleans, Louisiana, will host the first-ever X Games Championship event - the fi...

06/03/2026

Chyron Releases New Edition of AXIS Maps

As part of a busy start of the year at Chyron, the AXIS team developed a set of improvements for AXIS Maps. The features released empower users with more flexib...

06/03/2026

BeckTV Launches BeckFlow at 2026 NAB Show While Highlighting its Latest Design and Integration Projects

At the 2026 NAB Show, BeckTV, a premier systems integrator for the broadcast ind...

06/03/2026

Net Insight Sets New Standard for Live Media Operations with Nimbra Live Intelligence

At NAB Show 2026, Net Insight introduces Nimbra Live Intelligence, which definin...

06/03/2026

SES Adds New MEO Capacity as Latest O3b mPOWER Satellites Enter Commercial Service

SES announces that it has added new Medium Earth Orbit (MEO) satellite capacity ...

06/03/2026

LucidLink Launches Connect to Extend Instant Access to Data Stores

LucidLink, the cloud-native file streaming platform for instant, secure access to large files, announce LucidLink Connect, a new solution that enables real-time...

06/03/2026

Case Study: Panasonic Projection Brings Paddington to Life in London's West End

Panasonic helped bring the world of Paddington: The Musical to life through imme...

06/03/2026

SVG GameDay, Ep. 6: Cincinnati Bengals' Alex Schweppe - Welcome to the Jungle

In-venue and creative video staffers at the professional and collegiate level ha...

06/03/2026

Ratings Roundup: Post Olympics High, NHL on ESPN Viewership Spikes Over 50%

Ratings Roundup is a rundown of recent rating news and is derived from press releases and reports around the industry. In this week's edition, NASCAR Cup Se...

06/03/2026

Riedel Communications Demos Product Innovations at 2026 NAB Show

At the 2026 NAB Show, Riedel Communications opens with a clear message to the North American market: production technology does not have to be complex. This ye...

06/03/2026

SVG Sit-Down: Midco Sports' Andy Price and Craig DeWit on How the Dakotas-Based RSN Is Redefining Regional Sports Media

Midco Sports isn't your typical regional sports network. Backed by nearly a ...

06/03/2026

Inside the Paralympics with OBS Producer Josephine Xiaofan

The stories around the Paralympics are really touching and the athletes, the atmosphere it is all amazing and we want to use new technologies to help us tell th...

06/03/2026

Apple TV Kicks Off F1 Era in U.S. With Driver Tracker, On-Board Cameras, Multiview, Sky Sports Feed

Apple TV subscribers will have access to as many as 30 additional live feeds acr...

06/03/2026

Nielsen: 46 Billion Minutes of Women's Sports were Consumed in 2025*

Nielsen Highlights Women's Sports Viewership Milestones Ahead of International Women's Day on March 8 New York March 5, 2026 According to Nielsen&#...

06/03/2026

MXL Moves Into Stable Production Release

Share Copy link Facebook X Linkedin Bluesky Email...

06/03/2026

LTN, MediaKind Partner on Industry Transition From C-Band to IP

Share Copy link Facebook X Linkedin Bluesky Email...

06/03/2026

Roku Unveils 'Roklue' Interactive Content Discovery Feature

Share Copy link Facebook X Linkedin Bluesky Email...

06/03/2026

FCC Releases Tentative Agenda for March Open Meeting

Share Copy link Facebook X Linkedin Bluesky Email...

06/03/2026

ESPN to Air Animated Version of Capitals vs. Rangers NHL Game

Share Copy link Facebook X Linkedin Bluesky Email...

06/03/2026

Tedial to Highlight AI-Fueled Media Lifecycle at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

06/03/2026

esRadio Advances with DHD RX2 Audio Production Consoles

Spanish FM and online broadcaster esRadio has selected DHD RX2 audio production consoles for use at its studio headquarters in Madrid. Part of the Libertad Digi...

06/03/2026

LucidLink launches Connect to extend instant access to da...

LucidLink, the cloud-native file streaming platform for instant, secure access to large files, today announced LucidLink Connect, a new solution that enables re...

06/03/2026

BeckTV Launches BeckFlow at 2026 NAB Show While Highlight...

At the 2026 NAB Show, BeckTV, a premier systems integrator for the broadcast industry, is launching BeckFlow, the first web-based schematic documentation platfo...