Sony Pixel Power calrec Sony

AI On: 3 Ways to Bring Agentic AI to Computer Vision Applications

13/11/2025

Editor's note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copilots. The series also highlights the NVIDIA software and hardware powering advanced AI agents, which form the foundation of AI query engines that gather insights and perform tasks to transform everyday experiences and reshape industries.

Today's computer vision systems excel at identifying what happens in physical spaces and processes, but lack the abilities to explain the details of a scene and why they matter, as well as reason about what might happen next.

Agentic intelligence powered by vision language models (VLMs) can help bridge this gap, giving teams quick, easy access to key insights and analyses that connect text descriptors with spatial-temporal information and billions of visual data points captured by their systems every day.

Three approaches organizations can use to boost their legacy computer vision systems with agentic intelligence are to:

Apply dense captioning for searchable visual content.

Augment system alerts with detailed context.

Use AI reasoning to summarize information from complex scenarios and answer questions.

Making Visual Content Searchable With Dense Captions Traditional convolutional neural network (CNN)-powered video search tools are constrained by limited training, context and semantics, making gleaning insights manual, tedious and time-consuming. CNNs are tuned to perform specific visual tasks, like spotting an anomaly, and lack the multimodal ability to translate what they see into text.

Businesses can embed VLMs directly into their existing applications to generate highly detailed captions of images and videos. These captions turn unstructured content into rich, searchable metadata, enabling visual search that's far more flexible - not constrained by file names or basic tags.

For example, automated vehicle-inspection system UVeye processes over 700 million high-resolution images each month to build one of the world's largest vehicle and component datasets. By applying VLMs, UVeye converts this visual data into structured condition reports, detecting subtle defects, modifications or foreign objects with exceptional accuracy and reliability for search.

VLM-powered visual understanding adds essential context, ensuring transparent, consistent insights for compliance, safety and quality control. UVeye detects 96% of defects compared with 24% using manual methods, enabling early intervention to reduce downtime and control maintenance costs.

https://blogs.nvidia.com/wp-content/uploads/2025/11/UVeye-video-1.mp4

Relo Metrics, a provider of AI-powered sports marketing measurement, helps brands quantify the value of their media investments and optimize their spending. By combining VLMs with computer vision, Relo Metrics moves beyond basic logo detection to capture context - like a courtside banner shown during a game-winning shot - and translate it into real-time monetary value.

This contextual-insight capability highlights when and how logos appear, especially in high-impact moments, giving marketers a clearer view of return on investment and ways to optimize strategy. For example, Stanley Black & Decker, including its Dewalt brand, previously relied on end-of-season reports to evaluate sponsor asset performance, limiting timely decision-making. Using Relo Metrics for real-time insights, Stanley Black & Decker adjusted signage positioning and saved $1.3 million in potentially lost sponsor media value.

Augmenting Computer Vision System Alerts With VLM Reasoning CNN-based computer vision systems often generate binary detection alerts such as yes or no, and true or false. Without the reasoning power of VLMs, that can mean false positives and missed details - leading to costly mistakes in safety and security, as well as lost business intelligence.Rather than replacing these CNN-based computer vision systems entirely, VLMs can easily augment these systems as an intelligent add-on. With a VLM layered on top of CNN-based computer vision systems, detection alerts are not only flagged but reviewed with contextual understanding - explaining where, how and why the incident occurred.

For smarter city traffic management, Linker Vision uses VLMs to verify critical city alerts, such as traffic accidents, flooding, or falling poles and trees from storms. This reduces false positives and adds vital context to each event to improve real-time municipal response.

https://blogs.nvidia.com/wp-content/uploads/2025/11/Updated-VLM-1-1.mp4

Linker Vision's architecture for agentic AI involves automating event analysis from over 50,000 diverse smart city camera streams to enable cross-department remediation - coordinating actions across teams like traffic control, utilities and first responders when incidents occur. The ability to query across all camera streams simultaneously enables systems to quickly and automatically turn observations into insights and trigger recommendations for next best actions.

Automatic Analysis of Complex Scenarios With Agentic AI Agentic AI systems can process, reason and answer complex queries across video streams and modalities - such as audio, text, video and sensor data. This is possible by combining VLMs with reasoning models, large language models (LLMs), retrieval-augmented generation (RAG), computer vision and speech transcription.

Basic integration of a VLM into an existing computer vision pipeline is helpful in verifying short video clips of key moments. However this approach is limited by how many visual tokens a single model can process at once, resulting in surface-level answers without context over longer time periods and external knowledge.

In contrast, whole architectures built on agentic AI enable scalable, accurate processing of lengthy and multichanne
LINK: https://blogs.nvidia.com/blog/ways-to-bring-agentic-ai-to-computer-vis...
See more stories from nvidia

North America Stories

10/12/2025

Sound-Alike Commercials Are Part of Sports' Soundtrack

Sound-Alike Commercials Are Part of Sports' Soundtrack Johnny Cash for Coca-Cola is the latest in a long litany of sonic approximationsBy Dan Daley, Audio ...

10/12/2025

Immersive Sound Is Logical Next Step for Sports Venues

Immersive Sound Is Logical Next Step for Sports VenuesSound-systems suppliers are sanguine, but the market has its challengesBy Dan Daley, Audio Editor Wednes...

10/12/2025

The Romans Built Arenas for Immersive Sound 2,000 Years Ago

The Romans Built Arenas for Immersive Sound 2,000 Years AgoThe historic Arena of Nimes in France is still in use todayBy Dan Daley, Audio Editor Wednesday, De...

10/12/2025

SVG Summit 2025 Preview: Audio Workshop Hits on Immersive, Virtualized, and Next-Gen Streaming Workflows

SVG Summit 2025 Preview: Audio Workshop Hits on Immersive, Virtualized, and Next...

10/12/2025

SVG Summit 2025 Technology Exhibits Preview: Audio Spotlight

SVG Summit 2025 Technology Exhibits Preview: Audio SpotlightBy SVG Staff Wednesday, December 10, 2025 - 8:21 am Print This Story | Subscribe Story Highlig...

10/12/2025

SVG Europe Audio: Listening to the Sounds of Powder and Ice at Milano Cortina with a Behind the Scenes Tour of OBS and NBC's Audio Set Ups

SVG Europe Audio: Listening to the sounds of powder and ice at Milano Cortina wi...

10/12/2025

Advancements in Audio Technology: Capturing the Atmosphere of Live Sports

Advancements in audio technology: Capturing the atmosphere of live sports By David Davies Tuesday, November 25, 2025 - 09:27 Print This Story Although wor...

10/12/2025

Everything Smelled of Popcorn: The Art of Bringing the Complex Sound of Esports to Fans With Sound Supervisor Matt Gilbert

Everything smelled of popcorn: The art of bringing the complex sound of esports ...

10/12/2025

2026 Sundance Film Festival Unveils 97 Projects Selected for the Feature Film and Episodic Program

Top L-R: Ha-Chan, Shake Your Booty!, Hanging by a Wire, Broken English, Buddy C...

10/12/2025

L3Harris to Produce Additional Solid Rocket Motors for Precision-Guided Artillery System

L3Harris' new contract for Guided Multiple Launch Rocket System Insensitive ...

10/12/2025

US Space Force Expands Offensive Space Programs Through L3Harris Foreign Sales

L3Harris Meadowlands system has been designed with an open architecture software system that allows for more flexible and efficient software updates. This capab...

10/12/2025

Football Shifts TV Viewing Towards Ad Supported, Nielsen's Q3 2025 Ad Supported Gauge Finds

During this interval, streaming comprised the majority of ad supported TV (46.4%...

10/12/2025

Bitcentral Names Venture Capital Exec Rick Arnold to Board

NEWPORT BEACH, Calif. Bitcentral, a provider of production, asset management, playout and streaming workflow solutions, has named technology veteran Rick Arnold...

10/12/2025

TV Tech Announces Winners of 2025 Best in Market Awards for M&E Tech

TV Tech is delighted to reveal the winners of the 2025 Media & Entertainment: Best in Market Awards....

10/12/2025

AIMS, VSF, AMWA, EBU To Hold Inaugural IPMX Testing, Certification Event

BOTHELL, Wash. The Alliance for IP Media Solutions (AIMS), the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA) and the European Broad...

10/12/2025

DirecTV Launches Peacock Games

In a notable example of how pay TV operators are integrating streaming services into their lineup and using those services to retain or attract subscribers, Dir...

10/12/2025

Chaos Brings Real-Time Rendering to Maya and Houdini

Today, Chaos builds instant feedback into the viewport, connecting Maya and Houdini to Chaos Vantage's real-time path tracer. Artists can now assess 3D asse...

10/12/2025

Smeup doubles capacity with Cubbit under a new agreement...

Smeup, a key partner for companies engaged in digital transformation, today announced the expansion of its adoption of Cubbit, the first geo-distributed cloud s...

10/12/2025

Mediagenix Strengthens Its Security Posture with ISO 2700...

Mediagenix, a global leader in smart content solutions to profitably connect the right content to the right audience, today announced two significant milestones...

10/12/2025

HDR10+ Technologies Unveils HDR10+ ADVANCED Dynamic Metadata Technology

BEAVERTON, Ore. HDR10+ Technologies, LLC has announced that they will soon begin the licensing and certification of devices, content, and services that support ...

10/12/2025

SMPTE, EBU, ETC Publish Report on AI's Impact on the Media

SMPTE has joined forces with the European Broadcasting Union (EBU) and Entertainment Technology Center (ETC) to publish an updated report on AI and its impact o...

10/12/2025

Clear-Com Appoints Kris Koch as New Director of Sales - N...

Clear-Com is pleased to announce the appointment of Kris Koch as Director of Sales - North & South America. In this expanded leadership role, Kris will oversee...

10/12/2025

Mavis Camera Launches Film Kit Unlocking LUT Workflows an...

Mavis today announced the latest version of Mavis Camera (v7.4), a major update to its professional iOS camera app, headlined by the launch of Film Kit - an opt...

10/12/2025

Creamsource Taps Industry Heavyweight Markus Zeiler as Gr...

Creamsource, renowned for its Vortex series of cinematic lighting, is laying the groundwork for its next phase of growth with the addition of Markus Zeiler as G...

10/12/2025

Digital Alert Systems Introduces DAS3-DC-PS DASDEC-III DC...

Digital Alert Systems, a global leader in emergency communications solutions for media providers, today announced that the DAS3-DC-PS, a new DC power supply opt...

10/12/2025

Riedel and Racing Electronics Announce Strategic Partners...

Riedel Communications today announced it has formed a strategic partnership with Racing Electronics, a premier provider of motorsport communication equipment in...

10/12/2025

GALSNGEAR Announces 2026 Leadership Retreats on East and...

#GALSNGEAR is launching two signature leadership retreats in early 2026, designed to equip women in media, entertainment, and technology with the tools to lead...

10/12/2025

CVP Launches Global Price Guarantee for Seamless Internat...

Providing worldwide customers with total confidence through transparent, all-inclusive pricing CVP, one of Europe's leading suppliers of professional video...

10/12/2025

Securing the Future of Broadcast TV in the U.S.

With the Federal Communications Commission working on new rules for the deployment of NextGen TV, next year promises to be an important one for both the future ...

10/12/2025

Former Charter CEO Tom Rutledge to Receive Cable Centers Bresnan Award

DENVER Tom Rutledge, director emeritus and former president and CEO of Charter Communications, will be honored with the 2026 Bresnan Ethics in Business Award by...

10/12/2025

Cadent Acquires YouTube Measurement Firm VuePlanner

NEW YORK Novocap's Cadent has acquired VuePlanner, a YouTube video ad planning, optimization, and measurement company in a deal that will help Cadent expand...

10/12/2025

Avoid Playlist Conflicts: Scheduling Back-to-Back Special Playlists

In preparation for the madness of March, here are some important reminders for scheduling back-to-back Special Playlists. The first Special Playlist MUST end b...

10/12/2025

Tribeca Films to Release the Independent Documentary Film Beam Me Up, Sulu by Timour Gregory and Sasha Schneider

December 10th, 2025 TRIBECA FILMS TO RELEASE THE INDEPENDENT DOCUMENTARY FILM...

10/12/2025

2026 Starts With a Swoon: Kim Seon-ho and Go Youn-jung Lead Can This Love Be Translated?', Premiering January 16

Back to All News 2026 Starts With a Swoon: Kim Seon-ho and Go Youn-jung Lead C...

10/12/2025

'Berlin and the Lady with an Ermine' Arrives to Netflix on May 15

Back to All News Berlin and the Lady with an Ermine Arrives to Netflix on May 15 Entertainment 10 December 2025 GlobalSpain Link copied to clipboard THE N...

09/12/2025

2025 Sports Broadcasting Hall of Fame: Pam Oliver, Sideline Icon Who Redefined the Role

2025 Sports Broadcasting Hall of Fame: Pam Oliver, Sideline Icon Who Redefined t...

09/12/2025

SVG Summit 2025 Technology Exhibits Preview, Part 2

SVG Summit 2025 Technology Exhibits Preview, Part 2By Jason Dachman, Editorial Director, U.S. Tuesday, December 9, 2025 - 7:17 am Print This Story | Subscr...

09/12/2025

SVG Summit 2025 Preview: Cloud Production Workshop Spotlights Live and Non-Live Workflows in the Cloud

SVG Summit 2025 Preview: Cloud Production Workshop Spotlights Live and Non-Live ...

09/12/2025

Next-Generation Content Protection: Multi-Technology Security is Integral to Combating New Threats

Next-generation content protection: Multi-technology security is integral to com...

09/12/2025

CBS Sports Provides One-of-a-Kind Production' for UEFA Champions League Crossover Event

CBS Sports Provides One-of-a-Kind Production' for UEFA Champions League Cro...

09/12/2025

Spanish Professional Basketball League Relies on NETGEAR AV, MAM Tech for Seamless Production

Spanish Professional Basketball League Relies on NETGEAR AV, MAM Tech for Seamle...

09/12/2025

SVG Sit-Down: St. Thomas's Mike Gallagher and Casey Eakins on the Tommies' Bold Leap to Division I and How Video Plays a Key Role

SVG Sit-Down: St. Thomas's Mike Gallagher and Casey Eakins on the Tommies...

09/12/2025

Free Registration for the 2025 SVG Summit Closes Today at 5 p.m. ET!

Free Registration for the 2025 SVG Summit Closes Today at 5 p.m. ET!After the deadline, tickets will cost $150 to attend the eventBy SVG Staff Tuesday, Decemb...

09/12/2025

University of St. Thomas Ushers in Division I Era With New Arena and a Broadcast Operation Built for the Big Time

University of St. Thomas Ushers in Division I Era With New Arena and a Broadcast...

09/12/2025

Leading the Charge: L3Harris' Advanced EW Technologies for Superior Battlefield Advantage

For decades, customers have turned to L3Harris capabilities in electronic warfar...

09/12/2025

L3Harris Propulsion Solutions: Securing the High Ground in Space

L3Harris propulsion systems enable agile maneuvering and resilience for U.S. spacecraft, supporting national security and mission success in the evolving space ...

09/12/2025

L3Harris Successfully Completes Critical Design Review of Key Components for Japan's New Geostationary Meteorological Satellite

L3Harris Technologies' imaging and sounding instruments will play a critical...

09/12/2025

James Shears Joins ThinkAnalytics as Senior VP, Advertising

LOS ANGELES James Shears has joined ThinkAnalytics as senior vice president, advertising, tasked with leading global strategy and commercial expansion of the co...

09/12/2025

Kathleen Kirby, Heidi Raphael Join Media Institute Board

VIENNA, Va. The Media Institute, a nonprofit, nonpartisan organization specializing in communications policy and the First Amendment, has named Kathleen Kirby o...

09/12/2025

Hollywood's Ecosystem for Combating Piracy

Throughout its century-plus existence, the motion picture industry has had to fight battles on multiple fronts to protect its content from piracy. From protecti...