
Editor's note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copilots. The series also highlights the NVIDIA software and hardware powering advanced AI agents, which form the foundation of AI query engines that gather insights and perform tasks to transform everyday experiences and reshape industries.
Today's computer vision systems excel at identifying what happens in physical spaces and processes, but they cannot explain the details of a scene and why they matter, or reason about what might happen next.
Agentic intelligence powered by vision language models (VLMs) can help bridge this gap, giving teams quick, easy access to key insights and analyses that connect text descriptors with spatial-temporal information and billions of visual data points captured by their systems every day.
Three approaches organizations can use to boost their legacy computer vision systems with agentic intelligence are to:
Apply dense captioning for searchable visual content.
Augment system alerts with detailed context.
Use AI reasoning to summarize information from complex scenarios and answer questions.
Making Visual Content Searchable With Dense Captions
Traditional convolutional neural network (CNN)-powered video search tools are constrained by limited training, context and semantics, which makes extracting insights manual, tedious and time-consuming. CNNs are tuned to perform specific visual tasks, like spotting an anomaly, and lack the multimodal ability to translate what they see into text.
Businesses can embed VLMs directly into their existing applications to generate highly detailed captions of images and videos. These captions turn unstructured content into rich, searchable metadata, enabling visual search that's far more flexible - not constrained by file names or basic tags.
For example, automated vehicle-inspection system UVeye processes over 700 million high-resolution images each month to build one of the world's largest vehicle and component datasets. By applying VLMs, UVeye converts this visual data into structured condition reports, detecting subtle defects, modifications or foreign objects with exceptional accuracy and reliability for search.
VLM-powered visual understanding adds essential context, ensuring transparent, consistent insights for compliance, safety and quality control. UVeye detects 96% of defects compared with 24% using manual methods, enabling early intervention to reduce downtime and control maintenance costs.
https://blogs.nvidia.com/wp-content/uploads/2025/11/UVeye-video-1.mp4
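As a rough illustration of how dense captions become searchable metadata, the sketch below stores timestamped captions in a small inverted index and answers keyword queries against them. The `Caption` and `CaptionIndex` names, the sample captions and the rule that every query term must match are assumptions made for this example. This is not how UVeye or any NVIDIA product implements search, and a production system would more likely search over embeddings of VLM output.

```python
import re
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Caption:
    """One VLM-generated caption for a time span of a video (hypothetical schema)."""
    video_id: str
    start_s: float
    end_s: float
    text: str


def tokenize(text: str) -> set[str]:
    # Lowercase and split on anything that is not a letter or digit.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


class CaptionIndex:
    """Minimal inverted index: token -> positions of captions containing it."""

    def __init__(self) -> None:
        self._captions: list[Caption] = []
        self._postings: dict[str, set[int]] = defaultdict(set)

    def add(self, caption: Caption) -> None:
        position = len(self._captions)
        self._captions.append(caption)
        for token in tokenize(caption.text):
            self._postings[token].add(position)

    def search(self, query: str) -> list[Caption]:
        # Return captions that contain every query term, in insertion order.
        terms = tokenize(query)
        if not terms:
            return []
        matches = set.intersection(*(self._postings.get(t, set()) for t in terms))
        return [self._captions[i] for i in sorted(matches)]


# Sample captions are invented for illustration only.
index = CaptionIndex()
index.add(Caption("lane1", 0.0, 5.0,
                  "A red pickup truck with a dented rear bumper enters the inspection lane"))
index.add(Caption("lane1", 5.0, 10.0,
                  "A white sedan with a cracked windshield and a missing hubcap"))

for hit in index.search("dented bumper"):
    print(hit.video_id, hit.start_s, hit.end_s, hit.text)
```

Because the query runs against caption text rather than file names or fixed tags, any detail the captioning model describes becomes searchable without retraining a detector for it.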
Relo Metrics, a provider of AI-powered sports marketing measurement, helps brands quantify the value of their media investments and optimize their spending. By combining VLMs with computer vision, Relo Metrics moves beyond basic logo detection to capture context - like a courtside banner shown during a game-winning shot - and translate it into real-time monetary value.
This contextual-insight capability highlights when and how logos appear, especially in high-impact moments, giving marketers a clearer view of return on investment and ways to optimize strategy. For example, Stanley Black & Decker, including its Dewalt brand, previously relied on end-of-season reports to evaluate sponsor asset performance, limiting timely decision-making. Using Relo Metrics for real-time insights, Stanley Black & Decker adjusted signage positioning and saved $1.3 million in potentially lost sponsor media value.
Augmenting Computer Vision System Alerts With VLM Reasoning
CNN-based computer vision systems often generate binary detection alerts such as yes or no, and true or false. Without the reasoning power of VLMs, that can mean false positives and missed details - leading to costly mistakes in safety and security, as well as lost business intelligence.
Rather than replacing these CNN-based computer vision systems entirely, VLMs can easily augment these systems as an intelligent add-on. With a VLM layered on top of CNN-based computer vision systems, detection alerts are not only flagged but reviewed with contextual understanding - explaining where, how and why the incident occurred.
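The layering described above might look something like the following sketch, in which a detector's binary alert is passed to a VLM for a second opinion before it reaches an operator. Everything here is hypothetical: `Alert`, `verify_alert` and the `ask_vlm` callable are illustrative names, `ask_vlm` stands in for whatever VLM endpoint a team actually uses, and `fake_vlm` returns canned answers so the example runs without a model. Parsing the verdict by checking for a leading "yes" is a deliberate simplification.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Alert:
    """Binary alert emitted by an existing CNN-based detector (hypothetical schema)."""
    camera_id: str
    label: str
    confidence: float
    clip_path: str


@dataclass
class EnrichedAlert:
    alert: Alert
    explanation: str  # the VLM's account of where, how and why


def build_prompt(alert: Alert) -> str:
    return (
        f"A detector flagged '{alert.label}' on camera {alert.camera_id}. "
        "Is this event actually occurring in the clip? "
        "Answer yes or no first, then explain where, how and why."
    )


def verify_alert(
    alert: Alert, ask_vlm: Callable[[str, str], str]
) -> Optional[EnrichedAlert]:
    """Return the alert with VLM context if confirmed, or None if rejected."""
    answer = ask_vlm(alert.clip_path, build_prompt(alert)).strip()
    # Simplistic verdict parsing: assumes the model leads with "yes" or "no".
    if not answer.lower().startswith("yes"):
        return None
    return EnrichedAlert(alert=alert, explanation=answer)


def fake_vlm(clip_path: str, prompt: str) -> str:
    # Stand-in for a real VLM call; answers are invented for illustration.
    canned = {
        "clips/flood_017.mp4": "Yes. Standing water covers both lanes near the underpass.",
        "clips/flood_018.mp4": "No. The road is wet and reflective, but there is no standing water.",
    }
    return canned[clip_path]


confirmed = verify_alert(Alert("cam-17", "flooding", 0.91, "clips/flood_017.mp4"), fake_vlm)
rejected = verify_alert(Alert("cam-18", "flooding", 0.88, "clips/flood_018.mp4"), fake_vlm)
print(confirmed.explanation if confirmed else "suppressed")
print(rejected.explanation if rejected else "suppressed")
```

The existing detector stays in place as a cheap first pass over every frame, and the VLM is called only on the clips that the detector flags.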
For smarter city traffic management, Linker Vision uses VLMs to verify critical city alerts, such as traffic accidents, flooding, or falling poles and trees from storms. This reduces false positives and adds vital context to each event to improve real-time municipal response.
https://blogs.nvidia.com/wp-content/uploads/2025/11/Updated-VLM-1-1.mp4
Linker Vision's architecture for agentic AI involves automating event analysis from over 50,000 diverse smart city camera streams to enable cross-department remediation - coordinating actions across teams like traffic control, utilities and first responders when incidents occur. The ability to query across all camera streams simultaneously enables systems to quickly and automatically turn observations into insights and trigger recommendations for next best actions.
Automatic Analysis of Complex Scenarios With Agentic AI
Agentic AI systems can process, reason and answer complex queries across video streams and modalities - such as audio, text, video and sensor data. This is made possible by combining VLMs with reasoning models, large language models (LLMs), retrieval-augmented generation (RAG), computer vision and speech transcription.
Basic integration of a VLM into an existing computer vision pipeline is helpful in verifying short video clips of key moments. However, this approach is limited by how many visual tokens a single model can process at once, resulting in surface-level answers that lack context over longer time periods and external knowledge.
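To make the visual-token limit concrete, the sketch below estimates how many sampled frames fit in one model call and how many separate calls a long video would then need. The context size, tokens per frame and sampling rate are made-up round numbers chosen for the arithmetic, not the specification of any particular VLM, and the function names are hypothetical.

```python
import math


def frames_per_window(context_tokens: int, tokens_per_frame: int,
                      reserved_text_tokens: int) -> int:
    """Number of frames that fit in one call after reserving room for text."""
    usable = context_tokens - reserved_text_tokens
    if usable < tokens_per_frame:
        raise ValueError("context too small for even one frame")
    return usable // tokens_per_frame


def split_into_windows(duration_s: float, sample_fps: float,
                       max_frames: int) -> list[tuple[float, float]]:
    """Split a video into consecutive (start, end) spans of at most max_frames samples."""
    window_s = max_frames / sample_fps
    count = math.ceil(duration_s / window_s)
    return [(i * window_s, min((i + 1) * window_s, duration_s)) for i in range(count)]


# Illustrative assumptions: 32,000-token context, 256 tokens per frame,
# 2,000 tokens reserved for the prompt and answer, one sampled frame per second.
max_frames = frames_per_window(32_000, 256, 2_000)
windows = split_into_windows(duration_s=3600, sample_fps=1.0, max_frames=max_frames)
print(max_frames, len(windows))
```

Under these assumed numbers a single call covers just under two minutes of video, so an hour of footage becomes 31 independent calls, none of which sees what happened in the others.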
In contrast, whole architectures built on agentic AI enable scalable, accurate processing of lengthy and multichanne