Sony Pixel Power calrec Sony

Give AI a Look: Any Industry Can Now Search and Summarize Vast Volumes of Visual Data

04/11/2024

Enterprises and public sector organizations around the world are developing AI agents to boost the capabilities of workforces that rely on visual information from a growing number of devices - including cameras, IoT sensors and vehicles.

To support their work, a new NVIDIA AI Blueprint for video search and summarization will enable developers in virtually any industry to build visual AI agents that analyze video and image content. These agents can answer user questions, generate summaries and enable alerts for specific scenarios.

Part of NVIDIA Metropolis, a set of developer tools for building vision AI applications, the blueprint is a customizable workflow that combines NVIDIA computer vision and generative AI technologies.

Global systems integrators and technology solutions providers including Accenture, Dell Technologies and Lenovo are bringing the NVIDIA AI Blueprint for visual search and summarization to businesses and cities worldwide, jump-starting the next wave of AI applications that can be deployed to boost productivity and safety in factories, warehouses, shops, airports, traffic intersections and more.

Announced ahead of the Smart City Expo World Congress, the NVIDIA AI Blueprint gives visual computing developers a full suite of optimized software for building and deploying generative AI-powered agents that can ingest and understand massive volumes of live video streams or data archives.

Users can customize these visual AI agents with natural language prompts instead of rigid software code, lowering the barrier to deploying virtual assistants across industries and smart city applications.

NVIDIA AI Blueprint Harnesses Vision Language Models Visual AI agents are powered by vision language models (VLMs), a class of generative AI models that combine computer vision and language understanding to interpret the physical world and perform reasoning tasks.

The NVIDIA AI Blueprint for video search and summarization can be configured with NVIDIA NIM microservices for VLMs like NVIDIA VILA, LLMs like Meta's Llama 3.1 405B and AI models for GPU-accelerated question answering and context-aware retrieval-augmented generation. Developers can easily swap in other VLMs, LLMs and graph databases and fine-tune them using the NVIDIA NeMo platform for their unique environments and use cases.

Adopting the NVIDIA AI Blueprint could save developers months of effort on investigating and optimizing generative AI models for smart city applications. Deployed on NVIDIA GPUs at the edge, on premises or in the cloud, it can vastly accelerate the process of combing through video archives to identify key moments.

In a warehouse environment, an AI agent built with this workflow could alert workers if safety protocols are breached. At busy intersections, an AI agent could identify traffic collisions and generate reports to aid emergency response efforts. And in the field of public infrastructure, maintenance workers could ask AI agents to review aerial footage and identify degrading roads, train tracks or bridges to support proactive maintenance.

Beyond smart spaces, visual AI agents could also be used to summarize videos for people with impaired vision, automatically generate recaps of sporting events and help label massive visual datasets to train other AI models.

The video search and summarization workflow joins a collection of NVIDIA AI Blueprints that make it easy to create AI-powered digital avatars, build virtual assistants for personalized customer service and extract enterprise insights from PDF data.

NVIDIA AI Blueprints are free for developers to experience and download, and can be deployed in production across accelerated data centers and clouds with NVIDIA AI Enterprise, an end-to-end software platform that accelerates data science pipelines and streamlines generative AI development and deployment.

AI Agents to Deliver Insights From Warehouses to World Capitals Enterprise and public sector customers can also harness the full collection of NVIDIA AI Blueprints with the help of NVIDIA's partner ecosystem.

Global professional services company Accenture has integrated NVIDIA AI Blueprints into its Accenture AI Refinery, which is built on NVIDIA AI Foundry and enables customers to develop custom AI models trained on enterprise data.

Global systems integrators in Southeast Asia - including ITMAX in Malaysia and FPT in Vietnam - are building AI agents based on the video search and summarization NVIDIA AI Blueprint for smart city and intelligent transportation applications.

Developers can also build and deploy NVIDIA AI Blueprints on NVIDIA AI platforms with compute, networking and software provided by global server manufacturers.

Dell will use VLM and agent approaches with Dell's NativeEdge platform to enhance existing edge AI applications and create new edge AI-enabled capabilities. Dell Reference Designs for the Dell AI Factory with NVIDIA and the NVIDIA AI Blueprint for video search and summarization will support VLM capabilities in dedicated AI workflows for data center, edge and on-premises multimodal enterprise use cases.

NVIDIA AI Blueprints are also incorporated in Lenovo Hybrid AI solutions powered by NVIDIA.

Companies like K2K, a smart city application provider in the NVIDIA Metropolis ecosystem, will use the new NVIDIA AI Blueprint to build AI agents that analyze live traffic cameras in real time. This will enable city officials to ask questions about street activity and receive recommendations on ways to improve operations. The company also is working with city traffic managers in Palermo, Italy, to deploy visual AI agents using NIM microservices and NVIDIA AI Blueprints.

Discover more about the NVIDIA AI Blueprint for video search and summarization by visiting the NVIDIA booth at the Smart Cities Expo World Congress, taking place in Barcelona through Nov. 7.

Le
LINK: https://blogs.nvidia.com/blog/video-search-summarization-ai-agents/...
See more stories from nvidia

North America Stories

18/03/2026

Neutrik To Showcase opticalCON ADVANCED Connectors At 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

SMPTE Details 2026 NAB Show Educational Sessions

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

Ben Bradshaw Joins PSSI as Director, Product and Network Development

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

Peter Thordarson Joins ASG as Technical Account Executive

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

Survey: Voters Trust TV News Over AI, Social and Search

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

2026 NAB Show Exhibitor Insight: Amazon Web Services (AWS)

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

SMPTE Unveils 2026 NAB Show Educational Presentations

SMPTE , the home of media professionals, technologists, and engineers, today unveiled its educational presentations for the 2026 NAB Show. This year SMPTE will ...

18/03/2026

Maxon Marks Its Official Entry Into the AEC Market With I...

Maxon, maker of powerful, approachable software solutions for creators working in 2D and 3D design, motion graphics, visual effects, gaming, and more, today ann...

18/03/2026

Digital Alert Systems NAB Preview 2026

Digital Alert Systems Preview 2026 NAB Show April 19 - 22 Booth C3452 At the 2026 NAB Show, Digital Alert Systems will showcase Version 6.0 of its DASDEC ...

18/03/2026

Setplex Transforms Video Streaming with AI and Super Aggr...

Setplex today announced that it will showcase its complete, fully integrated Zapflex platform for the first time at the 2026 NAB Show, introducing powerful new ...

18/03/2026

COW Jobs: Seeking DP for Low Budget Dramedy - Chicago

COW Jobs: Seeking DP for Low Budget Dramedy - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Director of Photography for Low Budget Dramedy Fe...

18/03/2026

COW Jobs: Seeking Gaffer for Low Budget Dramedy - Chicago

COW Jobs: Seeking Gaffer for Low Budget Dramedy - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Gaffer for Low Budget Dramedy Feature Film- I...

18/03/2026

COW Jobs: Seeking Location, Sound for Low Budget Dramedy - Chicago

COW Jobs: Seeking Location, Sound for Low Budget Dramedy - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Location/Sound for Low Budget Dramed...

18/03/2026

COW Jobs: Seeking Child Wrangler for Low Budget Film - Chicago

COW Jobs: Seeking Child Wrangler for Low Budget Film - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Child Wrangler for Low Budget Dramedy Fe...

18/03/2026

Calrec Redefines Broadcast Workflows at NAB 2026 with its Most Powerful Hardware, Virtual and Hybrid Audio Lineup Yet

Calrec Redefines Broadcast Workflows at NAB 2026 with its Most Powerful Hardware...

18/03/2026

Oscar Nominated Two People Exchanging Saliva Posted with DaVinci Resolve Studio

Oscar Nominated Two People Exchanging Saliva Posted with DaVinci Resolve Studio Brie Clayton March 17, 2026 0 Comments DaVinci Resolve Studio handle...

18/03/2026

Boston Conservatory Presents Celebrated Musical Satire Urinetown

Boston Conservatory Presents Celebrated Musical Satire Urinetown Performances for this Center Stage production will take place at Boston Conservatory Theater ...

18/03/2026

Charlie Puth Joins Switched On Pop at Berklee NYC

Charlie Puth Joins Switched On Pop at Berklee NYC The Berklee alum spoke with host and Berklee NYC professor Charlie Harding for a live taping, answering audi...

17/03/2026

NASA+ Prepares To Live Stream Historic Artemis II Mission, Bringing Deep-Space Exploration to Global Audiences

NASA+'s Rebecca Sirmons and Brittany Brown offer unique look at live streami...

17/03/2026

BBright's TTML & SMPTE ST 2110-43: One Single Stream For the Whole World

The transition to IP has fundamentally reshaped professional media infrastructures. Video, audio, and increasingly metadata now circulate as independent, precis...

17/03/2026

Op-Ed: How Generative AI Is Transforming Live Sports Streaming Optimization

Live sports streaming can push every element in your video delivery chain to its limit, exposing every potential weakness in seconds. When the Super Bowl, the O...

17/03/2026

Dell Case Study: Powering the Future of Sports Media One Experience at a Time at UT Austin

Texas Athletics sought to modernize its media production, enhance fan experience...

17/03/2026

NAB 2026: Ikegami to Showcase Latest Generation TV Production Cameras, Controllers and Monitors

Ikegami USA will demonstrate the latest additions to its wide range of broadcast...

17/03/2026

TNA Wrestling and iHeartMedia Announce Major Multi-Platform Collaboration

TNA Wrestling and iHeartMedia announces a new multi-platform collaboration that will integrate iHeartMedia across TNA's premium live events, weekly televisi...

17/03/2026

The Miami Dolphins and Dell Boost Fan Experience, Safety, and Efficiency at Hard Rock Stadium

The goal was to transform Hard Rock Stadium into a global leader in sports and e...

17/03/2026

Spectrum Launches Multiview for NCAA Basketball Tournaments

Spectrum has announced the launch of its new Multiview feature in the Spectrum TV App, giving customers the ability to watch up to four NCAA men's or women&...

17/03/2026

Pac-12 Inks Integrity/Data Deals With Genius Sports, IC360

Genius Sports deal also covers data technology, AI, fan engagement, and performance analysis....

17/03/2026

Rede Massa Chooses Net Insight to Enable State-Wide Centralized Operations

Net Insight is supporting the rollout of a new state-wide centralized operation with Rede Massa, which is an SBT affiliate, the Brazilian regional television ne...

17/03/2026

F1 The Movie' Wins the Academy Award for Best Sound

Featuring audio from practice sessions, qualifying races, and Grand Prix races, the film represents Apple's sports-media ambitions At Sunday night's Ac...

17/03/2026

SVG New Sponsor Spotlight: Oracle's Mark Ramberg on the Future of Live Broadcast in the Cloud with OCI

Live broadcast has always been one of the most demanding environments in media a...

17/03/2026

DIRECTV Adds Multiview and Sports Central Features Ahead of NCAA Tournament

DirecTV is introducing several new viewing features, including a multi-screen March Madness Mix channel and an updated Sports Central mobile app hub, ahead of...

17/03/2026

Deltatre and ATP Media Announce Multi-Year Broadcast Graphics/Data Partnership

Deltatre has announced a multi-year partnership with ATP Media, the media arm of the ATP Tour, covering broadcast graphics, data, and production across the 2026...

17/03/2026

Detroit Pistons, Scripps Sports To Air Five Games Free Over the Air on TV-20 Detroit

The Detroit Pistons have announced a third consecutive season partnering with Sc...

17/03/2026

NAB Appoints Two New Members to Television Board of Directors

Share Copy link Facebook X Linkedin Bluesky Email...

17/03/2026

PMVG's TechConnect Goes Virtual for 2026

Share Copy link Facebook X Linkedin Bluesky Email...

17/03/2026

Miris unlocks high-fidelity 3D asset streaming at scale

3D streaming infrastructure provider Miris today announced the launch of a public beta for its new 3D asset streaming platform. Miris is building the infrastruc...

17/03/2026

Tedial Powers the Future of Media Operations at NAB Show...

As media organizations face mounting pressure to produce more content, faster, while maximizing value and operational efficiency, Tedial, a leading provider of ...

17/03/2026

Brainstorm transforms productivity and sustainability wit...

Brainstorm, a leading manufacturer of real-time graphics, augmented and virtual production, is launching the newest version of its platform, Brainstorm Suite 7,...

17/03/2026

Limecraft Introduces New Platform Update Adding Greater C...

Limecraft today announces the release of Limecraft 2026.2, the second platform update in its 2026 release cycle. Limecraft is an AI-powered production platform ...

17/03/2026

Pioneering the Next Era of Sports Broadcasting - Broadcas...

Broadcast Solutions, a leading system integrator and provider of innovative solutions for the broadcast and media industry, showcased its latest broadcast and V...

17/03/2026

FCC Announces TV Translator Call Sign Changes

Share Copy link Facebook X Linkedin Bluesky Email...

17/03/2026

2026 NAB Show Offering Free Show Floor Passes to Creators

Share Copy link Facebook X Linkedin Bluesky Email...

17/03/2026

QuickLink's Latest StudioEdge Models to Make North American Debut at NAB 202

QuickLink's Latest StudioEdge Models to Make North American Debut at NAB 202 Brie Clayton March 16, 2026 0 Comments The Multi-platform Remote Gues...

17/03/2026

Frankenstein Graded with DaVinci Resolve Studio

Frankenstein Graded with DaVinci Resolve Studio Brie Clayton March 16, 2026 0 Comments Sonnenfeld enhances the controlled interplay between warm and c...

17/03/2026

New Voyavox from Link Electronics with Real-Time Speech-to-Text Captioning to be Featured in NAB Booth #W2910

New Voyavox from Link Electronics with Real-Time Speech-to-Text Captioning to be...

17/03/2026

Berklee City Music Stewards META Fellowship Supporting Massachusetts Music Educators

Berklee City Music Stewards META Fellowship Supporting Massachusetts Music Educa...

17/03/2026

Fantasy Action Series Agent from Above' Puts Contemporary Spin on Temple Culture and Traditional Folklore in New Trailer

Back to All News Fantasy Action Series Agent from Above' Puts Contemporary...

17/03/2026

Netflix Presents the Trailer for the Final Season of 'Turn of the Tide'

Back to All News Netflix Presents the Trailer for the Final Season of Turn of the Tide Entertainment 17 March 2026 GlobalSpainPortugal Link copied to clipb...

17/03/2026

NVIDIA, Telecom Leaders Build AI Grids to Optimize Inference on Distributed Networks

As AI native applications scale to more users, agents and devices, the telecommu...