
Give AI a Look: Any Industry Can Now Search and Summarize Vast Volumes of Visual Data

04/11/2024

Enterprises and public sector organizations around the world are developing AI agents to boost the capabilities of workforces that rely on visual information from a growing number of devices, including cameras, IoT sensors and vehicles.

To support their work, a new NVIDIA AI Blueprint for video search and summarization will enable developers in virtually any industry to build visual AI agents that analyze video and image content. These agents can answer user questions, generate summaries and enable alerts for specific scenarios.

Part of NVIDIA Metropolis, a set of developer tools for building vision AI applications, the blueprint is a customizable workflow that combines NVIDIA computer vision and generative AI technologies.

Global systems integrators and technology solutions providers including Accenture, Dell Technologies and Lenovo are bringing the NVIDIA AI Blueprint for video search and summarization to businesses and cities worldwide, jump-starting the next wave of AI applications that can be deployed to boost productivity and safety in factories, warehouses, shops, airports, traffic intersections and more.

Announced ahead of the Smart City Expo World Congress, the NVIDIA AI Blueprint gives visual computing developers a full suite of optimized software for building and deploying generative AI-powered agents that can ingest and understand massive volumes of live video streams or data archives.

Users can customize these visual AI agents with natural language prompts instead of rigid software code, lowering the barrier to deploying virtual assistants across industries and smart city applications.

NVIDIA AI Blueprint Harnesses Vision Language Models

Visual AI agents are powered by vision language models (VLMs), a class of generative AI models that combine computer vision and language understanding to interpret the physical world and perform reasoning tasks.

The NVIDIA AI Blueprint for video search and summarization can be configured with NVIDIA NIM microservices for VLMs like NVIDIA VILA, LLMs like Meta's Llama 3.1 405B and AI models for GPU-accelerated question answering and context-aware retrieval-augmented generation. Developers can easily swap in other VLMs, LLMs and graph databases and fine-tune them using the NVIDIA NeMo platform for their unique environments and use cases.

Adopting the NVIDIA AI Blueprint could save developers months of effort on investigating and optimizing generative AI models for smart city applications. Deployed on NVIDIA GPUs at the edge, on premises or in the cloud, it can vastly accelerate the process of combing through video archives to identify key moments.

In a warehouse environment, an AI agent built with this workflow could alert workers if safety protocols are breached. At busy intersections, an AI agent could identify traffic collisions and generate reports to aid emergency response efforts. And in the field of public infrastructure, maintenance workers could ask AI agents to review aerial footage and identify degrading roads, train tracks or bridges to support proactive maintenance.

Beyond smart spaces, visual AI agents could also be used to summarize videos for people with impaired vision, automatically generate recaps of sporting events and help label massive visual datasets to train other AI models.

The video search and summarization workflow joins a collection of NVIDIA AI Blueprints that make it easy to create AI-powered digital avatars, build virtual assistants for personalized customer service and extract enterprise insights from PDF data.

NVIDIA AI Blueprints are free for developers to experience and download, and can be deployed in production across accelerated data centers and clouds with NVIDIA AI Enterprise, an end-to-end software platform that accelerates data science pipelines and streamlines generative AI development and deployment.

AI Agents to Deliver Insights From Warehouses to World Capitals

Enterprise and public sector customers can also harness the full collection of NVIDIA AI Blueprints with the help of NVIDIA's partner ecosystem.

Global professional services company Accenture has integrated NVIDIA AI Blueprints into its Accenture AI Refinery, which is built on NVIDIA AI Foundry and enables customers to develop custom AI models trained on enterprise data.

Global systems integrators in Southeast Asia, including ITMAX in Malaysia and FPT in Vietnam, are building AI agents based on the video search and summarization NVIDIA AI Blueprint for smart city and intelligent transportation applications.

Developers can also build and deploy NVIDIA AI Blueprints on NVIDIA AI platforms with compute, networking and software provided by global server manufacturers.

Dell will use VLM and agent approaches with Dell's NativeEdge platform to enhance existing edge AI applications and create new edge AI-enabled capabilities. Dell Reference Designs for the Dell AI Factory with NVIDIA and the NVIDIA AI Blueprint for video search and summarization will support VLM capabilities in dedicated AI workflows for data center, edge and on-premises multimodal enterprise use cases.

NVIDIA AI Blueprints are also incorporated in Lenovo Hybrid AI solutions powered by NVIDIA.

Companies like K2K, a smart city application provider in the NVIDIA Metropolis ecosystem, will use the new NVIDIA AI Blueprint to build AI agents that analyze live traffic cameras in real time. This will enable city officials to ask questions about street activity and receive recommendations on ways to improve operations. The company is also working with city traffic managers in Palermo, Italy, to deploy visual AI agents using NIM microservices and NVIDIA AI Blueprints.

Discover more about the NVIDIA AI Blueprint for video search and summarization by visiting the NVIDIA booth at the Smart City Expo World Congress, taking place in Barcelona through Nov. 7.

LINK: https://blogs.nvidia.com/blog/video-search-summarization-ai-agents/...