Sony Pixel Power calrec Sony

Give AI a Look: Any Industry Can Now Search and Summarize Vast Volumes of Visual Data

04/11/2024

Enterprises and public sector organizations around the world are developing AI agents to boost the capabilities of workforces that rely on visual information from a growing number of devices - including cameras, IoT sensors and vehicles.

To support their work, a new NVIDIA AI Blueprint for video search and summarization will enable developers in virtually any industry to build visual AI agents that analyze video and image content. These agents can answer user questions, generate summaries and enable alerts for specific scenarios.

Part of NVIDIA Metropolis, a set of developer tools for building vision AI applications, the blueprint is a customizable workflow that combines NVIDIA computer vision and generative AI technologies.

Global systems integrators and technology solutions providers including Accenture, Dell Technologies and Lenovo are bringing the NVIDIA AI Blueprint for visual search and summarization to businesses and cities worldwide, jump-starting the next wave of AI applications that can be deployed to boost productivity and safety in factories, warehouses, shops, airports, traffic intersections and more.

Announced ahead of the Smart City Expo World Congress, the NVIDIA AI Blueprint gives visual computing developers a full suite of optimized software for building and deploying generative AI-powered agents that can ingest and understand massive volumes of live video streams or data archives.

Users can customize these visual AI agents with natural language prompts instead of rigid software code, lowering the barrier to deploying virtual assistants across industries and smart city applications.

NVIDIA AI Blueprint Harnesses Vision Language Models Visual AI agents are powered by vision language models (VLMs), a class of generative AI models that combine computer vision and language understanding to interpret the physical world and perform reasoning tasks.

The NVIDIA AI Blueprint for video search and summarization can be configured with NVIDIA NIM microservices for VLMs like NVIDIA VILA, LLMs like Meta's Llama 3.1 405B and AI models for GPU-accelerated question answering and context-aware retrieval-augmented generation. Developers can easily swap in other VLMs, LLMs and graph databases and fine-tune them using the NVIDIA NeMo platform for their unique environments and use cases.

Adopting the NVIDIA AI Blueprint could save developers months of effort on investigating and optimizing generative AI models for smart city applications. Deployed on NVIDIA GPUs at the edge, on premises or in the cloud, it can vastly accelerate the process of combing through video archives to identify key moments.

In a warehouse environment, an AI agent built with this workflow could alert workers if safety protocols are breached. At busy intersections, an AI agent could identify traffic collisions and generate reports to aid emergency response efforts. And in the field of public infrastructure, maintenance workers could ask AI agents to review aerial footage and identify degrading roads, train tracks or bridges to support proactive maintenance.

Beyond smart spaces, visual AI agents could also be used to summarize videos for people with impaired vision, automatically generate recaps of sporting events and help label massive visual datasets to train other AI models.

The video search and summarization workflow joins a collection of NVIDIA AI Blueprints that make it easy to create AI-powered digital avatars, build virtual assistants for personalized customer service and extract enterprise insights from PDF data.

NVIDIA AI Blueprints are free for developers to experience and download, and can be deployed in production across accelerated data centers and clouds with NVIDIA AI Enterprise, an end-to-end software platform that accelerates data science pipelines and streamlines generative AI development and deployment.

AI Agents to Deliver Insights From Warehouses to World Capitals Enterprise and public sector customers can also harness the full collection of NVIDIA AI Blueprints with the help of NVIDIA's partner ecosystem.

Global professional services company Accenture has integrated NVIDIA AI Blueprints into its Accenture AI Refinery, which is built on NVIDIA AI Foundry and enables customers to develop custom AI models trained on enterprise data.

Global systems integrators in Southeast Asia - including ITMAX in Malaysia and FPT in Vietnam - are building AI agents based on the video search and summarization NVIDIA AI Blueprint for smart city and intelligent transportation applications.

Developers can also build and deploy NVIDIA AI Blueprints on NVIDIA AI platforms with compute, networking and software provided by global server manufacturers.

Dell will use VLM and agent approaches with Dell's NativeEdge platform to enhance existing edge AI applications and create new edge AI-enabled capabilities. Dell Reference Designs for the Dell AI Factory with NVIDIA and the NVIDIA AI Blueprint for video search and summarization will support VLM capabilities in dedicated AI workflows for data center, edge and on-premises multimodal enterprise use cases.

NVIDIA AI Blueprints are also incorporated in Lenovo Hybrid AI solutions powered by NVIDIA.

Companies like K2K, a smart city application provider in the NVIDIA Metropolis ecosystem, will use the new NVIDIA AI Blueprint to build AI agents that analyze live traffic cameras in real time. This will enable city officials to ask questions about street activity and receive recommendations on ways to improve operations. The company also is working with city traffic managers in Palermo, Italy, to deploy visual AI agents using NIM microservices and NVIDIA AI Blueprints.

Discover more about the NVIDIA AI Blueprint for video search and summarization by visiting the NVIDIA booth at the Smart Cities Expo World Congress, taking place in Barcelona through Nov. 7.

Le
LINK: https://blogs.nvidia.com/blog/video-search-summarization-ai-agents/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

17/06/2026

The Immersive Supervisor Emerges as Hollywood's Next Production Role

The Immersive Supervisor Emerges as Hollywood's Next Production Role Brie Clayton June 17, 2026 0 Comments Above image: On set live immersive revi...

17/06/2026

Vertical Musical Playback Shot with Blackmagic PYXIS 6K

Vertical Musical Playback Shot with Blackmagic PYXIS 6K Brie Clayton June 17, 2026 0 Comments Large format sensor and DaVinci Resolve workflow used fo...

17/06/2026

DAZ 3D Launches New Game-Ready Character Assets Built for Modern Engines and Production Workflows

DAZ 3D Launches New Game-Ready Character Assets Built for Modern Engines and Pro...

17/06/2026

Spectrum Awards $1.1 Million in Digital Education Grants

Share Copy link Facebook X Linkedin Bluesky Email...

17/06/2026

XR Sports Alliance Adds New Members

Share Copy link Facebook X Linkedin Bluesky Email...

17/06/2026

AIMS Launches Free Online IPMX Training Series

Share Copy link Facebook X Linkedin Bluesky Email...

17/06/2026

Kiloview Partners with SFM to Expand AV-over-IP Solutions...

Montr al, Quebec, June 11, 2026 Kiloview, a leading provider of AV-over-IP and NDI -based video transmission solutions, today announced a distribution partner...

17/06/2026

Kiloview Launches U4 IP Video Dock Bringing Professional...

Changsha, China, June 15, 2026 Kiloview officially announced the launch of U4 IP Video Dock, a compact IP video decoder and output dock designed to bring prof...

17/06/2026

Good Vibrations hit Dublin and Limerick as RT Concert Orchestra plays the music of the Beach Boys

The RT Concert Orchestra will bring the timeless music of The Beach Boys to aud...

17/06/2026

June 16, 2026

Calibr-Skaggs awarded $5.1M by NIH to develop long-acting hepatitis B virus therapy A new program aims to replace a daily HBV drug with once-monthly or even qua...

16/06/2026

Thomson launches new learning App

Thomson's highly regarded expert-led online learning courses are now easier to access on the go via our new App. Available now on Google Play Store, the J...

16/06/2026

Neumann MT 48 Receives Major Firmware 2.0 Update

Neumann.Berlin has released firmware version 2.0 for the MT 48 audio interface, adding plugin compatibility, expanded Dante networking options, broadcast encode...

16/06/2026

TVNewsCheck Opens Nominations for 2027 Women in Technology Awards

TVNewsCheck has announced that nominations are now open for its 2027 Women in Technology Awards, to be presented at NAB Show 2027 on Tuesday, April 6 in the Med...

16/06/2026

Clear-Com Introduces Avalon IP Intercom Platform

Clear-Com has announced Avalon, a 1RU IP intercom platform for broadcast, live events, and production environments. Designed for IP-only workflows, Avalon suppo...

16/06/2026

SNS EVO Enables Remote and Distributed Video Editing Workflows

SNS has published a guide to remote video editing workflows using its EVO shared storage platform and companion tools, covering use cases ranging from home edit...

16/06/2026

Richmond Flying Squirrels Deploy Grass Valley LDX 110 Cameras at CarMax Park

Grass Valley has announced that the Richmond Flying Squirrels, a Minor League Baseball affiliate of the San Francisco Giants, have deployed five Grass Valley LD...

16/06/2026

AIMS Launches Free Official IPMX Training Series Online

The Alliance for IP Media Solutions (AIMS) has announced the launch of the Official IPMX Training Series, a free online program covering the design, configurati...

16/06/2026

Swerve Womens Sports Announces Distribution Deals with Fubo, Plex, Amazon Fire TV, and Anoki AI

Swerve TV has announced distribution agreements with Fubo, Plex, Amazon Fire TV,...

16/06/2026

ATP and TikTok Expand Global Content Partnership

ATP and TikTok have announced an expansion of their global content partnership, extending the ATP's TikTok hub powered by TikTok GamePlan to cover all nine ...

16/06/2026

FOX Sports Turns Los Angeles Pico Lot Into Its FIFA World Cup Production Nerve Center

Network's LA facility serves as the heart of a sprawling operation built to ...

16/06/2026

300+ Records a Day, 150 TB Daily, and a Relentless Content Avalanche: Inside FOX Sports' World Cup Media Engine

At Pico, the network's media-management team is supporting a flood of HBS fe...

16/06/2026

NHL Games Leaving CBC in Canada as Sublicense With Rogers Sportsnet Ends

The NHL will no longer air on CBC after the pulic broadcasters and national rights-holder Rogers Sportsnet were unable to come to agreement. After a successfu...

16/06/2026

SVG New Sponsor Spotlight: Virtual Eye's Ben Taylor on Making Live Sports More Valuable and Entertaining Through Data-Driven Graphics

As live sports broadcasters continue to seek new ways to make complex action mor...

16/06/2026

Thats BRISK, Baby! FOX Sports' Broadcast Remote IP Studio Kits Bring World Cup Fan Energy Back to Pico

Built with the 2026 FIFA World Cup in mind, these small but mighty IP-based tran...

16/06/2026

Rumble three-band soft synth by UVI

Boasts individual synths for each band UVI's latest synth takes an interesting approach to synthesis, offering a trio of synth engines that each operate...

16/06/2026

PSP Levelizer: auto level adjustment plug-in from PSPaudioware

New intelligent auto-fader plug-in unveiled PSPaudioware's latest release offers automatic level adjustment and provides more detailed control than many...

16/06/2026

The Crow Hill Company launch Crystal Pads

New performance-focused library announced Crystal Pads is the latest addition to The Crow Hill Company's ever-growing product range, and according to th...

16/06/2026

GForce launch official Prophet-5 soft synth

Developed in partnership with Sequential In recent years, GForce Software have branched into official emulations of classic hardware synths, delivering a ha...

16/06/2026

DT 30 IE: New in-ears from beyerdynamic

Designed specifically for live performance monitoring beyerdynamic's latest announcement sees the company introduce an affordable in-ear monitoring syst...

16/06/2026

Cherry Audio recreate the Ensoniq ESQ-1

Official emulation celebrates iconic synth's 40th anniversary Cherry Audio have just introduced Ensoniq ESQ-1, an official recreation of the 1986 polyph...

16/06/2026

Australians place growing trust in SBS News

Australians place growing trust in SBS News 16 June, 2026 Media releases SBS has been recognised as one of Australia's most trusted news providers, ran...

16/06/2026

Rohde & Schwarz achieves highest number of GCF validated 3GPP NR NTN test cases for RF, RRM and PCT domains

Rohde & Schwarz achieves highest number of GCF validated 3GPP NR NTN test cases ...

16/06/2026

Hitachi and PESA Announce Strategic Partnership to Drive Growth in Poland's Rail Market

Bydgoszcz to Become a Local Centre of Excellence for Advanced Rail Technologies....

16/06/2026

Chyron Unveils Chyron Weather 2.4

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

Historic Zhuque-3 Reusable Rocket Test Mission Captured with URSA Cine Immersive

Historic Zhuque-3 Reusable Rocket Test Mission Captured with URSA Cine Immersive Brie Clayton June 16, 2026 0 Comments Apple Immersive Video puts view...

16/06/2026

SMPTE Plans ST 2110 Education Summer Programs

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

Rise Awards Returns for 2026 to Celebrate Excellence in B...

Rise WIB, the award-winning advocacy group championing gender diversity and career progression across the broadcast and media technology industry, today announc...

16/06/2026

Limecraft Expands its Media Production Platform with Team...

Limecraft today announced the availability of Limecraft 2026.4, the fourth of eight planned platform releases this year. The update introduces Team-Based Access...

16/06/2026

Perry Sook: Big Tech Poses 'Very Urgent Threat to Broadcast Stations

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

FIFA World Cup Delivers Record Ratings on Fox

Share Copy link Facebook X Linkedin Bluesky Email...

16/06/2026

AIMS Launches the Official IPMX Training Series Online

Free Program Supports IPMX Education from Foundational Concepts Through System and Network Design The Alliance for IP Media Solutions (AIMS) today announced t...

16/06/2026

Share your views on Screen Australia and the future of the industry

Share your views on Screen Australia and the future of the industry 15 June 2026 Your feedback matters. Following the instrumental insights provided in 2025,...

16/06/2026

HPE AI Factory With NVIDIA Expands for the Era of Agents

Enterprises are moving agentic AI from proof of concept to production - and the next generation of AI factories are built for the era of agents. At HPE Discove...

16/06/2026

Coherent Breaks Ground on Expanded Texas Facility, Scaling AI's Optical Backbone

AI runs at the speed of light. More and more, that light is made in Texas. Cohe...

16/06/2026

Techtel Supports T-Motion RCCP-2A Controller Upgrade for Major Australian Broadcaster

Techtel Supports T-Motion RCCP-2A Controller Upgrade for Major Australian Broadc...