Sony Pixel Power calrec Sony

Why Vidispine becomes cognitive

04/05/2020

Vision and hearing are your main senses when experiencing a movie. You recognize the actors, you understand the spoken language even if it is not your native language. You follow the story and enjoy the amazing film photo of the different environments - and by sharing all these experiences you can not only relate to the movie itself but also convince your friend to see the movie.

Wouldn't it be great if your Media Asset Management system could possess similar capabilities when managing your media? Being able to understand the language? Recognize actors, detect and define parts of the image - maybe also differentiate between genres? But how?

In order to do this you need a system that can actually see and listen what's inside your media - you need a system that has cognitive capabilities like yourself and can store that info in - yes, you guessed right - metadata.

But how do we navigate our vastly growing archives of file-based media?

Media files themselves today includes a lot of metadata already in a descriptive format. In here, there is room for all general metadata as well as technical metadata describing the actual file structures. MAM (Media Asset Management) systems make use of this existing metadata along with additional layers of metadata frameworks to help you navigate, find and tag not only media files themselves but also the time-based intervals of the media.

Because of this, you can argue that the true definition of a media file must include an audio-visual asset AND an associated metadata description. Without one or the other - the asset is not complete.

Cognitive Metadata to boldly go where no MAM has gone before Traditionally, the common notion is that while a machine can read and act on the associated text-based metadata of a media file, a human can understand the storyline. We can detect lipsync, recognize actors, emotions, and all the visual objects inside a frame. We can also listen to the language spoken, understand the story and do a translation into a new language.

Because of this common view on the differences between machine capabilities and human capabilities, it is still also quite common that production companies and similar, divide many tasks in a media supply chain between man and machine this way.

But times are changing, and they are changing fast. For any Content Owner, CTO or technical strategist building a modern media workflow, it is vital to challenge this traditional view on what machines can and cannot do.

Interview with Ralf Jansen Product Manager and Software Architect at Arvato / Vidispine To find out more on this subject, we talked to Ralf Jansen, Product Manager and Software Architect at Arvato / Vidispine AB. Ralf Jansen has a strong technical background, finished computer science degree with a Thesis Diploma at Fraunhofer Institute and has since worked as a developer and software architect in the industry for nearly the last 20 years. Today Ralf Jansen is managing the development of the new Vidinet Cognitive Services (VCS) and is part of the Vidinet partner success team.

So, Ralf, why is cognitive services important? Cognitive services allow the machine to find information inside the video and audio frame itself, very much like we humans can interpret the same content. This of course opens up important new possibilities depending on what type of workflow you are managing. A channel distributor can use cognitive services to automatically find (new) types of information in a huge amount of media content that could not be processed manually before - and thus use or present that insights to the viewer as a program, highlights, suggested shows or even as autogenerated trailers. Cognitive services carry this new information as metadata and give your MAM system new and much more granular methods of managing your media files. This is very important in the process of optimizing the performance and capabilities of your evolving media supply chain.

Revenue and how we can improve revenue are, of course, a driver for the advancement and adaption of cognitive services like for most other technology. And once you are getting familiar with the idea of challenging your common view on what machines can do - the subject of revenue by technology gets even more interesting.

In what areas could cognitive services improve existing revenue streams? Knowing and understanding the inside of your media opens many new opportunities that can improve revenue and help customers to monetize their owned media assets. The first one that comes to mind is of course speech to text - where cognitive services can in best case reach or even exceed the magical benchmark of human understanding (which is roughly at 5% error rate) depending on how purely spoken and what known vocabulary was used with automatic transcribe functionality already today. Automatic speech to text at this level not only free up human resources and saves money otherwise spent on external subtitling services, but also enables a new layer of time based metadata where you actually can navigate in time to find deep linked subjects, names and topics by simply searching the contents of your subtitling in your MAM systems accurate search capabilities and in our case powered by Elastic Search. And this is of course just one of many examples.

It is important to understand the value of temporal metadata since captured reality stored into the video (and audio) file changes every 30-60 frames per second or more - and because of temporal metadata we are able to define accurate time spans for different video and audio content detected by cognitive services. A post house ingesting reality content normally uses human resources for logging and preparing projects for the editors. In these and similar production workflows, the challenge is the huge amount of incoming raw footage that needs to be sorted a
LINK: https://www.vidispine.com/blog/why-vidispine-becomes-cognitive...
See more stories from vidispine

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

19/04/2026

NAB Show 2026 Is Here! Follow All of our Live Coverage!

Blackmagic Design has announced the ATEM 4 M/E Constellation IP and ATEM 4 M/E Constellation IP Plus, two SMPTE 2110-native live production switchers. The ATEM ...

19/04/2026

Live From NAB 2026: Grass Valley CEO Jon Wilson on AMPPs Explosive Growth, Hybrid Workflows, and Whats New at the Show

Grass Valley is finding the right balance between its hardware heritage with an ...

19/04/2026

Live From NAB 2026: Oracles Kip Schauer on Why OCI Is Doubling Down on Media, Sports, and Broadcast

Oracle's strategy rests on the foundational strengths of Oracle Cloud Infras...

19/04/2026

Live From NAB 2026: Program Productions Jess Kowatch on Whats New with ProCrewz and the Impact of AI on Crewing

Program Productions, the live sports production industry's leading crewer, i...

19/04/2026

Live From NAB 2026: Aggrekos Joe Scionti on Powering the Super Bowl, PGA Championship, and the Road to the FIFA World Cup

At the 2026 NAB Show in Las Vegas, SVG sat down with Joe Scionti, Account Manage...

19/04/2026

NAB 2026: Evertz to highlight evertz.io XChange for live event management and market switching

Evertz (Booth N817) is set to present new services within its evertz.io platform...

19/04/2026

NAB 2026: Evertz to showcase IPMX-certified NUCLEUS and MMA platforms for AV and ST 2110 integration

Evertz (Booth N817) will showcase its IPMX-certified NUCLEUS platform alongside ...

19/04/2026

NAB 2026: Evertz to showcase ENX media core for hybrid SDI and IP facilities

Evertz (Booth N817) is set to showcase ENX at NAB 2026, a media core platform designed to support hybrid SDI and IP infrastructures in production facilities and...

19/04/2026

NAB 2026: Evertz introduces Studer VistaVUE Touch for broadcast control

Evertz (Booth N817) will introduce Studer VistaVUE Touch at NAB 2026, a control surface designed to integrate audio, video and control workflows within a custom...

19/04/2026

NAB 2026: Evertz highlights X-CALIBER high-density encoding platform for media transport

Evertz (Booth N817) will highlight X-CALIBER at NAB 2026, an encoding and decodi...

19/04/2026

NAB 2026: Cobalt Digital introduces blueCORE standalone processors for SDI and ST 2110 workflows

Cobalt Digital (Booth N1340) will introduce the blueCORE family of standalone si...

19/04/2026

NAB 2026: Chyron and Asport to demonstrate AI-driven end-to-end sports production and distribution workflows

Chyron and Asport (Booth N2441) will demonstrate an integrated sports video work...

19/04/2026

NAB 2026: MediaKind outlines growth of Multiview deployments as Charter rollout expands in North America

MediaKind (Booth W1743) provided an update on its Multiview deployments at NAB S...

19/04/2026

NAB 2026: Calrec and Grass Valley announce partnership to integrate ImPulseV with AMPP platform

Calrec (Booth C6907) and Grass Valley (Booth C2408) announced a long-term broadc...

19/04/2026

NAB 2026: Oracle and partners to demo MoQ-based streaming ecosystem

Oracle is bringing a multi-partner demonstration of Media over QUIC (MoQ)-based live streaming to NAB Show 2026, showcasing how independent systems from multipl...

19/04/2026

NAB 2026: Encompass Digital Media and Oracle Cloud Infrastructure expand partnership for cloud-native broadcast operations

Encompass Digital Media announced an expanded partnership with Oracle Cloud Infr...

19/04/2026

SportsTechBuzz at NAB 2026, Day 1: Live Reports From the Show Floor in Vegas

The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...

19/04/2026

NAB 2026: Blackmagic Design Announces ATEM 4 M/E Constellation IP Switchers

Blackmagic Design has announced the ATEM 4 M/E Constellation IP and ATEM 4 M/E Constellation IP Plus, two SMPTE 2110-native live production switchers. The ATEM ...

19/04/2026

Waves update Sync Vx

Now available in VST3, AU and AAX formats Waves have recently released an update that extends their vocal-alignment plug-in's capabilities to all DAWs -...

19/04/2026

TV and Radio HQ Moves Closer to the Conversation

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Expanded Creator Lab Seeds Digital Synergies

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Why Streamers Are Seizing the Now

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Make Every Dollar Count on Set

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Tech Transforms the Live Sports Playbook

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Amagi Managed Services Modernizes Broadcasting Operations...

Amagi, the agentic industry cloud platform for unified broadcast, streaming, and monetization, today announced that AccuWeather , the most trusted source of wea...

19/04/2026

Calrec and Grass Valley unlock exceptional choice and fle...

Calrec (Booth:C6907) and Grass Valley (Booth: C2408) are today announcing a long-term broadcast audio technology partnership at NAB Show 2026. The companies are...

19/04/2026

Ikegami Announces VFE-P07D Monocular OLED Viewfinder with...

Ikegami announces a further expansion to its range of on-camera viewfinders. Scheduled for introduction on Ikegamis Central Hall booth C3819 at the April 19th -...

19/04/2026

GatesAir Strengthens Global Services Team with New Hires

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

MASN, Spectrum Ink Multiyear Carriage Agreement

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Calrec and Grass Valley Unveil ImPulseV and AMPP Integration

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Clear-Com Unveils FreeSpeak Cell

Share Copy link Facebook X Linkedin Bluesky Email...

19/04/2026

Riedel's SimplyLive Solution Powers Centralized VAR for Argentina's Top Football League

Wuppertal April 19, 2026 Riedel's SimplyLive Solution Powers Centralized V...

19/04/2026

Bridge Digital and Riedel Build CampusWide ST 2110 Network for Eastern Kentucky University

Wuppertal April 19, 2026 Bridge Digital and Riedel Build Campus Wide ST 2110 N...

19/04/2026

Riedel Showcases Next Advances in IP-Based Production at NAB 2026

Wuppertal April 19, 2026 Riedel Showcases Next Advances in IP-Based Production at NAB 2026MediorNet HorizoN ST 2110 MultiViewer App, SmartPanel Commentary Con...

19/04/2026

Harmonic Enables DIRECTV to Reimagine Nationwide DTH Service

Harmonic's Cloud-Native VOS Media Software Lowers Costs by Unifying Media Playout to Delivery on a Single Platform SAN JOSE, Calif. - April 19, 2026 - Harmo...

18/04/2026

NAB 2026: MultiDyne begins shipping C16-AM-12G audio monitor for SDI and IP workflows

MultiDyne Video & Fiber Optic Systems has begun shipping the C16-AM-12G audio mo...

18/04/2026

NAB 2026: FOR-A America announces AI updates for IMPULSE platform and release of MixBoard and HVS-Q12 switchers

FOR-A America is set to detail AI functionality for its software-defined IMPULSE...

18/04/2026

NAB 2026: Cobalt Digital and SineSix Media integrate audio description technology for broadcast workflows

Cobalt Digital and SineSix Media have announced a partnership to integrate the v...

18/04/2026

NAB 2026: ATSC focuses on 3.0 broadcast standard implementation at NAB Show 2026

The ATSC, the broadcast standards association, is highlighting the status of the ATSC 3.0 internet protocol-based broadcast standard at the 2026 NAB Show. The e...

18/04/2026

NAB 2026: Bolin Technology introduces Range PTZR camera, R9-L420N PTZ, and KBD Plus controller

Bolin Technology has introduced a new range of hardware for live production envi...

18/04/2026

NAB 2026: KMH Integration showcases AV Casting workflow approach and new technology at NAB 2026

KMH Integration is participating in the 2026 NAB Show, focusing on its AV Casti...

18/04/2026

NAB 2026: Appear appoints Mike Burk as vice president of business development

Appear has appointed Mike Burk as vice president of business development for North America. Burk brings over two decades of experience in the broadcast and live...

18/04/2026

NAB 2026: Skyline Communications demonstrates DataMiner platform and AI capabilities at NAB Show

Skyline Communications is showcasing its DataMiner platform and the new DataMine...