
Vision and hearing are your main senses when experiencing a movie. You recognize the actors, you understand the spoken language even if it is not your native language. You follow the story and enjoy the amazing film photo of the different environments - and by sharing all these experiences you can not only relate to the movie itself but also convince your friend to see the movie.
Wouldn't it be great if your Media Asset Management system could possess similar capabilities when managing your media? Being able to understand the language? Recognize actors, detect and define parts of the image - maybe also differentiate between genres? But how?
In order to do this you need a system that can actually see and listen what's inside your media - you need a system that has cognitive capabilities like yourself and can store that info in - yes, you guessed right - metadata.
But how do we navigate our vastly growing archives of file-based media?
Media files themselves today includes a lot of metadata already in a descriptive format. In here, there is room for all general metadata as well as technical metadata describing the actual file structures. MAM (Media Asset Management) systems make use of this existing metadata along with additional layers of metadata frameworks to help you navigate, find and tag not only media files themselves but also the time-based intervals of the media.
Because of this, you can argue that the true definition of a media file must include an audio-visual asset AND an associated metadata description. Without one or the other - the asset is not complete.
Cognitive Metadata to boldly go where no MAM has gone before Traditionally, the common notion is that while a machine can read and act on the associated text-based metadata of a media file, a human can understand the storyline. We can detect lipsync, recognize actors, emotions, and all the visual objects inside a frame. We can also listen to the language spoken, understand the story and do a translation into a new language.
Because of this common view on the differences between machine capabilities and human capabilities, it is still also quite common that production companies and similar, divide many tasks in a media supply chain between man and machine this way.
But times are changing, and they are changing fast. For any Content Owner, CTO or technical strategist building a modern media workflow, it is vital to challenge this traditional view on what machines can and cannot do.
Interview with Ralf Jansen Product Manager and Software Architect at Arvato / Vidispine To find out more on this subject, we talked to Ralf Jansen, Product Manager and Software Architect at Arvato / Vidispine AB. Ralf Jansen has a strong technical background, finished computer science degree with a Thesis Diploma at Fraunhofer Institute and has since worked as a developer and software architect in the industry for nearly the last 20 years. Today Ralf Jansen is managing the development of the new Vidinet Cognitive Services (VCS) and is part of the Vidinet partner success team.
So, Ralf, why is cognitive services important? Cognitive services allow the machine to find information inside the video and audio frame itself, very much like we humans can interpret the same content. This of course opens up important new possibilities depending on what type of workflow you are managing. A channel distributor can use cognitive services to automatically find (new) types of information in a huge amount of media content that could not be processed manually before - and thus use or present that insights to the viewer as a program, highlights, suggested shows or even as autogenerated trailers. Cognitive services carry this new information as metadata and give your MAM system new and much more granular methods of managing your media files. This is very important in the process of optimizing the performance and capabilities of your evolving media supply chain.
Revenue and how we can improve revenue are, of course, a driver for the advancement and adaption of cognitive services like for most other technology. And once you are getting familiar with the idea of challenging your common view on what machines can do - the subject of revenue by technology gets even more interesting.
In what areas could cognitive services improve existing revenue streams? Knowing and understanding the inside of your media opens many new opportunities that can improve revenue and help customers to monetize their owned media assets. The first one that comes to mind is of course speech to text - where cognitive services can in best case reach or even exceed the magical benchmark of human understanding (which is roughly at 5% error rate) depending on how purely spoken and what known vocabulary was used with automatic transcribe functionality already today. Automatic speech to text at this level not only free up human resources and saves money otherwise spent on external subtitling services, but also enables a new layer of time based metadata where you actually can navigate in time to find deep linked subjects, names and topics by simply searching the contents of your subtitling in your MAM systems accurate search capabilities and in our case powered by Elastic Search. And this is of course just one of many examples.
It is important to understand the value of temporal metadata since captured reality stored into the video (and audio) file changes every 30-60 frames per second or more - and because of temporal metadata we are able to define accurate time spans for different video and audio content detected by cognitive services. A post house ingesting reality content normally uses human resources for logging and preparing projects for the editors. In these and similar production workflows, the challenge is the huge amount of incoming raw footage that needs to be sorted a
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
15/04/2026
Open Broadcast Systems has announced that BBC World Service has selected its IP ...
15/04/2026
LiveU has announced an expansion of its collaboration with Sony Corporation, add...
15/04/2026
Ateme has announced a collaboration with NVIDIA to support live Apple Immersive ...
15/04/2026
The Professional Fighters League (PFL) has announced a multi-year partnership renewal with DAZN DACH, covering Germany, Switzerland, Austria, Liechtenstein, and...
15/04/2026
Canon U.S.A. (NAB Booth C3825) today took the lid off of the CINE-SERVO 40-1200m...
15/04/2026
Panasonic Video and Audio Systems North America and NEP Group will demonstrate a...
15/04/2026
For the fourth year running, independent analysts found businesses across all industries and verticals pay roughly the same amount in fees as they spend on stor...
15/04/2026
The Soccer Tournament (TST) has announced a media rights deal with NBC Sports to...
15/04/2026
JB&A will host the Pre-NAB 2026 Technology Event on April 17-18 at Flamingo Las Vegas, ahead of NAB Show. The event features hands-on demonstrations and technic...
15/04/2026
The Sennheiser Group will exhibit at NAB Show 2026 (Booth 4931, Central Hall), with demonstrations from Sennheiser, Neumann, and Merging across three areas: Rel...
15/04/2026
NAB Show 2026 will take place April 18-22 at the Las Vegas Convention Center, wi...
15/04/2026
AI-Media has announced the LEXI Text Encoder and LEXI Voice Encoder at NAB Show 2026, the company's first new encoder hardware release in more than a decade...
15/04/2026
Italian camera support manufacturer Cartoni will introduce several new products at NAB Show 2026 (Booth C6540, Central Hall), including the Master 30 OB fluid h...
15/04/2026
Lawo and swXtch.io have announced a memorandum of understanding at NAB Show 2026, under which Lawo will explore incorporating swXtch.io's groundSwXtch softw...
15/04/2026
CacheFly will exhibit at NAB Show 2026 (Booth W3129, April 19-22, Las Vegas Convention Center), showcasing three new additions to its content delivery platform:...
15/04/2026
Synamedia has announced GO Shorts, a new module within its Synamedia Go OTT platform that uses AI to convert an operator's existing content library into a s...
15/04/2026
The NAB Show kicks off on Saturday, and the SVG and SVG Europe editorial teams a...
15/04/2026
AJA Video Systems has announced an agreement to acquire Comprimato, a live video encoding and processing software company. The deal will unite the two companies...
15/04/2026
Prime Video Sports' NBA Playoffs coverage, which includes the entire SoFi NB...
15/04/2026
Just announced, the SDE standard provides a unified method and file format to ensure consistent and reliably comparable noise predictions
Sports and entertainm...
15/04/2026
From immersive storytelling to laugh-out-loud comedies, podcasts are booming in ...
15/04/2026
Books have always moved with us, whether tucked in our bags or humming in our he...
15/04/2026
For many artists, independent venues are where music careers begin and fan communities take shape. Independent venue operators work hard every day to keep local...
15/04/2026
From gripping thrillers to poignant memoirs, the 21st century has had no shortage of unforgettable books. To celebrate the standout storytelling of our modern e...
15/04/2026
Vintage broadcast experts release second plug-in
Telsie T is the second plug-in to be released by SonicWorld, a German audio company who specialise in servi...
15/04/2026
Includes eight free UAD plug-ins
Universal Audio's latest bundle brings together a selection of their renowned plug-ins and virtual instruments, and is ...
15/04/2026
Maximum uptime for broadcasters: Rohde & Schwarz launches R&S BroadcastShield at...
15/04/2026
Image courtesy of MD Helicopters...
15/04/2026
Virginia Gov. Abigail Spanberger, L3Harris VP Mark Farley, and state and local l...
15/04/2026
U.S. Space Forces Ground-Based Optical Sensor System upgrade at the Maui Space S...
15/04/2026
NBCU-Versant notches 13.1% of TV viewing in February, its best since August 2024...
15/04/2026
New data reveals older Kiwis are financially resilient, loyal to local products,...
15/04/2026
aconnic AG (ISIN: DE000A0LBKW6), Munich, announces the market launch of the ACCE...
15/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
15/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
15/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
15/04/2026
Evergent introduces its Agentic Revenue Orchestration Platform, transforming how subscription businesses across direct-to-consumer streaming, pay-TV, telecommun...
15/04/2026
Harmonic's XOS Media Processor Delivers Exceptional Video Quality to More than Half of U.S. Public Media Viewership
Harmonic (NASDAQ: HLIT) today announce...
15/04/2026
LONGMONT, COLORADO, APRIL 15, 2026 DPA Microphones N Series Digital Wireless System users in North America can now take full advantage of the system's exc...
15/04/2026
Cobalt Iron, a leading provider of SaaS-based enterprise data protection, today announced the launch of Compass Tape Gateway (CTG), a transformative enhancemen...
15/04/2026
Disguise to Showcase Cutting-Edge Experience Tech for Sports, Broadcast and More...
15/04/2026
Arooj Aftab Makes the Music She Wants to Hear The singular artist explores the juxtaposition of grief and joy, dark and light, in her distinctive sound.
Apri...
15/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
15/04/2026
Interra Systems, a provider of end-to-end quality assurance solutions for the digital media industry, is proud to announce its central role in the digital trans...