Sony Pixel Power calrec Sony

Vidinet Cognitive Services - AWS Speech to Text

16/10/2020

Transcribe your content from speech to text- why?

There are many reasons to transcribe your spoken content in your media. The first reason that comes to mind is, of course, subtitling. Not only in the natively spoken language but also in translated versions. According to multiple research, subtitled videos improve reach, CTA, reactions, and share rates significantly. The second reason is, of course, to help you find the content you are looking for - do you remember the soundbite that the CEO made in that speech - but where is it?

From a business perspective, it also essential to understand how Search Engine Optimization (SEO) is affected by subtitling. Video in itself is obviously not text-based, so any information that informs Google what the video content describes benefits the ranking of the video. Subtitling your video to not just one language but many, therefore, could improve your SEO and visibility. Makes sense?

These are just some of the benefits of making subtitling in preferably more than one language available for your content.

However, for some of you, there are also new regulations to consider. An E.U. directive 2016/2102/EU now states that all member states must include subtitling on all official video information to comply with the U.N. Convention on the Rights of Persons with Disabilities (CRPD). This includes video information from government, schools, and other official organizations, including private companies that delivers information for public viewing.

Similar regulations have been present in the U.S. for many years. The most recent regulation, The 21st Century Communications, and Video Accessibility Act of 2010, states the presence of closed captions on material produced and distributed in the U.S. and can be accessed in the U.S.

Transcribe your content - but how?

Traditionally, transcribing speech to text has been a human task only. With the introduction of the new machine learning algorithms, this is now changing, and we can see how machines and humans can interact and cooperate in this area. Machine learning transcribing software proves more and more accurate, and with today's score at around 80 % or higher depending on the quality of material, the software-based services can offload a lot of initial work that would typically be done by humans only.

So, instead of spending 8 hours on manually transcribing a 1-hour video, you will be able to improve your subtitling distribution workflow by offloading the first 80 % of work to a cognitive automatic subtitling algorithm such as the VCS (Vidispine Cognitive Services) in Vidinet.

With the introduction of VCS, we now take Vidispine API and Vidinet to the next level. The Vidinet Cognitive Services is a core architecture designed to manage cognitive services from a growing number of providers on the market. In this first release of VCS, you will find cognitive services based on the AWS Transcribe libraries.

Vidinet and AWS Speech to Text - a short introduction.

Vidinet is our media supply chain platform where Vidispine customers add and configure different services for their on-premise, cloud, or hybrid environment. In here, you can now access VCS Speech to Text and add this service to your infrastructure - or just your trial account.

Let s take a quick look at a UI and how you can test the VCS Speech to Text functionality.

After uploading your content, choose Analyze to enable the AWS transcription service for your video. Vidinet will provide you with a cost estimate for the service as a basis for your calculations.

When the analysis and transcription have finished, you can easily search and navigate for the results.

The Vidispine UIIt is essential to understand that our Vidispine Development Toolkit (VDT) allows you to design any user interface (UI) that works for your environment. In these examples, we have provided a UI that provides basic functionality for testing the Vidispine API. As you can see, the VCS Speech to Text service provides you with not only a transcription and time-code but also a simple interface for manual adjustment of the auto-generated text.

The Vidispine Development Toolkit (VDT) is free and includes multiple packages

Low-level javascript SDK for front/backend

React wrappers

Prebuilt components using https://material-ui.com/ (react components using Googles material design CSS)

With this brief introduction to the VCS Speech to Text service in Vidinet, it is time for you to test this service for yourself. Remember that the functionality and accuracy of machine learning also algorithms improve over time.

If you are using a transcription service or are working manually with speech to text today, you will most likely benefit from VCS Speech to Text in Vidinet.

Amazon Transcribe Pricing - how much?

When you try out the VCS Speech to Text, you will get an automatic cost estimate based on the amazon transcribe pricing and the source duration for the job you are starting. Use this estimate as a basis for calculating the price for the automation of speech to text in your media supply chain.

Currently, we charge 0,024 USD per content minute, but remember that you only pay when you use the service. You will scale up or pause your media supply chain whenever your business model requires it.

This flexibility is just one of many advantages when building your media supply chain with Vidispine.

Related Articles

Vidinet Cognitive Services

Create intelligent workflows with Vidinet Cognitive Services.

Why we are Going Cognitive

In an interview with Ralf Jansen, you can learn more about Vidinet Cognitive Services, why it is important, and how you can use it.

Webinar: Basics of VidiNet Cognitive Services

This webinar gives insights about our AI strategy and how the integration in the VidiNet ecosystem will work. We also demonstrate the first integrations in acti
LINK: https://www.vidispine.com/resources/blog/vidinet-cognitive-services-aw...
See more stories from vidispine

Most recent headlines

09/11/2025

Dalet Unveils Agentic AI Media Workflows at IBC2025

Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...

23/10/2025

Unlocking Character: Sportcast on Executing the Bundesliga and Bundesliga 2 New Season Production

Unlocking character: Sportcast on executing the Bundesliga and Bundesliga 2 new ...

23/10/2025

Clear Coordination: Juggling the New Bundesliga Rights Cycle Requirements and Pushing Innovation Forward at Sportcast

Clear coordination: Juggling the new Bundesliga rights cycle requirements and pu...

23/10/2025

Analysis: Is Piracy Just the Cost of Doing Business?

Analysis: Is piracy just the cost of doing business? By Callum McCarthy, Editor-at-Large Tuesday, October 21, 2025 - 09:58 Print This Story It's high ...

23/10/2025

ESPN's Adam Whitlock on Driving Real-World Innovation Across the Video-Transmission Industry

ESPN's Adam Whitlock on Driving Real-World Innovation Across the Video-Trans...

23/10/2025

SVG TranSPORT 2025 Unites 300+ Industry Leaders in New York for Deep Dive Into Live Transmission Technology

SVG TranSPORT 2025 Unites 300+ Industry Leaders in New York for Deep Dive Into L...

23/10/2025

NBA Tip-Off: League Starts Season With Two New Broadcast Partners, In-House NBA TV/NBA App Ops

NBA Tip-Off: League Starts Season With Two New Broadcast Partners, In-House NBA ...

23/10/2025

NFL Deepens Business Partnership with EA Sports; More Madden Casts to Come?

NFL Deepens Business Partnership with EA Sports; More Madden Casts to Come?EA Sports will remain the exclusive producer and distributor of Madden NFL video game...

23/10/2025

NFL Moves Pro Bowl Games Indoors and to Super Bowl Week; Leans Into a Made-for-TV Presentation

NFL Moves Pro Bowl Games Indoors and to Super Bowl Week; Leans Into a Made-for-T...

23/10/2025

Together in Time: Alan Domnguez on the Common Themes in his Films and Sundance Institute's Support

By Alan Dominguez Recently I have been thinking about the intersection of two e...

23/10/2025

Coexistence, My Ass! Dares Peacemaking to Not Be So Serious

(L-R) Amber Fares and Noam Shuster Eliassi attend the 2025 Sundance Film Festival premiere of Coexistence, My Ass! at the Egyptian Theatre on January 26, 2025...

23/10/2025

A Force Multiplier for High-Frequency Communications: The L3Harris ARGUS-HF

The new solution is industry's first multi-channel receiver available for L3Harris's resilient tactical high-frequency data waveforms....

23/10/2025

Survey: Americans Concerned' About AI's Impact on Journalism

NEW YORK During a high-profile session at NAB Show New York, new survey data was shared that revealed significant public concern over artificial intelligence (A...

23/10/2025

Fox Weather Taps T-Mobile's SuperMobile for Extreme Weather Coverage

BELLEVUE, Wash. and NEW YORK Fox Weather has tapped T-Mobile as its preferred communications provider and said all of its reporters will be equipped with SuperM...

23/10/2025

Mike Wright Joins Lawo as VP, Sales, North America

RASTATT, Germany Broadcast and media workflow technology vendor Lawo has tapped Mike Wright as VP of sales, North America....

23/10/2025

European Broadcaster ARTE Taps Grass Valley for IP Transition

MONTREAL European cultural broadcaster ARTE has selected Grass Valley LDX 135 cameras and Creative Grading solution as part of its move from SDI/1080i to a nati...

23/10/2025

Scripps Names Daniel Parsons Chief Information Security Officer

CINCINNATI The E.W. Scripps Company has named Daniel Parsons as its new chief information security officer, effective Oct. 20....

23/10/2025

WWTV Completes IP Studio Upgrade

ALAMEDA, Calif. Northern Michigan broadcaster WWTV recently completed a major IP-based upgrade that connects its new Traverse City studio with its control room ...

23/10/2025

Verizon Fios TV, Nexstar Blackout Looms as Contract Ends on Oct. 24

A deadline is looming for a new carriage deal between Verizon's Fios TV and Nexstar, with both Verizon and the pay TV-backed American Television Alliance bl...

23/10/2025

Survey: Americans 'Concerned' About AI's Impact on Journalism

NEW YORK During a high-profile session at NAB Show New York, new survey data was shared that revealed significant public concern over artificial intelligence (A...

23/10/2025

Fox Weather Taps T-Mobile's Supermobile for Extreme-Weather Coverage

BELLEVUE, Wash. and NEW YORK Fox Weather has tapped T-Mobile has as its preferred communications provider and announced that all Fox Weather reporters are being...

23/10/2025

PBS Taps Amazon Bedrock to Improve Search on Digital Platforms

PBS will use generative AI from Amazon Web Services to provide enhanced search results to viewers on the PBS App and PBS LearningMedia platforms, the network an...

23/10/2025

The Resurrected' Marks First Chinese-Language Series to Launch Netflix Profile Icons

Back to All News The Resurrected' Marks First Chinese-Language Series to L...

23/10/2025

RT publishes Register of External Activities for Q2/2025 (statistical summary)

RT is today publishing a statistical summary from the Register of External Activities for the second quarter of 2025. The RT Register of External Activities ...

23/10/2025

THE BOYS ARE BACK IN TOWN THE 2 JOHNNIES LATE NIGHT LOCK IN RETURNS FOR SERIES 3

Series three of the award winning, hit comedy entertainment series The 2 Johnnies Late Night Lock In is back on your screens, celebrating the very best of all t...

23/10/2025

Fleadh Cheoil, presented by Dith S and Muireann Nic Amhlaoibh returns to RT

Performances by Michael Flatley, Andy Irvine, Cuckoo's Nest, Foster and Allen and more Friday 24 October, 8pm on RT One and RT Player Fleadh Cheoil re...

23/10/2025

Fangs Out, Frames Up: Vampire: The Masquerade - Bloodlines 2' Leads a Killer GFN Thursday

The nights grow longer and the shadows get bolder with Vampire The Masquerade: B...

22/10/2025

Prime Video Inks Deal To Present NFL Black Friday Game Worldwide

Prime Video Inks Deal To Present NFL Black Friday Game Worldwide By SVG Staff Wednesday, October 22, 2025 - 10:06 am Print This Story | Subscribe Story ...

22/10/2025

NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REMCO Shows

NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REM...

22/10/2025

FloSports Empowers Division II, III Athletic Departments With Turnkey Production Suite for Livestreaming Production

FloSports Empowers Division II, III Athletic Departments With Turnkey Production...

22/10/2025

Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Video Executives in NYC

Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Vide...

22/10/2025

Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ballgame - and Has Amazing Audio

Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ba...

22/10/2025

Celebrate the Anniversaries of Shakira's Landmark Albums With Spotify-Exclusive EP and Video Special

In 1995, a young Colombian artist released an album that would change Latin pop ...

22/10/2025

SGL Carbon expands sustainable energy supply and invests in photovoltaic system at Meitingen site

Over the past few months, a photovoltaic system has been installed on a three-he...

22/10/2025

Orion Meets SLS: L3Harris Technology Ready to go to the Moon

The Orion spacecraft for NASA's Artemis II mission is stacked on the Space Launch System (SLS) rocket in High Bay 3 of the Vehicle Assembly Building at Kenn...

22/10/2025

Hybrid SATCOM: Delivering Resilient and Agile Connectivity Today

L3Harris' Hybrid SATCOM is resilient by design, offering path diversity that eliminates vulnerabilities by routing data across the best available networks i...

22/10/2025

The 2025 NAB Show New York Opens With More Than 12,000 Attendees Expected

WASHINGTON, D.C. Organizers of NAB Show New York said they are expecting more than 12,000 registered attendees from about 100 countries along with 260 exhibitor...

22/10/2025

The 2025 NAB Show New York Set to Open with More Than 12,000 Attendees Expected

WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...

22/10/2025

Masque Sound and Jaffe Holden Create Transformative Perfo...

Masque Sound, a leading theatrical sound reinforcement, installation and design company, supplied an extensive gear package of professional-grade equipment for ...

22/10/2025

Lightware UCX-3x3-TPX-RX20 sets new standard for connecte...

Lightware, a global leader in signal management and AV connectivity solutions, is seeing strong market momentum for the UCX-3x3-TPX-RX20, a compact transmitter-...

22/10/2025

Chyron Releases PAINT 10.2 Telestration Platform

MELVILLE, N.Y. Chyron has released PAINT 10.2, the latest update for its telestration platform, adding support for SMPTE ST 2110 IP workflows, expanding brandin...

22/10/2025

NBCUniversal Invests in ATSC 3.0 Authority Behind Run3TV

WASHINGTON Run3TV today said NBCUniversal is joining as an investor in the ATSC 3.0 Framework Authority, which develops the Run3TV NextGen TV application platfo...

22/10/2025

swXtch.io to Feature SRT-X Gateway, groundSwXtch at NAB Show New York

ATLANTA swXtch.io will feature two new networking solutions extending the company's reach across more cloud and on-prem workflows at NAB Show New York, set ...

22/10/2025

HBO Max Increases Prices for All Tiers

The Warner Bros. Discoverys HBO Max streaming services has increased prices for all its streaming tiers effectively immediately for new customers. Existing cust...

22/10/2025

OpenDrives Inks Agreement With Versatile Distribution Services

LOS ANGELES OpenDrives has signed a new distribution partnership deal with Versatile Distribution Services (VDS) to strengthen its channel and streamline how it...

22/10/2025

The 2025 NAB Show New York Set to Open with More Than 12,000 Attendees

WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...

22/10/2025

Samora Pinderhughes Brings Immersive Sound to Berklee's Signature Series

Samora Pinderhughes Brings Immersive Sound to Berklee's Signature Series The artist and composer, who's worked with Herbie Hancock, Robert Glasper, Co...

22/10/2025

BMI Day at Berklee Celebrates Composer Fil Eisler and Awards Scholarship to Student Jack Ryan

BMI Day at Berklee Celebrates Composer Fil Eisler and Awards Scholarship to Stud...

22/10/2025

Tribeca Announces Star-Studded Lineup of Membership Events Featuring Tracy Morgan, Alex Rodriguez, and Lucy Liu

October 22nd, 2025 TRIBECA ANNOUNCES STAR-STUDDED LINEUP OF MEMBERSHIP EVENTS F...

22/10/2025

Rohde & Schwarz and TRUMPF cooperate in drone defense

Rohde & Schwarz and TRUMPF cooperate in drone defense Rohde & Schwarz and TRUMPF partner to deliver a comprehensive drone defense solution combining Rohde & S...