Sony Pixel Power calrec Sony

Vidinet Cognitive Services - AWS Speech to Text

16/10/2020

Transcribe your content from speech to text- why?

There are many reasons to transcribe your spoken content in your media. The first reason that comes to mind is, of course, subtitling. Not only in the natively spoken language but also in translated versions. According to multiple research, subtitled videos improve reach, CTA, reactions, and share rates significantly. The second reason is, of course, to help you find the content you are looking for - do you remember the soundbite that the CEO made in that speech - but where is it?

From a business perspective, it also essential to understand how Search Engine Optimization (SEO) is affected by subtitling. Video in itself is obviously not text-based, so any information that informs Google what the video content describes benefits the ranking of the video. Subtitling your video to not just one language but many, therefore, could improve your SEO and visibility. Makes sense?

These are just some of the benefits of making subtitling in preferably more than one language available for your content.

However, for some of you, there are also new regulations to consider. An E.U. directive 2016/2102/EU now states that all member states must include subtitling on all official video information to comply with the U.N. Convention on the Rights of Persons with Disabilities (CRPD). This includes video information from government, schools, and other official organizations, including private companies that delivers information for public viewing.

Similar regulations have been present in the U.S. for many years. The most recent regulation, The 21st Century Communications, and Video Accessibility Act of 2010, states the presence of closed captions on material produced and distributed in the U.S. and can be accessed in the U.S.

Transcribe your content - but how?

Traditionally, transcribing speech to text has been a human task only. With the introduction of the new machine learning algorithms, this is now changing, and we can see how machines and humans can interact and cooperate in this area. Machine learning transcribing software proves more and more accurate, and with today's score at around 80 % or higher depending on the quality of material, the software-based services can offload a lot of initial work that would typically be done by humans only.

So, instead of spending 8 hours on manually transcribing a 1-hour video, you will be able to improve your subtitling distribution workflow by offloading the first 80 % of work to a cognitive automatic subtitling algorithm such as the VCS (Vidispine Cognitive Services) in Vidinet.

With the introduction of VCS, we now take Vidispine API and Vidinet to the next level. The Vidinet Cognitive Services is a core architecture designed to manage cognitive services from a growing number of providers on the market. In this first release of VCS, you will find cognitive services based on the AWS Transcribe libraries.

Vidinet and AWS Speech to Text - a short introduction.

Vidinet is our media supply chain platform where Vidispine customers add and configure different services for their on-premise, cloud, or hybrid environment. In here, you can now access VCS Speech to Text and add this service to your infrastructure - or just your trial account.

Let s take a quick look at a UI and how you can test the VCS Speech to Text functionality.

After uploading your content, choose Analyze to enable the AWS transcription service for your video. Vidinet will provide you with a cost estimate for the service as a basis for your calculations.

When the analysis and transcription have finished, you can easily search and navigate for the results.

The Vidispine UIIt is essential to understand that our Vidispine Development Toolkit (VDT) allows you to design any user interface (UI) that works for your environment. In these examples, we have provided a UI that provides basic functionality for testing the Vidispine API. As you can see, the VCS Speech to Text service provides you with not only a transcription and time-code but also a simple interface for manual adjustment of the auto-generated text.

The Vidispine Development Toolkit (VDT) is free and includes multiple packages

Low-level javascript SDK for front/backend

React wrappers

Prebuilt components using https://material-ui.com/ (react components using Googles material design CSS)

With this brief introduction to the VCS Speech to Text service in Vidinet, it is time for you to test this service for yourself. Remember that the functionality and accuracy of machine learning also algorithms improve over time.

If you are using a transcription service or are working manually with speech to text today, you will most likely benefit from VCS Speech to Text in Vidinet.

Amazon Transcribe Pricing - how much?

When you try out the VCS Speech to Text, you will get an automatic cost estimate based on the amazon transcribe pricing and the source duration for the job you are starting. Use this estimate as a basis for calculating the price for the automation of speech to text in your media supply chain.

Currently, we charge 0,024 USD per content minute, but remember that you only pay when you use the service. You will scale up or pause your media supply chain whenever your business model requires it.

This flexibility is just one of many advantages when building your media supply chain with Vidispine.

Related Articles

Vidinet Cognitive Services

Create intelligent workflows with Vidinet Cognitive Services.

Why we are Going Cognitive

In an interview with Ralf Jansen, you can learn more about Vidinet Cognitive Services, why it is important, and how you can use it.

Webinar: Basics of VidiNet Cognitive Services

This webinar gives insights about our AI strategy and how the integration in the VidiNet ecosystem will work. We also demonstrate the first integrations in acti
LINK: https://www.vidispine.com/resources/blog/vidinet-cognitive-services-aw...
See more stories from vidispine

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

07/04/2026

ASG Names Andrea Cummis VP of Systems Engineering

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

KTVJ Completes Major Signal Upgrade

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Hearst's WDSU to Air Million Dollar Rodeo Competition

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Grass Valley Launches Future Playmakers Program

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Saranyu Technologies Launches MATCH - Multi-View Sports S...

Designed for synchronized multi-stream playback, low-latency delivery, and real-time analytics, MATCH introduces a unified viewing experience for sports broadca...

07/04/2026

Berklee Students to Honor George Martin with Performance of Original Scores

Berklee Students to Honor George Martin with Performance of Original Scores The orchestra, led by associate professor Xander Rovang, will perform several work...

06/04/2026

Fab Five' Reunion Drives TNT and CBS's Experimental Final Four Altcast Built on REMI Workflow

Michigan legends bring a new voice to the broadcast as TNT Sports and CBS Sports...

06/04/2026

SVG New Sponsor Spotlight: Optikka CEO Daniel Evans on Scaling Sports Content with Programmatic Graphics

From high school sports all the way up to the major leagues, building high-quali...

06/04/2026

Quickplay and TwelveLabs Join AWS Business Outcomes Xcelerator Program

Quickplay, an AI company for the media and entertainment industry, has been accepted into the Advanced tier of the TwelveLabs Ecosystem Partner Program. Quickpl...

06/04/2026

Grass Valley Launches Future Playmakers Program for Students in Sports Production and Media Technology

Grass Valley has announced the Future Playmakers Program, a global initiative to...

06/04/2026

SVG All-Stars: Raasean Robinson, Gerente de Posproduccin y Operaciones de Estudio, FOX Deportes

El l der de operaciones impulsa la producci n en estudio mientras encuentra insp...

06/04/2026

SVG All-Stars: Raasean Robinson, Manager, Post Production and Studio Operations, FOX Deportes

The ops leader helps lead the charge in studio for the Spanish-language broadcas...

06/04/2026

Behind The Mic: SiriusXM Shares 2026 Masters Broadcast Team; ESPN to Produce Over 140+ Hours of Masters Live Coverage

Behind The Mic provides a roundup of recent news regarding on-air talent, includ...

06/04/2026

NHL Opens Innovation Lab in Partnership with Verizon, New Jersey Devils

The National Hockey League (NHL), in partnership with Verizon and the New Jersey Devils, today announced the opening of the NHL Innovation Lab powered by Verizo...

06/04/2026

ESPN+ To Stream Inaugural Rock League Curling Season

Rock League, a new professional curling league, has announced that ESPN+ will stream its inaugural 2026 season for fans in the United States. The first Rock Lea...

06/04/2026

ASG Appoints Andrea Cummis as VP of Systems Design and Engineering

Advanced Systems Group has announced the appointment of Andrea (Andy) Cummis as Vice President of Systems Design and Engineering. In this role, she will lead de...

06/04/2026

Source Media Group Launches Source Golf, a Creator-Driven YouTube Network Targeting Next-Gen Fans

Backed by Bolt Ventures, the venture brings Bryson DeChambeau, Grant Horvat, and...

06/04/2026

How the NHL's Innovation Lab Will Take Broadcast, Fan, and Team Tech to New Heights

With this environment we can start that collaboration even earlier because we ca...

06/04/2026

K-Pop Artist ENHYPEN Host The Blood Diary,' a New Video Podcast Series From HYBE

Like the immortal lives of vampires, some stories never really end. That's t...

06/04/2026

From Audio to IRL: How Let's Get Haunted' Is Building Community With Spotify RADAR

As podcasting continues to evolve, growth increasingly means building beyond aud...

06/04/2026

FSK Audio update Bark24 Dyn

Multiband dynamics plug-in enhanced California-based developer FSK Audio have released a significant update for their innovative multiband dynamics processo...

06/04/2026

IK Multimedia introduce ToneNET Preset Sharing

Share official & user-created full-rig presets IK Multimedia's latest TONEX update makes it possible for users of the popular amp and effects modelling ...

06/04/2026

Baseball 2026: More AI, Better Viewing Choices

Share Copy link Facebook X Linkedin Bluesky Email...

06/04/2026

JB&A Announces Details for its Pre-NAB 2026 Event

Share Copy link Facebook X Linkedin Bluesky Email...

06/04/2026

Dalet Showcases Dalia Agentic AI and End-to-End Media Workflows at NAB Show 2026

Dalet Showcases Dalia Agentic AI and End-to-End Media Workflows at NAB Show 2026 Brie Clayton April 6, 2026 0 Comments Dalet, a leading technology and...

06/04/2026

OpenDrives Shows Off Sports Expertise in Sports Business Hub located in NAB Show's West Hall

OpenDrives Shows Off Sports Expertise in Sports Business Hub located in NAB Show...

06/04/2026

Proton to Demonstrate 3D Application at NAB 2026

Proton to Demonstrate 3D Application at NAB 2026 Brie Clayton April 6, 2026 0 Comments Yet further creative potential unleashed through innovation in ...

06/04/2026

Autoscript Highlights Voice-Driven Prompting and PTZ Solutions at NAB 2026

Autoscript Highlights Voice-Driven Prompting and PTZ Solutions at NAB 2026 Brie Clayton April 6, 2026 0 Comments Experience Autoscript Voice, PTZ prom...

06/04/2026

Mediaproxy Highlights Significant Enhancements to its LogServer suite at NAB Show 2026

Mediaproxy Highlights Significant Enhancements to its LogServer suite at NAB Sho...

06/04/2026

Re-Architectured PCC Software Streamlines and Enhances the Full High-Speed Imaging Workflow

Wayne, N.J., April 6th, 2026 Phantom High-Speed announces the release of PCC 4...

06/04/2026

Tribeca Studios And Lilly Announce Winners Of Inaugural Vital Stories Filmmaker Program

April 6th, 2026 TRIBECA STUDIOS AND LILLY ANNOUNCE WINNERS OF INAUGURAL VITAL...

06/04/2026

Netflix Expands Kids Entertainment Lineup With Playground App for Games, New Shows & Returning Favorites

Back to All News Netflix Expands Kids Entertainment Lineup With Playground App ...

05/04/2026

Latest SoundBridge update now live

Tackles all reported bugs! SoundBridge have just announced the launch of a new update that introduces a couple of minor changes to their remote collaboratio...

04/04/2026

Don't Be Lame: Arizona Men's Basketball Social Team Aims To Catch the Attention of Wildcats Fans

The University of Arizona's Men's Basketball team has only loss twice th...

04/04/2026

HDR Makes Its Men's Final Four Debut as CBS Sports and TNT Sports Collaborate on New Camera Tools and an IP-Powered Compound

1080p HDR arrives, a new generation of storytelling tools takes center stage, an...

04/04/2026

Fab Five Reunion Drives TNT and CBS's Experimental Final Four Altcast Built on REMI Workflow

Michigan legends bring a new voice to the broadcast as TNT Sports and CBS Sports...

04/04/2026

Flock Audio's latest Patch App DX update

Faster, cleaner and more intuitive than ever The control software for Flock Audio's digitally controlled patchbay systems has just been treated to an up...

04/04/2026

Sinclair to FCC: Broadcast Sports Drives Investment in Local News

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

Study: Worldwide Telecom Capex to Decline in 2026,

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

Ateme Delivers Full End-to-End Streaming Platform to Moldtelecom

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

FCC Plans Spending, Regulatory Fee Revenue Reductions in FY 2027

Share Copy link Facebook X Linkedin Bluesky Email...

04/04/2026

DHD Introduces AI-Based Audio Noise Reduction to XD3 IP Core

DHD Introduces AI-Based Audio Noise Reduction to XD3 IP Core Brie Clayton April 3, 2026 0 Comments The accompanying image shows the rear panel of the ...

04/04/2026

Macnica Redefines ST 2110 Flexibility with Two Speeds on One Card

Macnica Redefines ST 2110 Flexibility with Two Speeds on One Card Brie Clayton April 3, 2026 0 Comments New for NAB Show 2026, MEP100 SmartNIC now sup...

04/04/2026

Unified Media Workflows for Story-Centric Production

Unified Media Workflows for Story-Centric Production Brie Clayton April 3, 2026 0 Comments Framelight X unifies field capture, editing and publishing ...

03/04/2026

TNT Sports and CBS Sports To Reunite Michigan's Iconic Fab Five' for Special NCAA Men's Final Four Altcast on truTV and HBO Max

Michigan's Fab Five will reunite for an alternate presentation of the Mich...