Sony Pixel Power calrec Sony

Vidinet Cognitive Services - AWS Speech to Text

16/10/2020

Transcribe your content from speech to text- why?

There are many reasons to transcribe your spoken content in your media. The first reason that comes to mind is, of course, subtitling. Not only in the natively spoken language but also in translated versions. According to multiple research, subtitled videos improve reach, CTA, reactions, and share rates significantly. The second reason is, of course, to help you find the content you are looking for - do you remember the soundbite that the CEO made in that speech - but where is it?

From a business perspective, it also essential to understand how Search Engine Optimization (SEO) is affected by subtitling. Video in itself is obviously not text-based, so any information that informs Google what the video content describes benefits the ranking of the video. Subtitling your video to not just one language but many, therefore, could improve your SEO and visibility. Makes sense?

These are just some of the benefits of making subtitling in preferably more than one language available for your content.

However, for some of you, there are also new regulations to consider. An E.U. directive 2016/2102/EU now states that all member states must include subtitling on all official video information to comply with the U.N. Convention on the Rights of Persons with Disabilities (CRPD). This includes video information from government, schools, and other official organizations, including private companies that delivers information for public viewing.

Similar regulations have been present in the U.S. for many years. The most recent regulation, The 21st Century Communications, and Video Accessibility Act of 2010, states the presence of closed captions on material produced and distributed in the U.S. and can be accessed in the U.S.

Transcribe your content - but how?

Traditionally, transcribing speech to text has been a human task only. With the introduction of the new machine learning algorithms, this is now changing, and we can see how machines and humans can interact and cooperate in this area. Machine learning transcribing software proves more and more accurate, and with today's score at around 80 % or higher depending on the quality of material, the software-based services can offload a lot of initial work that would typically be done by humans only.

So, instead of spending 8 hours on manually transcribing a 1-hour video, you will be able to improve your subtitling distribution workflow by offloading the first 80 % of work to a cognitive automatic subtitling algorithm such as the VCS (Vidispine Cognitive Services) in Vidinet.

With the introduction of VCS, we now take Vidispine API and Vidinet to the next level. The Vidinet Cognitive Services is a core architecture designed to manage cognitive services from a growing number of providers on the market. In this first release of VCS, you will find cognitive services based on the AWS Transcribe libraries.

Vidinet and AWS Speech to Text - a short introduction.

Vidinet is our media supply chain platform where Vidispine customers add and configure different services for their on-premise, cloud, or hybrid environment. In here, you can now access VCS Speech to Text and add this service to your infrastructure - or just your trial account.

Let s take a quick look at a UI and how you can test the VCS Speech to Text functionality.

After uploading your content, choose Analyze to enable the AWS transcription service for your video. Vidinet will provide you with a cost estimate for the service as a basis for your calculations.

When the analysis and transcription have finished, you can easily search and navigate for the results.

The Vidispine UIIt is essential to understand that our Vidispine Development Toolkit (VDT) allows you to design any user interface (UI) that works for your environment. In these examples, we have provided a UI that provides basic functionality for testing the Vidispine API. As you can see, the VCS Speech to Text service provides you with not only a transcription and time-code but also a simple interface for manual adjustment of the auto-generated text.

The Vidispine Development Toolkit (VDT) is free and includes multiple packages

Low-level javascript SDK for front/backend

React wrappers

Prebuilt components using https://material-ui.com/ (react components using Googles material design CSS)

With this brief introduction to the VCS Speech to Text service in Vidinet, it is time for you to test this service for yourself. Remember that the functionality and accuracy of machine learning also algorithms improve over time.

If you are using a transcription service or are working manually with speech to text today, you will most likely benefit from VCS Speech to Text in Vidinet.

Amazon Transcribe Pricing - how much?

When you try out the VCS Speech to Text, you will get an automatic cost estimate based on the amazon transcribe pricing and the source duration for the job you are starting. Use this estimate as a basis for calculating the price for the automation of speech to text in your media supply chain.

Currently, we charge 0,024 USD per content minute, but remember that you only pay when you use the service. You will scale up or pause your media supply chain whenever your business model requires it.

This flexibility is just one of many advantages when building your media supply chain with Vidispine.

Related Articles

Vidinet Cognitive Services

Create intelligent workflows with Vidinet Cognitive Services.

Why we are Going Cognitive

In an interview with Ralf Jansen, you can learn more about Vidinet Cognitive Services, why it is important, and how you can use it.

Webinar: Basics of VidiNet Cognitive Services

This webinar gives insights about our AI strategy and how the integration in the VidiNet ecosystem will work. We also demonstrate the first integrations in acti
LINK: https://www.vidispine.com/resources/blog/vidinet-cognitive-services-aw...
See more stories from vidispine

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

23/05/2026

FOX Sports, IMS Productions Scale Up Indy 500 Production With New In-Car Cameras, AR Graphics, Cinematic Sets

In its second year as rightsholder, FOX Sports goes bigger across the board for ...

23/05/2026

Inside Apple TVs MLS iPhone Production with Royce Dickerson, Apple Live Sports, Executive Producer

Tonight's MLS matchup between the LA Galaxy and the Houston Dynamo FC will m...

23/05/2026

IK Multimedia reveal ReSing Voices Japanese Pack

AI-powered vocal tool gains first new language expansion IK Multimedia's AI-powered voice-creation software has seen a number of updates since it launch...

23/05/2026

Building a better future: Nielsen celebrates Global Volunteer Month and Earth Day 2026 with record participation

Nielsen Global Leadership Network graduates celebrate Earth Day 2026 Nielsen vo...

23/05/2026

Gray Media Names New Station General Managers

Share Copy link Facebook X Linkedin Bluesky Email...

23/05/2026

New CIMM Paper Urges Industry to Rethink How Media Is Evaluated

Share Copy link Facebook X Linkedin Bluesky Email...

23/05/2026

Lawo to Showcase Edge One, Efficient IP Workflows at InfoComm 2026

Share Copy link Facebook X Linkedin Bluesky Email...

23/05/2026

Spectrum Launches Ultra-Low Latency Internet

Share Copy link Facebook X Linkedin Bluesky Email...

22/05/2026

Germanys Magenta TV Selects DMC to Provide FIFA World Cup Technical Support for Studios in Munich, New York City

Germany's Magenta TV, which will have 44 exclusive FIFA World Cup match broa...

22/05/2026

DAZN Grabs IFAF Flag Football Global Rights

DAZN, the world's leading sports entertainment platform, has acquired global broadcast rights to the International Federation of American Football's ( I...

22/05/2026

ATHLOS 2026 Season Set for October Debut in London; Aurora Media Worldwide Named Host Broadcast Partner

ATHLOS, the all-women's professional track and field league, has announced i...

22/05/2026

NATAS to Stream Sports, News, and Documentary Emmy Awards Live on YouTube

The National Academy of Television Arts & Sciences (NATAS) today announced that the 47th Annual Sports Emmy Awards and the 47th Annual News & Documentary Emmy A...

22/05/2026

Wooden Camera Rolls Out New Blackmagic URSA Accessories

Wooden Camera today announced the release of new accessories for the Blackmagic URSA Cine Immersive. The new lineup includes a redesigned Top Plate and Side Rai...

22/05/2026

YES Network, OTT Advisors Extend Streaming Partnership for Sixth Season

YES Network and OTT Advisors have announced a sixth consecutive season of their streaming partnership, continuing their collaboration on the Gotham app. OTT Adv...

22/05/2026

NESN Monster Week' Returns With Full Red Sox Broadcast From Atop Green Monster

NESN, New England's premier sports network, will again turn its camera to Fe...

22/05/2026

Dale Pro Audio RF Over Fiber Webinar Set for May 28

Dale Pro Audio is hosting an RF over Fiber Livestream Webinar on May 28 from 1-2:30 pm EST. With major sporting events and large-scale productions putting incre...

22/05/2026

Audio-Technica Appoints Humrichouser, Schanz to New Roles

Audio-Technica has announced key leadership appointments designed to further strengthen its sales organization and drive continued growth across the Americas. M...

22/05/2026

Scott Coker Launches Global MMA League With $60 Million in Backing

After nearly four decades shaping the global combat sports landscape, Scott Coker has announced a powerful return as he looks to build a new international mixed...

22/05/2026

Skyline Launches xOps Vanguard Runway for Autonomous Era

Skyline Communications, the company behind the globally deployed DataMiner xOps platform, today announced the launch of xOps Vanguard Runway, a strategic accele...

22/05/2026

The American Rodeo Takes Over Globe Life Field for Championship Weekend

For the fully onsite production, 30 cameras - including a SkyCam and Megalodon - will capture the action in Texas One of the world's biggest rodeo producti...

22/05/2026

Argentinas Torneos Taps Imagine Versio for Playout Operations Upgrade

Leading Argentina-based sports media company Torneos y Competencias S.A. has modernized its playout operations, implementing a fully redundant, multichannel env...

22/05/2026

Owl AI and Major League Pickleball Go Live with First-Ever AI Officiating System Powered by Broadcast Cameras and the Cloud

As the 2026 Major League Pickleball season kicks off this weekend in Dallas, it ...

22/05/2026

Shure, Edge Sound Research Look to Innovate via Partnership

Shure has become a minority investor in Edge Sound Research, a start-up company that is developing new experiential audio technologies that redefine how many au...

22/05/2026

SVG Rewind: MLBs UmpCam AR System Puts Fans Inside the Strike Zone Like Never Before

In advance of this year's Sports Emmy Awards, SVG is taking a deep dive into...

22/05/2026

Jelly Roll Offers Up 2026 Stanley Cup Playoff Theme Song for NHL, Amazon Music

The National Hockey League (NHL) and Amazon Music announced that GRAMMY Award-winning superstar Jelly Roll will provide the official theme song of the 2026 Stan...

22/05/2026

David Pogue, Andy Beach Keynotes Highlight Silicon Valley Video Summer Camp, July 14 at De Anza College

David Pogue will keynote SVV Summer Camp and discuss Apple at 50: How the World...

22/05/2026

FOX Sports, IMS Productions Scale Up Indy 500 Production in Year Two With New In-Car Cameras, AR Graphics, and Cinematic Sets

In its second year as rightsholder, FOX Sports goes bigger across the board for ...

22/05/2026

FOX Sports' Indy 500 Director Mitch Riggin on the Tech and Storytelling for the Greatest Spectacle in Racing

The broadcaster is drawing on lessons learned in its first year of covering the ...

22/05/2026

How Spotify's Rebuilt Ad Platform Is Delivering New Value for Brands

At our 2026 Investor Day, we shared an inside look at the rebuild of our advertising business. This pivot to our own purpose-built platform is already driving s...

22/05/2026

Spotify Levels Up Our Podcast Experience With New Features for Fans and Creators

Podcasting on Spotify continues to grow, and so do the ways listeners engage with it. At Investor Day 2026, we shared how we're building the next chapter of...

22/05/2026

CEDAR Audio introduce Icons Bundles

Limited-time collections now available Restoration experts CEDAR Audio have recently launched a new line of Icons plug-ins that make their powerful processo...

22/05/2026

Boss expand PS-1 Plugout Pedal

Three new classics join Model Pass line-up Boss' PX-1 Plugout Pedal offers an innovative approach to guitar pedals, providing users with a hardware stom...

22/05/2026

SGL Carbon commissions photovoltaic system and lays the foundation for a new nitrogen plant at its Meitingen site

At its Meitingen site, SGL Carbon has implemented two key projects to further de...

22/05/2026

Statement regarding 2026 National NAIDOC Lifetime Achievement Award for the late Rhoda Roberts AO

Statement regarding 2026 National NAIDOC Lifetime Achievement Award for the late...

22/05/2026

Polsat Reclaims Second Place and ByteDance Enters Top 10 as Polish Viewing Moves Beyond the Living Room in April

Latest data reveals steady distributor rankings, a seasonal shift toward digital...

22/05/2026

FCC Votes to Update Disaster Information Reporting System

Share Copy link Facebook X Linkedin Bluesky Email...

22/05/2026

Amagi delivers 30 per cent revenue growth in FY26 Adjuste...

Amagi Media Labs Limited (NSE: AMAGI, BSE: 544679), a cloud-native SaaS platform providing AI-enabled solutions to global media and entertainment companies, tod...

22/05/2026

Annima Post Relies on Cintel to Revive Classic Mexican Films

An nima Post Relies on Cintel to Revive Classic Mexican Films Brie Clayton May 22, 2026 0 Comments Film scanner and DaVinci Resolve Studio help manage...

22/05/2026

Boris FX Sapphire Adds Optical Beauty and Hypnotic Textures

Boris FX Sapphire Adds Optical Beauty and Hypnotic Textures Jessie Electa Petrov May 22, 2026 0 Comments The 2026.5 release introduces advanced defocu...

22/05/2026

Deployment Preserves Trusted Workflows While Enabling a P...

Deployment Preserves Trusted Workflows While Enabling a Path to UHD and SMPTE ST 2110 Leading Argentina-based sports media company Torneos y Competencias S.A....

22/05/2026

Study: AI Labeling Does Not Hurt Video Ad Performance

Share Copy link Facebook X Linkedin Bluesky Email...

22/05/2026

NAB Show Makes 200+ Sessions Available on Demand

Share Copy link Facebook X Linkedin Bluesky Email...

22/05/2026

Torneos Upgrades Multichannel Playout with Imagine's Versio

Share Copy link Facebook X Linkedin Bluesky Email...

22/05/2026

Ex-Husband, Current Husband, One Wild Rescue: Korean Action Comedy Husbands in Action' Premieres June 19

Back to All News Ex-Husband, Current Husband, One Wild Rescue: Korean Action Co...

22/05/2026

Beta Da Silva hosts live performances from 20 new Irish artists in Sessions from Oblivion on 2FM's New Music Show

Catch the latest in Irish music live from venues such as Whelan's, R is n Du...

21/05/2026

CBS Sports Expands WNBA Tip-Off Show To Cover Half of 20-Game, Regular-Season Package

Game Creek Video Columbia and Celtic, NEP Supershooter 8 will house onsite produ...