Sony Pixel Power calrec Sony

AI Can Be Leveraged to Simplify, Enhance STT Services

01/07/2020

AI Can Be Leveraged to Simplify, Enhance STT Services

Author:Guy Finley Artificial intelligence (AI) can be used by media and entertainment companies to simplify and enhance all of their subtitling, translation and transcription (STT) services in the cloud, according to M&E technology firm Digital Nirvana.

Digital Nirvana's Russell Wise, SVP of sales and marketing, and Ed Hauber, its business development manager, used the June 24 webinar Leveraging AI for Speed & Efficiency in M&E STT to detail how Trance - the company's enterprise-level, cloud-based closed captioning and translation solution - can simplify the process, as a managed or self-service STT tool.

Bloomberg, Turner and other major media organizations are already using the plug-and-play, AI-powered offering to produce captions at record speed, improving productivity by 50% and more, according to Digital Nirvana. The workflow can be used across the industry, with media, post and caption service providers all able to take advantage.

Trance is a cloud-based, enterprise-level Software-as-a-Service (SaaS) platform that is used to generate automated transcripts, to create closed captions, to translate those captions into alternate languages and also to export captioned files in all known industry-supported formats, Hauber pointed out.

Trance is also fully web-based, he noted, adding: It's accessible via a LAN, WAN or even a basic Internet connection. As an enterprise tool, Trance is fully configurable for an unlimited number of users, groups and roles.

Administrators, meanwhile, can manage multiple projects, they can create manage users, define roles and permissions, as well as establish system presets, he said, while giving viewers a demonstration of Trance.

The Manage Presets section gives users the ability to define caption attributes, such as the number of lines, the line length and the total number of characters, he pointed out during the demo.

To get media into Trance, we have a tool that we use called Media Services Portal and, like Trance, Media Services Portal - also called MSP - [is] a cloud-based platform, which allows users to ingest any number of common audio and video file formats into Trance, he said. MSP can directly integrate with both FTP and Amazon S3, he also noted.

Digital Nirvana also offers an open application programming interface (API) to integrate Media Services Portal directly into large enterprise media systems, he pointed out. Using our API, those operators don't need to create a secondary workflow process to move media into and out of Trance - and this is a really big time-saving and productivity advantage of Trance, he said.

The Trance speech-to-text engine has created a highly-accurate transcript of the media that we just imported, he also showed during the demo, noting that eliminates the necessity of doing the manual transcribing of content and delivers huge productivity gains over conventional transcription methods. It is also highly accurate - between roughly 90 to 95 percent accurate - based on good good-quality content, he noted.

The transcript interface includes text on the right side of the screen and a media player on the left with intuitive controls to play back audio and video, he demonstrated. Also featured are tools that help provide fast text editing, including an auto highlight of potentially misspelled words and spell check, he showed. Users can also create captions in more than one language, he noted.

During the Q&A, he said: Unlike other providers, we're not limited to one specific speech-to-text engine. In fact, we, by design, do not operate that way. We constantly evaluate and measure the performance of all the best speech-to-text engines that exist in the marketplace today. And so, we're not limited to just one. And the reason that that's important is this technology is progressing and developing and advancing very quickly and so being tied to one or the other is inherently limiting. We would rather take the approach of using them all and continually measuring and evaluating them.

So, as an example, if we detect that Engine A' is performing better in scenarios - say where there is sports content, and we can even be more specific: domestic American basketball - we see that speech-to-text Engine A' is performing better in this application, we automatically in the background route that content based on machine learning capability to say we're going to route this client's content through this speech-to-text engine because we see it now as performing better than the other options, he explained.

There is a great degree of accuracy that we can accomplish by using that process, he noted.

Although Trance is currently not a live captioning solution, he was quick to say: It is on our roadmap and it is something that we're actively developing. So, live captioning with the ability to run our speech-to-text engine, to collapse the time of that speech-to-text process down to near real-time, or essentially real-time, giving an operator the ability to make very quick edits within a few seconds of live and be able to do that on the fly. That's something that we're evaluating and we're working towards as the technology matures and there's a degree of reliability and consistency that we can bring to the market that is on the roadmap for sure. Not today - but coming soon.

He went on to point out: We're constantly developing the product . This company really adheres to a philosophy and a down-to-earth principle in being very, very agile. And, as much as this is an enterprise tool, the product operates on a very agile basis, meaning it's able to take and respond to customer requests very, very quickly.

There is a long history at Digital Nirvana of continual development an
LINK: https://digital-nirvana.com/ai-can-be-leveraged-to-simplify-enhance-st...
See more stories from digitalnirvana

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

30/04/2026

Ocean Blue Software Launches ATSC 3.0 Inspector for Smart TVs

Share Copy link Facebook X Linkedin Bluesky Email...

30/04/2026

Digital Azul positions Lisbon as a remote production hub...

Scalable broadcast-grade production over public internet, replacing traditional OB workflows...

30/04/2026

V Nova Brings DTV Deployment LCEVC Ecosystem Progress and...

Live demonstrations highlight LCEVC ecosystem momentum, AI-powered video pipelines, and expansion across broadcast, streaming, and social media. DTV (TV 3.0) ...

30/04/2026

Student Spotlight: Matthew Leon

Student Spotlight: Matthew Leon The dual major shares his path from community college to Berklee, and how his heritage influences his work. April 29, 2026 B...

30/04/2026

Berklee Artists to Perform at Major Global Music Festivals

Berklee Artists to Perform at Major Global Music Festivals As part of the Berklee Popular Music Institute, students will perform at Lollapalooza, Governors Ba...

30/04/2026

Study: Rokus Low-Cost, Ad-Free Howdy Streamer Hits 1 Million Subs

Share Copy link Facebook X Linkedin Bluesky Email...

30/04/2026

MLS Innovation Lab Selects AI Partners

Share Copy link Facebook X Linkedin Bluesky Email...

30/04/2026

NRB Files FCC Complaint Over Jimmy Kimmel Live!' Monologue

Share Copy link Facebook X Linkedin Bluesky Email...

30/04/2026

D.C. Court Denies Emergency Stay of Nexstar/Tegna Merger

Share Copy link Facebook X Linkedin Bluesky Email...

30/04/2026

NAB Criticizes FCC for Ordering Early Renewal of ABC-Owned Stations

Share Copy link Facebook X Linkedin Bluesky Email...

30/04/2026

FCC Approves Station Swaps Between Scripps and Gray Media

Share Copy link Facebook X Linkedin Bluesky Email...

30/04/2026

Telestream Introduces Pulse - a Software-Defined Test an...

A flexible monitoring platform designed to simplify ST 2110 operations, consolidate vendor tools, and support modern live production environments. See it at NAB...

30/04/2026

Langlev Takes Run Amok to Sundance with Zeiss Supreme Pri...

Sundance-premiering Run Amok is an expressive, unconventional take on today's teen experience, replete with musical numbers. When cinematographer Shachar ...

30/04/2026

Studio Technologies Elevates Jacksonville State Universit...

JACKSONVILLE, AL, APRIL 29, 2026 Jacksonville State University, known as Jax State and a proud NCAA Division I member of Conference USA, has transformed its a...

30/04/2026

Knowledge Network selects ThinkAnalytics to launch AI-pow...

Transforming viewer and content data into real-time intelligence to deliver relevant streaming experiences at scale ThinkAnalytics, the global leader in AI-pow...

30/04/2026

Sports Production, Delivery is Big Biz at NAB

Sports Production, Delivery is Big Biz at NAB Andy Marken April 29, 2026 0 Comments Hero image source: NAB One of the neat things about trade shows i...

30/04/2026

Former Kerry footballer and broadcaster Dara Cinnide investigates the century-old murder of a Kerry man in new RT documentary

Rian na Fola airs on RT One and RT Player on Monday May 4 Rian na Fola is a o...

30/04/2026

It's Gonna Be May: 16 Games Hit the Cloud This Month, With More NVIDIA GeForce RTX 5080 Power

It's gonna be May - and the cloud's in full festival mode. 16 games ar...

30/04/2026

April 29, 2026

Scripps Research ranks third in 2026 Cure Innovation Index April 29, 2026 LA JOLLA, CA Scripps Research ranked third in the inaugural 2026 Cure Innovation In...

29/04/2026

Churchill Downs Racetrack Lifts Curtain on New Big Board Ahead of Kentucky Derby Week

It was a delicate job in a 150-year-old venue laden with traditions. Begun at th...

29/04/2026

How the MLS Innovation Lab Enhances Everything From Athlete Performance to Production to Ticket Sales

In annual event, the league gives startup companies the opportunity to prove the...

29/04/2026

Save the Date: SVG Venues & Teams Summit Travels South to Miamis Kaseya Center on Aug. 12

Panel discussions, networking, and a facility tour will take place in the renova...

29/04/2026

The Cast and Crew of Conbody VS Everybody Have Each Other's Backs for Life

(L-R) Derek Drescher, Coss Marte, and Syretta Wright have each other's backs. (Micheal Hurcomb/Shutterstock for Sundance Film Festival) By Veronika Lee Cla...

29/04/2026

Free Tilt EQ plug-in from Techivation

Combines EQ and harmonic distortion Techivation's latest release is a simple EQ designed to offer quick control over a source's overall tonal balanc...

29/04/2026

Expressive E launch the Osmose CE

Two new MPE controllers announced Expressive E caused quite a stir when they released the Osmose, making the sort of expression that was once reserved for p...

29/04/2026

iZotope RX 12 is here

New modules & enhanced machine-learning The latest version of iZotope's flagship restoration suite is now available, and now offers over 50 tools design...

29/04/2026

Surgeon Dr Jasmina Kevric wins 2026 Les Murray Award

Surgeon Dr Jasmina Kevric wins 2026 Les Murray Award 29 April, 2026 Media releases Australia for UNHCR and SBS are proud to announce that Dr Jasmina Kevric...

29/04/2026

YEP Spotlight: Julissa Padilla

Some people stumble into their passion. Julissa Padilla walked straight into a film vault. For her, entertainment was never just about the movies themselves. It...

29/04/2026

Clear-Com Promotes Brian Grahn, Ben Turnwell to Expanded Roles

Share Copy link Facebook X Linkedin Bluesky Email...

29/04/2026

The CW Unveils Major Streaming Deals with ESPN and Roku

Share Copy link Facebook X Linkedin Bluesky Email...

29/04/2026

Clear-Com Announces New Roles for Brian Grahn and Ben Tur...

Clear-Com has appointed Brian Grahn as Market Outreach Manager of the Americas and Ben Turnwell as Business Development Manager for EMEA live, expanding their ...

29/04/2026

Reduce Workflow Complexity with nxtedition at MPTS 2026

nxtedition is bringing its range of consolidated production tools to MPTS 2026, with new developments spanning transcription, editing, graphics and AI-assisted ...

29/04/2026

Telxius taps Synamedia to unlock seamless global multi-CD...

Quortex Switch to boost the streaming experience for Telxius customers, reaching millions of viewers worldwide Synamedia and Telxius, the leading global connec...

29/04/2026

freispace and Projective Announce Joint Showcase of Integ...

freispace, the leading ERP-as-a-Service platform for media and entertainment production, and Projective, a leading provider of post-production collaboration tec...

29/04/2026

DHD Celebrates 30th Anniversary at 2026 NAB Show

DHD reports strong interest in its broadcast audio product range, exhibited at the April 19th-22nd NAB Show in Las Vegas. The event attracted a claimed 58,000 a...

29/04/2026

Student Spotlight: Alan Catz

Student Spotlight: Alan Catz The Argentine film and game composer talks about working on League of Legends, receiving Berklee's BMI Award, and the lifelon...

29/04/2026

Jay Jennings Builds the Worlds You Hear on Screen

Jay Jennings Builds the Worlds You Hear on Screen The supervising sound designer behind A Minecraft Movie, The Meg, Letters from Iwo Jima, and dozens of other...

29/04/2026

Paramount Will be 49.5% Foreign Owned After WBD Merger

Share Copy link Facebook X Linkedin Bluesky Email...

29/04/2026

NAB Show 2026: AI, Vertical and BPS Dominate Broadcasters' Discussions

Share Copy link Facebook X Linkedin Bluesky Email...

29/04/2026

IAB Tech Lab Announces New OpenRTB Attributes

Share Copy link Facebook X Linkedin Bluesky Email...

29/04/2026

Meet the eight finalists for RT Today's TV Home Cook competition

Voting opens at 4pm today RT 's Today show have announced the eight finalists for their TV Home Cook competition. Amateur cooks from Cork, Dublin, Galway ...

29/04/2026

VEON and Kyivstar Fulfill Commitment to Invest USD 1 Billion in Ukraine over 2023-2027 Ahead of Schedule

29 Apr 2026 VEON and Kyivstar Fulfill Commitment to Invest USD 1 Billion in Ukr...

29/04/2026

U challenges stars and their dogs to compete for Wagging Rights

Rhod Gilbert, Harriet Kemsley, Kae Kurd, Sara Pascoe and Vicki Pattison to take part in brand new series on free streaming service U London, 29th April 2026: F...

29/04/2026

Katie Price: Nothing to Hide, a Sky Original documentary series from BAFTA-winning Mindhouse, coming this summer

Wednesday 29 April 2026 Katie Price: Nothing to Hide, a Sky Original documentar...