Sony Pixel Power calrec Sony

AI Can Be Leveraged to Simplify, Enhance STT Services

01/07/2020

AI Can Be Leveraged to Simplify, Enhance STT Services

Author:Guy Finley Artificial intelligence (AI) can be used by media and entertainment companies to simplify and enhance all of their subtitling, translation and transcription (STT) services in the cloud, according to M&E technology firm Digital Nirvana.

Digital Nirvana's Russell Wise, SVP of sales and marketing, and Ed Hauber, its business development manager, used the June 24 webinar Leveraging AI for Speed & Efficiency in M&E STT to detail how Trance - the company's enterprise-level, cloud-based closed captioning and translation solution - can simplify the process, as a managed or self-service STT tool.

Bloomberg, Turner and other major media organizations are already using the plug-and-play, AI-powered offering to produce captions at record speed, improving productivity by 50% and more, according to Digital Nirvana. The workflow can be used across the industry, with media, post and caption service providers all able to take advantage.

Trance is a cloud-based, enterprise-level Software-as-a-Service (SaaS) platform that is used to generate automated transcripts, to create closed captions, to translate those captions into alternate languages and also to export captioned files in all known industry-supported formats, Hauber pointed out.

Trance is also fully web-based, he noted, adding: It's accessible via a LAN, WAN or even a basic Internet connection. As an enterprise tool, Trance is fully configurable for an unlimited number of users, groups and roles.

Administrators, meanwhile, can manage multiple projects, they can create manage users, define roles and permissions, as well as establish system presets, he said, while giving viewers a demonstration of Trance.

The Manage Presets section gives users the ability to define caption attributes, such as the number of lines, the line length and the total number of characters, he pointed out during the demo.

To get media into Trance, we have a tool that we use called Media Services Portal and, like Trance, Media Services Portal - also called MSP - [is] a cloud-based platform, which allows users to ingest any number of common audio and video file formats into Trance, he said. MSP can directly integrate with both FTP and Amazon S3, he also noted.

Digital Nirvana also offers an open application programming interface (API) to integrate Media Services Portal directly into large enterprise media systems, he pointed out. Using our API, those operators don't need to create a secondary workflow process to move media into and out of Trance - and this is a really big time-saving and productivity advantage of Trance, he said.

The Trance speech-to-text engine has created a highly-accurate transcript of the media that we just imported, he also showed during the demo, noting that eliminates the necessity of doing the manual transcribing of content and delivers huge productivity gains over conventional transcription methods. It is also highly accurate - between roughly 90 to 95 percent accurate - based on good good-quality content, he noted.

The transcript interface includes text on the right side of the screen and a media player on the left with intuitive controls to play back audio and video, he demonstrated. Also featured are tools that help provide fast text editing, including an auto highlight of potentially misspelled words and spell check, he showed. Users can also create captions in more than one language, he noted.

During the Q&A, he said: Unlike other providers, we're not limited to one specific speech-to-text engine. In fact, we, by design, do not operate that way. We constantly evaluate and measure the performance of all the best speech-to-text engines that exist in the marketplace today. And so, we're not limited to just one. And the reason that that's important is this technology is progressing and developing and advancing very quickly and so being tied to one or the other is inherently limiting. We would rather take the approach of using them all and continually measuring and evaluating them.

So, as an example, if we detect that Engine A' is performing better in scenarios - say where there is sports content, and we can even be more specific: domestic American basketball - we see that speech-to-text Engine A' is performing better in this application, we automatically in the background route that content based on machine learning capability to say we're going to route this client's content through this speech-to-text engine because we see it now as performing better than the other options, he explained.

There is a great degree of accuracy that we can accomplish by using that process, he noted.

Although Trance is currently not a live captioning solution, he was quick to say: It is on our roadmap and it is something that we're actively developing. So, live captioning with the ability to run our speech-to-text engine, to collapse the time of that speech-to-text process down to near real-time, or essentially real-time, giving an operator the ability to make very quick edits within a few seconds of live and be able to do that on the fly. That's something that we're evaluating and we're working towards as the technology matures and there's a degree of reliability and consistency that we can bring to the market that is on the roadmap for sure. Not today - but coming soon.

He went on to point out: We're constantly developing the product . This company really adheres to a philosophy and a down-to-earth principle in being very, very agile. And, as much as this is an enterprise tool, the product operates on a very agile basis, meaning it's able to take and respond to customer requests very, very quickly.

There is a long history at Digital Nirvana of continual development an
LINK: https://digital-nirvana.com/ai-can-be-leveraged-to-simplify-enhance-st...
See more stories from digitalnirvana

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

15/06/2026

University of South Carolina's Valerie Gerfin on Gamecock Productions' Growth, Upgrades at Williams-Brice Stadium

One of the more exciting internal video production divisions within a college at...

15/06/2026

Fox Corp. To Acquire Roku, Pairs Live Sports Powerhouse With Major CTV Platform

The deal valued at $22 Billion is expected to close in the first half of 2027...

15/06/2026

Golf Channel Mobile to Live Stream 2026 Arnold Palmer Cup Beginning July 13th

Golf Channel and the Arnold Palmer Cup have announced a partnership to livestream the 2026 Arnold Palmer Cup on Golf Channel Mobile and GolfChannel.com. The tou...

15/06/2026

TikTok and Panini Launch Digital Collectible Card Experience for FIFA World Cup 2026

TikTok and Panini have announced a partnership to bring a digital collectible ca...

15/06/2026

Cosm and Monster Energy Launch First Full-Dome Immersive Advertisement in Shared Reality Venues

Cosm and Monster Energy have announced the debut of the first full-dome immersiv...

15/06/2026

Fox Nation and Real American Freestyle Sign International Media Rights Deal

Real American Freestyle (RAF) and Fox Nation have announced an exclusive streaming agreement for three RAF international events, beginning with RAF Georgia on J...

15/06/2026

FanConnect and Extreme Networks Announce IPTV Integration for Large Venue Deployments

FanConnect has announced a partnership with Extreme Networks integrating FanConn...

15/06/2026

2026 Sundance Institute Ignite x Adobe Fellows Named

Ten Emerging Filmmakers Ages 18 to 25 Will Start Fellowship Year at Ignite Lab from June 14-19 LOS ANGELES, CA, June 15, 2026 - The nonprofit Sundance Institut...

15/06/2026

Rumble from UVI

Innovative three-band soft synth introduced UVI's latest synth takes an interesting approach to synthesis, offering a trio of synth engines that each op...

15/06/2026

Oram Awards 2026: Open call announcement

Applications now open for 2026 The Oram Awards have returned for 2026 to celebrate the unusual, unique and unfiltered creative worlds of women and gender-di...

15/06/2026

PSPaudioware release PSP Levelizer

New intelligent auto-fader plug-in revealed PSPaudioware's latest release offers automatic level adjustment and provides more detailed control than many...

15/06/2026

4.78M AUSSIES TUNE IN FOR SOCCEROOS WIN OVER TRKYE ON SBS

4.78M AUSSIES TUNE IN FOR SOCCEROOS WIN OVER T RK YE ON SBS 15 June, 2026 Media releases Match had a Total TV average audience of 3.035 million, with over ...

15/06/2026

SBS Head of Commissioning John Godfrey to depart after 18 years

SBS Head of Commissioning John Godfrey to depart after 18 years 15 June, 2026 Media releases SBS Head of Commissioning John Godfrey will depart the broadca...

15/06/2026

Greater Manchester Police installs Rohde & Schwarz security scanner for custody searches

Greater Manchester Police installs Rohde & Schwarz security scanner for custody ...

15/06/2026

The New Discovery Stack: AI, Metadata and Audience Intelligence

Insights from NAGRAVISION's latest industry webinar featuring One Hungary, Liberty Global and Media Press Group In this blog, Laura Rognoni explores the k...

15/06/2026

Clear-Com Introduces Avalon IP Intercom Platform

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

DoJ Approves Paramount Skydance, Warner Bros. Discovery Merger

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Clear-Com Introduces Avalon IP Station for Modern Communi...

Clear-Com has introduced Avalon , a purpose built 1RU IP intercom communication platform for modern networked production, designed to simplify and scale workfl...

15/06/2026

Fox Makes CTV Play with Roku Acquisition

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Gray Announces Plans to Expand Lansing, Mich. Broadcast HQ

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Richmond Flying Squirrels Raise the Bar for Live Baseball...

MiLB Club Deploys LDX 110 Cameras at CarMax Park to Deliver A New Standard in Engaging Fan Experience Grass Valley today announced that the Richmond Flying Sq...

15/06/2026

Detach from Direct-Attached: How Remote Editing with EVO Keeps Creative Teams Moving

Detach from Direct-Attached: How Remote Editing with EVO Keeps Creative Teams Mo...

15/06/2026

Techtel Completes Media Production Setup for a major AFL sporting organisation

Techtel Completes Media Production Setup for a major AFL sporting organisation Sports 15 June Written By Suzanne Costello (Sydney, Australia 15 June 2026)...

15/06/2026

Sky News takes viewers inside Minab in new film investigating primary school strike in Iran

Monday 15 June 2026 Sky News takes viewers inside Minab in new film investigati...

15/06/2026

Fox Corporation to Acquire Roku, Inc.

Fox Corporation to Acquire Roku, Inc. Combination Creates a Scaled Media and Technology Platform with Superior Reach, Engagement and Monetization Capability ...

14/06/2026

Detroit Drums from Iconic Instruments

Library captures 1960s R&B/pop drum sound Following on from their recent wave of plug-in effects, Iconic Instruments have just launched an all-new virtual d...

14/06/2026

HBO Comedy Rooster Shot with URSA Cine 17K 65

HBO Comedy Rooster Shot with URSA Cine 17K 65 Brie Clayton June 14, 2026 0 Comments Large format brings viewers intimately close to characters. Black...

13/06/2026

Rhythmic Filters for Devious Machines' Infiltrator

Latest expansion pack includes 252 presets Devious Machines have recently introduced another expansion for their powerful multi-effects plug-in, Infiltrator...

13/06/2026

MetaGrid Pro gains AI Builder

Create custom DAW/plug-in controllers using prompts MetaGrid have recently introduced an all-new AI Builder function to their touchscreen-based control surf...

13/06/2026

Spectrum Reach Taps Anoki AI for Contextual Intelligence

Share Copy link Facebook X Linkedin Bluesky Email...

13/06/2026

Google TV Launches Soccer Hub, New Voice Command Features

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

YES Network and Gotham Sports App to Air Seven Athletes Unlimited Softball League Games

YES Network and The Gotham Sports App will air seven Athletes Unlimited Softball...

12/06/2026

UFL to Feature FAST Innovation Suite at 2026 United Bowl

The United Football League will host its FAST Innovation Suite at the 2026 United Bowl presented by Credit One Bank on Saturday, June 13 at 3:00 p.m. ET at Audi...

12/06/2026

InfoComm 2026: PTZOptics and LayerJot to Demo AI-Driven Camera Control

PTZOptics and LayerJot will present live demonstrations at InfoComm 2026 showing how natural-language AI prompting, robotic camera control, and on-device comput...

12/06/2026

InfoComm 2026: MultiDyne to Debut VF-9100 Fiber Transport Platform and Crescendo Audio Monitor

MultiDyne Video and Fiber Optic Systems will exhibit at InfoComm 2026, featuring...

12/06/2026

Eurovision Services Deploys Ateme Software-Based Frame-Rate Conversion

Ateme has announced that Eurovision Services is using Ateme's software-based frame-rate conversion technology for international live event workflows. The de...

12/06/2026

Bitmovin, Simplestream, and Xperi Partner to Support OTT Services on TiVo OS

Bitmovin and Simplestream have announced a partnership with Xperi to simplify the launch of OTT streaming services on TiVo OS smart TVs and devices. The collabo...

12/06/2026

Net Insight Deploys Nimbra 520 and Nimbra Edge for Multinational Corporate Live Production Workflow

Net Insight has announced that a multinational technology company is deploying a...

12/06/2026

MLB Players Inc., Athletes First Announce Content Partnership

MLB Players Inc., the business arm of the MLB Players Association, has announced a partnership with Athletes First to develop and sell brand partnerships across...

12/06/2026

G&D and VuWall Announce CommandKeyboard-Advanced for Network-Independent Control Room Operations

Guntermann and Drunck (G&D) and VuWall have announced the CommandKeyboard-Advanc...

12/06/2026

Philadelphia Union and Comcast Deploy Smart Technology at Subaru Park and WSFS Bank Sportsplex

Comcast Smart Solutions announces a new smart technology deployment with Major L...

12/06/2026

Elevation Worship Completes First Leg of 2026 Tour Using SSL Live Consoles and New UMD192 Interface

Elevation Worship completed the initial leg of its Elevation Nights 2026 tour ...

12/06/2026

AJA Announces KONA IP25 Integration with Colorfront Transkoder and On-Set Dailies

AJA Video Systems has announced KONA IP25 support for Colorfront Transkoder and ...

12/06/2026

InfoComm 2026: Audinate To Exhibit With New AVIO Install Adapters and Iris Camera Control Platform

Audinate Group Limited (ASX: AD8) will exhibit at InfoComm 2026 (Booth C7321, Ce...

12/06/2026

Pac-12 Appoints Scott Adametz as Chief Technology Officer

Pac-12 Commissioner Teresa Gould has announced the appointment of Scott Adametz as Chief Technology Officer. The Pac-12 describes the hire as the first CTO appo...

12/06/2026

InfoComm 2026: Grass Valley Introduces AMPP Edge Live for Enterprise Production

Grass Valley has announced AMPP Edge Live, a production system combining Grass Valley hardware, NVIDIA Blackwell GPU acceleration, and AMPP OS in a single platf...