Sony Pixel Power calrec Sony

AI Can Be Leveraged to Simplify, Enhance STT Services

01/07/2020

AI Can Be Leveraged to Simplify, Enhance STT Services

Author:Guy Finley Artificial intelligence (AI) can be used by media and entertainment companies to simplify and enhance all of their subtitling, translation and transcription (STT) services in the cloud, according to M&E technology firm Digital Nirvana.

Digital Nirvana's Russell Wise, SVP of sales and marketing, and Ed Hauber, its business development manager, used the June 24 webinar Leveraging AI for Speed & Efficiency in M&E STT to detail how Trance - the company's enterprise-level, cloud-based closed captioning and translation solution - can simplify the process, as a managed or self-service STT tool.

Bloomberg, Turner and other major media organizations are already using the plug-and-play, AI-powered offering to produce captions at record speed, improving productivity by 50% and more, according to Digital Nirvana. The workflow can be used across the industry, with media, post and caption service providers all able to take advantage.

Trance is a cloud-based, enterprise-level Software-as-a-Service (SaaS) platform that is used to generate automated transcripts, to create closed captions, to translate those captions into alternate languages and also to export captioned files in all known industry-supported formats, Hauber pointed out.

Trance is also fully web-based, he noted, adding: It's accessible via a LAN, WAN or even a basic Internet connection. As an enterprise tool, Trance is fully configurable for an unlimited number of users, groups and roles.

Administrators, meanwhile, can manage multiple projects, they can create manage users, define roles and permissions, as well as establish system presets, he said, while giving viewers a demonstration of Trance.

The Manage Presets section gives users the ability to define caption attributes, such as the number of lines, the line length and the total number of characters, he pointed out during the demo.

To get media into Trance, we have a tool that we use called Media Services Portal and, like Trance, Media Services Portal - also called MSP - [is] a cloud-based platform, which allows users to ingest any number of common audio and video file formats into Trance, he said. MSP can directly integrate with both FTP and Amazon S3, he also noted.

Digital Nirvana also offers an open application programming interface (API) to integrate Media Services Portal directly into large enterprise media systems, he pointed out. Using our API, those operators don't need to create a secondary workflow process to move media into and out of Trance - and this is a really big time-saving and productivity advantage of Trance, he said.

The Trance speech-to-text engine has created a highly-accurate transcript of the media that we just imported, he also showed during the demo, noting that eliminates the necessity of doing the manual transcribing of content and delivers huge productivity gains over conventional transcription methods. It is also highly accurate - between roughly 90 to 95 percent accurate - based on good good-quality content, he noted.

The transcript interface includes text on the right side of the screen and a media player on the left with intuitive controls to play back audio and video, he demonstrated. Also featured are tools that help provide fast text editing, including an auto highlight of potentially misspelled words and spell check, he showed. Users can also create captions in more than one language, he noted.

During the Q&A, he said: Unlike other providers, we're not limited to one specific speech-to-text engine. In fact, we, by design, do not operate that way. We constantly evaluate and measure the performance of all the best speech-to-text engines that exist in the marketplace today. And so, we're not limited to just one. And the reason that that's important is this technology is progressing and developing and advancing very quickly and so being tied to one or the other is inherently limiting. We would rather take the approach of using them all and continually measuring and evaluating them.

So, as an example, if we detect that Engine A' is performing better in scenarios - say where there is sports content, and we can even be more specific: domestic American basketball - we see that speech-to-text Engine A' is performing better in this application, we automatically in the background route that content based on machine learning capability to say we're going to route this client's content through this speech-to-text engine because we see it now as performing better than the other options, he explained.

There is a great degree of accuracy that we can accomplish by using that process, he noted.

Although Trance is currently not a live captioning solution, he was quick to say: It is on our roadmap and it is something that we're actively developing. So, live captioning with the ability to run our speech-to-text engine, to collapse the time of that speech-to-text process down to near real-time, or essentially real-time, giving an operator the ability to make very quick edits within a few seconds of live and be able to do that on the fly. That's something that we're evaluating and we're working towards as the technology matures and there's a degree of reliability and consistency that we can bring to the market that is on the roadmap for sure. Not today - but coming soon.

He went on to point out: We're constantly developing the product . This company really adheres to a philosophy and a down-to-earth principle in being very, very agile. And, as much as this is an enterprise tool, the product operates on a very agile basis, meaning it's able to take and respond to customer requests very, very quickly.

There is a long history at Digital Nirvana of continual development an
LINK: https://digital-nirvana.com/ai-can-be-leveraged-to-simplify-enhance-st...
See more stories from digitalnirvana

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

15/04/2026

BBC World Service TV Selects Open Broadcast Systems IP Decoders for Global Distribution

Open Broadcast Systems has announced that BBC World Service has selected its IP ...

15/04/2026

NAB 2026: LiveU Expands Collaboration with Sony to Include File-Based Workflow Integration

LiveU has announced an expansion of its collaboration with Sony Corporation, add...

15/04/2026

NAB 2026: Ateme and NVIDIA Announce Immersive Video Workflow for Apple Vision Pro

Ateme has announced a collaboration with NVIDIA to support live Apple Immersive ...

15/04/2026

Professional Fighters League Renews Multi-Year Partnership with DAZN DACH

The Professional Fighters League (PFL) has announced a multi-year partnership renewal with DAZN DACH, covering Germany, Switzerland, Austria, Liechtenstein, and...

15/04/2026

NAB 2026: Canon Sets New Benchmark with CINE-SERVO 40-1200m Lens; New Remote Camera Controller Supports Up to 200 Cameras

Canon U.S.A. (NAB Booth C3825) today took the lid off of the CINE-SERVO 40-1200m...

15/04/2026

NAB 2026: Panasonic and NEP Group to Demonstrate KAIROS and NEP Platform Integration

Panasonic Video and Audio Systems North America and NEP Group will demonstrate a...

15/04/2026

Exclusive Wasabi Report: AI Spending Is Surging, But ROI Tells a Different Story

For the fourth year running, independent analysts found businesses across all industries and verticals pay roughly the same amount in fees as they spend on stor...

15/04/2026

NBC Sports to Broadcast The Soccer Tournament Live on NBC, Peacock, and NBCSN, May 30-June 1

The Soccer Tournament (TST) has announced a media rights deal with NBC Sports to...

15/04/2026

NAB 2026: JB&A Announces Exhibitors for Pre-NAB 2026 Technology Event

JB&A will host the Pre-NAB 2026 Technology Event on April 17-18 at Flamingo Las Vegas, ahead of NAB Show. The event features hands-on demonstrations and technic...

15/04/2026

NAB 2026: Sennheiser Group to Exhibit with Spectera and AMBEO Updates

The Sennheiser Group will exhibit at NAB Show 2026 (Booth 4931, Central Hall), with demonstrations from Sennheiser, Neumann, and Merging across three areas: Rel...

15/04/2026

NAB 2026: NAB Show 2026 to Feature Expanded AI, Sports, and Creator Economy Programming

NAB Show 2026 will take place April 18-22 at the Las Vegas Convention Center, wi...

15/04/2026

NAB 2026: AI-Media Launches LEXI Text Encoder and LEXI Voice Encoder

AI-Media has announced the LEXI Text Encoder and LEXI Voice Encoder at NAB Show 2026, the company's first new encoder hardware release in more than a decade...

15/04/2026

NAB 2026: Cartoni Debuts New Camera Support Products

Italian camera support manufacturer Cartoni will introduce several new products at NAB Show 2026 (Booth C6540, Central Hall), including the Master 30 OB fluid h...

15/04/2026

NAB 2026: Lawo and swXtch.io Sign MOU to Explore groundSwXtch Integration

Lawo and swXtch.io have announced a memorandum of understanding at NAB Show 2026, under which Lawo will explore incorporating swXtch.io's groundSwXtch softw...

15/04/2026

NAB 2026: CacheFly to Demonstrate New CDN Features

CacheFly will exhibit at NAB Show 2026 (Booth W3129, April 19-22, Las Vegas Convention Center), showcasing three new additions to its content delivery platform:...

15/04/2026

NAB 2026: Synamedia Launches GO Shorts for Mobile-First Short-Form Video

Synamedia has announced GO Shorts, a new module within its Synamedia Go OTT platform that uses AI to convert an operator's existing content library into a s...

15/04/2026

NAB 2026 Preview, Central Hall: Everything You Need To Know Heading Into the Show

The NAB Show kicks off on Saturday, and the SVG and SVG Europe editorial teams a...

15/04/2026

AJA Video Systems to Acquire Video Encoding Software Company Comprimato

AJA Video Systems has announced an agreement to acquire Comprimato, a live video encoding and processing software company. The deal will unite the two companies...

15/04/2026

NBA Playoffs 2026: Prime Vision, Prime Insights Offer New Data-Driven Experiences for NBA Fans

Prime Video Sports' NBA Playoffs coverage, which includes the entire SoFi NB...

15/04/2026

Top Live-Sound-System Manufacturers Team Up To Better Manage Stadium Noise

Just announced, the SDE standard provides a unified method and file format to ensure consistent and reliably comparable noise predictions Sports and entertainm...

15/04/2026

Spotify Podcast Awards Return to Celebrate Latin America's Most Influential Voices

From immersive storytelling to laugh-out-loud comedies, podcasts are booming in ...

15/04/2026

Spotify Expands Audiobook Features, and Printed Book Sales Go Live in the US and UK

Books have always moved with us, whether tucked in our bags or humming in our he...

15/04/2026

Spotify and NIVA Partner to Support Independent Venues Across the US

For many artists, independent venues are where music careers begin and fan communities take shape. Independent venue operators work hard every day to keep local...

15/04/2026

Spotify Editors Reveal Their Picks for Best Book of the Century (So Far)

From gripping thrillers to poignant memoirs, the 21st century has had no shortage of unforgettable books. To celebrate the standout storytelling of our modern e...

15/04/2026

SonicWorld introduce Telsie T

Vintage broadcast experts release second plug-in Telsie T is the second plug-in to be released by SonicWorld, a German audio company who specialise in servi...

15/04/2026

UAD Explore Free from Universal Audio

Includes eight free UAD plug-ins Universal Audio's latest bundle brings together a selection of their renowned plug-ins and virtual instruments, and is ...

15/04/2026

Maximum uptime for broadcasters: Rohde & Schwarz launches R&SBroadcastShield at NAB 2026

Maximum uptime for broadcasters: Rohde & Schwarz launches R&S BroadcastShield at...

15/04/2026

WIDOW: The Mission Software Defining Rotary Strike

Image courtesy of MD Helicopters...

15/04/2026

L3Harris Announces Billion Dollar Expansion to Boost Solid Rocket Motor Production in Orange County, Virginia

Virginia Gov. Abigail Spanberger, L3Harris VP Mark Farley, and state and local l...

15/04/2026

Advancing America's Space Defense: L3Harris Completes Critical Milestone on Way to Delivering GBOSS Capability to the Warfighter

U.S. Space Forces Ground-Based Optical Sensor System upgrade at the Maui Space S...

15/04/2026

Winter Olympics, Super Bowl Power NBCU-Versant to Gold Medal Performance in Nielsen's February Gauge Reports

NBCU-Versant notches 13.1% of TV viewing in February, its best since August 2024...

15/04/2026

Nielsen CMI shows New Zealand's over-65s are a growing, cashed-up and still-working audience brands can't ignore

New data reveals older Kiwis are financially resilient, loyal to local products,...

15/04/2026

aconnic launches ACCEED 4430 10 Gigabit system for high volume enterprise and business service aggregation

aconnic AG (ISIN: DE000A0LBKW6), Munich, announces the market launch of the ACCE...

15/04/2026

Autocue to Mark 2026 NAB Show Debut of its New PTZ Prompter

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Locality Deploys Nielsen's Media Data Engine

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Viant Announces Agreement to Acquire TVision

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Evergent introduces Agentic Revenue Orchestration Platfor...

Evergent introduces its Agentic Revenue Orchestration Platform, transforming how subscription businesses across direct-to-consumer streaming, pay-TV, telecommun...

15/04/2026

CentralCast Delivers Breakthrough Efficiencies to Public...

Harmonic's XOS Media Processor Delivers Exceptional Video Quality to More than Half of U.S. Public Media Viewership Harmonic (NASDAQ: HLIT) today announce...

15/04/2026

DPA N Series Wireless System Unlocks Duplex Gap and Guard...

LONGMONT, COLORADO, APRIL 15, 2026 DPA Microphones N Series Digital Wireless System users in North America can now take full advantage of the system's exc...

15/04/2026

Cobalt Iron Launches Compass Tape Gateway Modernizing IBM...

Cobalt Iron, a leading provider of SaaS-based enterprise data protection, today announced the launch of Compass Tape Gateway (CTG), a transformative enhancemen...

15/04/2026

Disguise to Showcase Cutting-Edge Experience Tech for Sports, Broadcast and More at NAB 2026

Disguise to Showcase Cutting-Edge Experience Tech for Sports, Broadcast and More...

15/04/2026

Arooj Aftab Makes the Music She Wants to Hear

Arooj Aftab Makes the Music She Wants to Hear The singular artist explores the juxtaposition of grief and joy, dark and light, in her distinctive sound. Apri...

15/04/2026

Panasonic, NEP Partner on IP-Based Live Production

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Encompass Digital Media Powers Global Cloud Transformatio...

Interra Systems, a provider of end-to-end quality assurance solutions for the digital media industry, is proud to announce its central role in the digital trans...