
AI Can Be Leveraged to Simplify, Enhance STT Services
Author:Guy Finley Artificial intelligence (AI) can be used by media and entertainment companies to simplify and enhance all of their subtitling, translation and transcription (STT) services in the cloud, according to M&E technology firm Digital Nirvana.
Digital Nirvana's Russell Wise, SVP of sales and marketing, and Ed Hauber, its business development manager, used the June 24 webinar Leveraging AI for Speed & Efficiency in M&E STT to detail how Trance - the company's enterprise-level, cloud-based closed captioning and translation solution - can simplify the process, as a managed or self-service STT tool.
Bloomberg, Turner and other major media organizations are already using the plug-and-play, AI-powered offering to produce captions at record speed, improving productivity by 50% and more, according to Digital Nirvana. The workflow can be used across the industry, with media, post and caption service providers all able to take advantage.
Trance is a cloud-based, enterprise-level Software-as-a-Service (SaaS) platform that is used to generate automated transcripts, to create closed captions, to translate those captions into alternate languages and also to export captioned files in all known industry-supported formats, Hauber pointed out.
Trance is also fully web-based, he noted, adding: It's accessible via a LAN, WAN or even a basic Internet connection. As an enterprise tool, Trance is fully configurable for an unlimited number of users, groups and roles.
Administrators, meanwhile, can manage multiple projects, they can create manage users, define roles and permissions, as well as establish system presets, he said, while giving viewers a demonstration of Trance.
The Manage Presets section gives users the ability to define caption attributes, such as the number of lines, the line length and the total number of characters, he pointed out during the demo.
To get media into Trance, we have a tool that we use called Media Services Portal and, like Trance, Media Services Portal - also called MSP - [is] a cloud-based platform, which allows users to ingest any number of common audio and video file formats into Trance, he said. MSP can directly integrate with both FTP and Amazon S3, he also noted.
Digital Nirvana also offers an open application programming interface (API) to integrate Media Services Portal directly into large enterprise media systems, he pointed out. Using our API, those operators don't need to create a secondary workflow process to move media into and out of Trance - and this is a really big time-saving and productivity advantage of Trance, he said.
The Trance speech-to-text engine has created a highly-accurate transcript of the media that we just imported, he also showed during the demo, noting that eliminates the necessity of doing the manual transcribing of content and delivers huge productivity gains over conventional transcription methods. It is also highly accurate - between roughly 90 to 95 percent accurate - based on good good-quality content, he noted.
The transcript interface includes text on the right side of the screen and a media player on the left with intuitive controls to play back audio and video, he demonstrated. Also featured are tools that help provide fast text editing, including an auto highlight of potentially misspelled words and spell check, he showed. Users can also create captions in more than one language, he noted.
During the Q&A, he said: Unlike other providers, we're not limited to one specific speech-to-text engine. In fact, we, by design, do not operate that way. We constantly evaluate and measure the performance of all the best speech-to-text engines that exist in the marketplace today. And so, we're not limited to just one. And the reason that that's important is this technology is progressing and developing and advancing very quickly and so being tied to one or the other is inherently limiting. We would rather take the approach of using them all and continually measuring and evaluating them.
So, as an example, if we detect that Engine A' is performing better in scenarios - say where there is sports content, and we can even be more specific: domestic American basketball - we see that speech-to-text Engine A' is performing better in this application, we automatically in the background route that content based on machine learning capability to say we're going to route this client's content through this speech-to-text engine because we see it now as performing better than the other options, he explained.
There is a great degree of accuracy that we can accomplish by using that process, he noted.
Although Trance is currently not a live captioning solution, he was quick to say: It is on our roadmap and it is something that we're actively developing. So, live captioning with the ability to run our speech-to-text engine, to collapse the time of that speech-to-text process down to near real-time, or essentially real-time, giving an operator the ability to make very quick edits within a few seconds of live and be able to do that on the fly. That's something that we're evaluating and we're working towards as the technology matures and there's a degree of reliability and consistency that we can bring to the market that is on the roadmap for sure. Not today - but coming soon.
He went on to point out: We're constantly developing the product . This company really adheres to a philosophy and a down-to-earth principle in being very, very agile. And, as much as this is an enterprise tool, the product operates on a very agile basis, meaning it's able to take and respond to customer requests very, very quickly.
There is a long history at Digital Nirvana of continual development an
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
26/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/03/2026
Nevion introduces powerful new Panel Builder to enhance VideoIPath broadcast con...
26/03/2026
2026 Oscar Nominated Films Powered by Blackmagic Design
Brie Clayton March 25, 2026
0 Comments
DaVinci Resolve Studio used on 27 of this year's no...
26/03/2026
Leader to present full suite of advanced Test & Measurement solutions at NAB Sho...
26/03/2026
Boston Conservatory to Present New England and Collegiate Premiere of Groundbrea...
26/03/2026
Wayne, N.J., March 26, 2026 Phantom High-Speed announces the latest product li...
25/03/2026
Live match directors Sarah Cheadle (Sky Sports), Rob Levi (TNT Sports), and Andrew Swift (BBC Sport) sit down with the Premier League's Rachel Nightingale t...
25/03/2026
The senior from Upstate New York is manning the mic while also interning for the athletic department's sports-information team
In the live-sports-video ind...
25/03/2026
Synamedia has announced ContentArmor Edge Watermarking, a server-side solution t...
25/03/2026
SES has announced meoSphere, a medium Earth orbit (MEO) satellite network targeted for operation by 2030. The first phase will pair SES-developed software-defin...
25/03/2026
TVU Networks is working with Reuters on a phased migration from satellite to a c...
25/03/2026
Nielsen has announced three senior appointments. Seth Ladetsky has been named Head of Global Sports. Trevor Fellows will lead Nielsen's advertiser and agenc...
25/03/2026
Anoki and Amagi have launched In-Scene Ads powered by Anoki ContextIQ across Amagi's portfolio of in-content ad formats for Free Ad-supported Streaming TV (...
25/03/2026
Arkona Technologies will announce a series of enhancements to its BLADE//runner platform at NAB 2026 (Booth C.1808). The updates focus on usability and workflow...
25/03/2026
Daktronics has installed two tower displays and a video wall in the Lexus Club at Petco Park in San Diego ahead of the 2026 season.
Continuing to improve the ...
25/03/2026
MultiDyne Video & Fiber Optic Systems is celebrating its 50th anniversary as NAB Show 2026 approaches. The company was founded in 1976 by Vincent Jachetta, an N...
25/03/2026
IPC, a provider of integrated communication solutions, will make its NAB 2026 de...
25/03/2026
Live production categories were led by NBC, FOX, and ESPN's NFL coverage...
25/03/2026
The Atlanta Braves and Spectrum have announced a multiyear distribution agreemen...
25/03/2026
(L-R) Charlie Tyrell and Daniel Roher attend The AI Doc: Or How I Became An Apocaloptimist Premiere during the 2026 Sundance Film Festival at The Ray Theatre ...
25/03/2026
Directed By, Spotify's documentary-style series that pulls back the curtain ...
25/03/2026
BTS is so back., This week, the global pop superstars took the stage at New York City's Pier 17 for their first U.S. performance in four years.
Part of Spo...
25/03/2026
How you listen can shape what you hear. That's the idea behind the new Spotify Listening Lounge, an acoustic space at our London headquarters purpose-built ...
25/03/2026
Tape effects taken to the extreme
The latest release from New York-based developer Iconic Instruments is said to accurately recreate the saturation and comp...
25/03/2026
Launched alongside new Vocal Phrases bundle
Sonuscore's latest release has been designed specifically for composers working on fantasy TV, film and game...
25/03/2026
Latest update now live
The latest version of Steinberg's post-production-focused DAW has just arrived, and comes packed with new dialogue editing, sound...
25/03/2026
Rohde & Schwarz joins FormFactor's MeasureOne partner program FormFactor and Rohde & Schwarz advance their partnership for on-wafer RF component character...
25/03/2026
L3Harris Technologies and RFTEQ Pty Ltd signed a memorandum of understanding to ...
25/03/2026
L3Harris delivers combat-ready Torpedo Tube Launch and Recovery system, which deploys and retrieves Iver4 900 autonomous underwater vehicles through submarine t...
25/03/2026
The company expands leadership team under Chief Revenue Officer Amilcar Perez
S...
25/03/2026
Winter Olympic Games Opening Ceremony features in top 10 programmes of the month...
25/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
25/03/2026
Providing wide view timing visibility across the entire production chain...
25/03/2026
Continuing development drives advances in security, availability, access and connectivity...
25/03/2026
Caudalie, the renowned French cosmetics brand, has unveiled a state-of-the-art 200-seat auditorium at its new headquarters in the historic Marais district of ce...
25/03/2026
Telestream, a global leader in media workflow technologies, today announced expanded integration with Adobe Premiere, Adobe Media Encoder (AME), and Frame.io, d...
25/03/2026
Marshall Electronics is expanding its lineup of high-performance POV cameras designed for broadcast, live production and professional AV applications with the d...
25/03/2026
OOONA, a global provider of professional management and production tools for the media localization industry, announced today that it has been awarded the TPN G...
25/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
25/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
25/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
25/03/2026
SipRadius, specialists in secure, low-latency media transport, will drive innovation and interoperability still further with the launch of the SipMX Alliance at...