
AI Can Be Leveraged to Simplify, Enhance STT Services
Author:Guy Finley Artificial intelligence (AI) can be used by media and entertainment companies to simplify and enhance all of their subtitling, translation and transcription (STT) services in the cloud, according to M&E technology firm Digital Nirvana.
Digital Nirvana's Russell Wise, SVP of sales and marketing, and Ed Hauber, its business development manager, used the June 24 webinar Leveraging AI for Speed & Efficiency in M&E STT to detail how Trance - the company's enterprise-level, cloud-based closed captioning and translation solution - can simplify the process, as a managed or self-service STT tool.
Bloomberg, Turner and other major media organizations are already using the plug-and-play, AI-powered offering to produce captions at record speed, improving productivity by 50% and more, according to Digital Nirvana. The workflow can be used across the industry, with media, post and caption service providers all able to take advantage.
Trance is a cloud-based, enterprise-level Software-as-a-Service (SaaS) platform that is used to generate automated transcripts, to create closed captions, to translate those captions into alternate languages and also to export captioned files in all known industry-supported formats, Hauber pointed out.
Trance is also fully web-based, he noted, adding: It's accessible via a LAN, WAN or even a basic Internet connection. As an enterprise tool, Trance is fully configurable for an unlimited number of users, groups and roles.
Administrators, meanwhile, can manage multiple projects, they can create manage users, define roles and permissions, as well as establish system presets, he said, while giving viewers a demonstration of Trance.
The Manage Presets section gives users the ability to define caption attributes, such as the number of lines, the line length and the total number of characters, he pointed out during the demo.
To get media into Trance, we have a tool that we use called Media Services Portal and, like Trance, Media Services Portal - also called MSP - [is] a cloud-based platform, which allows users to ingest any number of common audio and video file formats into Trance, he said. MSP can directly integrate with both FTP and Amazon S3, he also noted.
Digital Nirvana also offers an open application programming interface (API) to integrate Media Services Portal directly into large enterprise media systems, he pointed out. Using our API, those operators don't need to create a secondary workflow process to move media into and out of Trance - and this is a really big time-saving and productivity advantage of Trance, he said.
The Trance speech-to-text engine has created a highly-accurate transcript of the media that we just imported, he also showed during the demo, noting that eliminates the necessity of doing the manual transcribing of content and delivers huge productivity gains over conventional transcription methods. It is also highly accurate - between roughly 90 to 95 percent accurate - based on good good-quality content, he noted.
The transcript interface includes text on the right side of the screen and a media player on the left with intuitive controls to play back audio and video, he demonstrated. Also featured are tools that help provide fast text editing, including an auto highlight of potentially misspelled words and spell check, he showed. Users can also create captions in more than one language, he noted.
During the Q&A, he said: Unlike other providers, we're not limited to one specific speech-to-text engine. In fact, we, by design, do not operate that way. We constantly evaluate and measure the performance of all the best speech-to-text engines that exist in the marketplace today. And so, we're not limited to just one. And the reason that that's important is this technology is progressing and developing and advancing very quickly and so being tied to one or the other is inherently limiting. We would rather take the approach of using them all and continually measuring and evaluating them.
So, as an example, if we detect that Engine A' is performing better in scenarios - say where there is sports content, and we can even be more specific: domestic American basketball - we see that speech-to-text Engine A' is performing better in this application, we automatically in the background route that content based on machine learning capability to say we're going to route this client's content through this speech-to-text engine because we see it now as performing better than the other options, he explained.
There is a great degree of accuracy that we can accomplish by using that process, he noted.
Although Trance is currently not a live captioning solution, he was quick to say: It is on our roadmap and it is something that we're actively developing. So, live captioning with the ability to run our speech-to-text engine, to collapse the time of that speech-to-text process down to near real-time, or essentially real-time, giving an operator the ability to make very quick edits within a few seconds of live and be able to do that on the fly. That's something that we're evaluating and we're working towards as the technology matures and there's a degree of reliability and consistency that we can bring to the market that is on the roadmap for sure. Not today - but coming soon.
He went on to point out: We're constantly developing the product . This company really adheres to a philosophy and a down-to-earth principle in being very, very agile. And, as much as this is an enterprise tool, the product operates on a very agile basis, meaning it's able to take and respond to customer requests very, very quickly.
There is a long history at Digital Nirvana of continual development an
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
06/09/2026
June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
22/06/2026
Behind The Mic provides a roundup of recent news regarding on-air talent, includ...
22/06/2026
Cosm has announced the appointment of David Ho as Chief Legal Officer, a newly created executive role reporting to President and CEO Jeb Terry. Ho will oversee ...
22/06/2026
Warner Bros. Discovery and Amazon Web Services (AWS) have announced the developm...
22/06/2026
Daktronics has completed an audio control system upgrade at Petco Park in San Di...
22/06/2026
Accelerate Media has named John Willi as President and announced the launch of the Accelerate Sports Network (ASN), a prep sports media and streaming platform c...
22/06/2026
All Women's Sports Network (AWSN) and 3XBA (3 3 Basketball Association) have announced live television coverage of the annual 3XBA tournament on Friday, Jun...
22/06/2026
OWL AI has announced the appointment of Jay Prasad as Chief Executive Officer and member of the Board of Directors. Prasad succeeds Josh Gwyther, who has served...
22/06/2026
CP Communications delivered RF video and audio support for TNT's Inside the NBA at the 2026 NBA Finals, providing main show coverage in San Antonio and ea...
22/06/2026
Polymarket has announced a partnership with GRID, an official esports data platf...
22/06/2026
As sports venues continue to evolve into more video-centric, fan-engagement-driv...
22/06/2026
As the regional sports production scene shifts toward streaming, this Texan helps lead the engineering behind Victory+'s growing live platform...
22/06/2026
By Kristin Feeley, Director, Documentary Film & Artist Programs
the memories of your elders [are] a scaffolding for you to build your identity on - and t...
22/06/2026
New hyper-resolution analyser EQ revealed
CEDAR Audio's all-new Icons plug-in series has just gained its newest member, Blade. Described by the compan...
22/06/2026
Turn any live input into a cinematic soundscape
Designed for use in the studio and on stage, Sampleson's latest creation is capable of taking any audio ...
22/06/2026
Adds guitar strings to Eurorack rigs
ADDAC System are renowned for their weird and wonderful synth designs, and their line-up includes plenty of gear that...
22/06/2026
FIFA World Cup 2026 fever grows, as more than one third of Australians tune in ...
22/06/2026
In our latest blog, Tim Pearson explores NAGRA Venturi, the new streaming security solution for the AI era from NAGRAVISION. Designed to aggregate and analyze ...
22/06/2026
Expanded integrations give advertisers access to distinct contextual signals acr...
22/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
22/06/2026
Xilica today announced the release of Dynamic Voice Lift, a new feature in Xilica Designer v4.12 that brings adaptive speech reinforcement to large meeting spac...
22/06/2026
Monday 22 June 2026
Official trailer released for Katie Price: Nothing to Hide,...
22/06/2026
The next era of AI will not be defined by compute alone. Its growth will be dete...
22/06/2026
Mission, Vision and Veritas - new Los Alamos National Laboratory (LANL) supercom...
22/06/2026
At the ISC conference running in Hamburg this week, NVIDIA is introducing new so...
22/06/2026
For the past two years, the U.S. National Science Foundation's National Arti...
22/06/2026
JUPITER, Europe's first exascale supercomputer at Germany's Forschungszentrum J lich, runs on NVIDIA Grace Hopper Superchips and NVIDIA Quantum-X800 Inf...
21/06/2026
To call the 2026 FIFA World Cup a big undertaking would be a big understatement....
21/06/2026
New series now live on Udemy
Regular SOS contributor and Cubase workshop columnist John Walden has just released a new Cubase video course that is now avail...
21/06/2026
Hot tubs sit at about 38 to 40 degrees Celsius, warm enough that most people can only soak for about 15 minutes. NVIDIA's newest AI servers can run their co...
21/06/2026
Sunday 21 June 2026
Sky announces immersive documentary series The Wargame
The Wargame first looks
ZIP (2MB)
Sky today confirms the commission of The Wargam...
20/06/2026
New add-on creates doubles & vocal stacks
IK Multimedia's latest ReSing add-on kits the innovative software out with the ability to automatically genera...
20/06/2026
What exactly is Apogee Control V3?
Control V3 is a new mixer application that controls Apogee interfaces. The new hit feature is that V3 finally allows for...
19/06/2026
Split compound eases operational challenges at Shinnecock Hills Golf Club...
19/06/2026
North Carolina, Oklahoma meet in the best-of-three Finals as ESPN leans into spe...
19/06/2026
Company launch comprehensive mix-comparison tool
The Him DSP are a plug-in company founded by The Him, an EDM DJ and producer who has amassed over half a bi...
19/06/2026
Major Sampler upgrades introduced
The latest version of Bitwig's DAW software has just entered public beta testing, and is available now for all users w...
19/06/2026
Four times the power of their predecessors
Akai Pro have just introduced upgraded versions of two of their popular standalone MPC systems, kitting them out ...
19/06/2026
Data from May shows seasonal outdoor trends triggers lower viewing
Warsaw, Pola...
19/06/2026
Buttons is best control system in the rAVe Best of Infocomm Awards 2026...
19/06/2026
Mavis Studio Makes iPad Production More Powerful
Brie Clayton June 19, 2026
0 Comments
InfoComm update brings new NDI Preview, PTZ control, USB audio ...
19/06/2026
Immersive Studio Metaverse Stage Tackles Post with Blackmagic Design
Brie Clayton June 19, 2026
0 Comments
New narrative projects rely on DaVinci Reso...
19/06/2026
How to Run the Original 1993 After Effects
Graham Quince June 19, 2026
0 Comments
How to the original After Effects v1 in an emulator, and you don'...
19/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
19/06/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
19/06/2026
SMPTE , the home of media professionals, technologists and engineers, has announced that its entire Standards catalog is now freely available to the global medi...