Sony Pixel Power calrec Sony

AI Can Be Leveraged to Simplify, Enhance STT Services

01/07/2020

AI Can Be Leveraged to Simplify, Enhance STT Services

Author:Guy Finley Artificial intelligence (AI) can be used by media and entertainment companies to simplify and enhance all of their subtitling, translation and transcription (STT) services in the cloud, according to M&E technology firm Digital Nirvana.

Digital Nirvana's Russell Wise, SVP of sales and marketing, and Ed Hauber, its business development manager, used the June 24 webinar Leveraging AI for Speed & Efficiency in M&E STT to detail how Trance - the company's enterprise-level, cloud-based closed captioning and translation solution - can simplify the process, as a managed or self-service STT tool.

Bloomberg, Turner and other major media organizations are already using the plug-and-play, AI-powered offering to produce captions at record speed, improving productivity by 50% and more, according to Digital Nirvana. The workflow can be used across the industry, with media, post and caption service providers all able to take advantage.

Trance is a cloud-based, enterprise-level Software-as-a-Service (SaaS) platform that is used to generate automated transcripts, to create closed captions, to translate those captions into alternate languages and also to export captioned files in all known industry-supported formats, Hauber pointed out.

Trance is also fully web-based, he noted, adding: It's accessible via a LAN, WAN or even a basic Internet connection. As an enterprise tool, Trance is fully configurable for an unlimited number of users, groups and roles.

Administrators, meanwhile, can manage multiple projects, they can create manage users, define roles and permissions, as well as establish system presets, he said, while giving viewers a demonstration of Trance.

The Manage Presets section gives users the ability to define caption attributes, such as the number of lines, the line length and the total number of characters, he pointed out during the demo.

To get media into Trance, we have a tool that we use called Media Services Portal and, like Trance, Media Services Portal - also called MSP - [is] a cloud-based platform, which allows users to ingest any number of common audio and video file formats into Trance, he said. MSP can directly integrate with both FTP and Amazon S3, he also noted.

Digital Nirvana also offers an open application programming interface (API) to integrate Media Services Portal directly into large enterprise media systems, he pointed out. Using our API, those operators don't need to create a secondary workflow process to move media into and out of Trance - and this is a really big time-saving and productivity advantage of Trance, he said.

The Trance speech-to-text engine has created a highly-accurate transcript of the media that we just imported, he also showed during the demo, noting that eliminates the necessity of doing the manual transcribing of content and delivers huge productivity gains over conventional transcription methods. It is also highly accurate - between roughly 90 to 95 percent accurate - based on good good-quality content, he noted.

The transcript interface includes text on the right side of the screen and a media player on the left with intuitive controls to play back audio and video, he demonstrated. Also featured are tools that help provide fast text editing, including an auto highlight of potentially misspelled words and spell check, he showed. Users can also create captions in more than one language, he noted.

During the Q&A, he said: Unlike other providers, we're not limited to one specific speech-to-text engine. In fact, we, by design, do not operate that way. We constantly evaluate and measure the performance of all the best speech-to-text engines that exist in the marketplace today. And so, we're not limited to just one. And the reason that that's important is this technology is progressing and developing and advancing very quickly and so being tied to one or the other is inherently limiting. We would rather take the approach of using them all and continually measuring and evaluating them.

So, as an example, if we detect that Engine A' is performing better in scenarios - say where there is sports content, and we can even be more specific: domestic American basketball - we see that speech-to-text Engine A' is performing better in this application, we automatically in the background route that content based on machine learning capability to say we're going to route this client's content through this speech-to-text engine because we see it now as performing better than the other options, he explained.

There is a great degree of accuracy that we can accomplish by using that process, he noted.

Although Trance is currently not a live captioning solution, he was quick to say: It is on our roadmap and it is something that we're actively developing. So, live captioning with the ability to run our speech-to-text engine, to collapse the time of that speech-to-text process down to near real-time, or essentially real-time, giving an operator the ability to make very quick edits within a few seconds of live and be able to do that on the fly. That's something that we're evaluating and we're working towards as the technology matures and there's a degree of reliability and consistency that we can bring to the market that is on the roadmap for sure. Not today - but coming soon.

He went on to point out: We're constantly developing the product . This company really adheres to a philosophy and a down-to-earth principle in being very, very agile. And, as much as this is an enterprise tool, the product operates on a very agile basis, meaning it's able to take and respond to customer requests very, very quickly.

There is a long history at Digital Nirvana of continual development an
LINK: https://digital-nirvana.com/ai-can-be-leveraged-to-simplify-enhance-st...
See more stories from digitalnirvana

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

26/03/2026

Allen Media Group To Deploy Anoki ContextIQ

Share Copy link Facebook X Linkedin Bluesky Email...

26/03/2026

LG Announces New Premium FAST Channels

Share Copy link Facebook X Linkedin Bluesky Email...

26/03/2026

IABM to Host Breakfast Event at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

26/03/2026

Nexstar Defends Tegna Deal in Calif. Court Filing

Share Copy link Facebook X Linkedin Bluesky Email...

26/03/2026

Nevion introduces powerful new Panel Builder to enhance VideoIPath broadcast control capability

Nevion introduces powerful new Panel Builder to enhance VideoIPath broadcast con...

26/03/2026

2026 Oscar Nominated Films Powered by Blackmagic Design

2026 Oscar Nominated Films Powered by Blackmagic Design Brie Clayton March 25, 2026 0 Comments DaVinci Resolve Studio used on 27 of this year's no...

26/03/2026

Leader to present full suite of advanced Test & Measurement solutions at NAB Show 2026

Leader to present full suite of advanced Test & Measurement solutions at NAB Sho...

26/03/2026

Boston Conservatory to Present New England and Collegiate Premiere of Groundbreaking Opera Time to Act

Boston Conservatory to Present New England and Collegiate Premiere of Groundbrea...

26/03/2026

Phantom C-Series High-Speed Cameras Set a New Standard for Automotive Crash and Safety Imaging

Wayne, N.J., March 26, 2026 Phantom High-Speed announces the latest product li...

25/03/2026

In The Hot Seat: The Art of Directing a Premier League Match

Live match directors Sarah Cheadle (Sky Sports), Rob Levi (TNT Sports), and Andrew Swift (BBC Sport) sit down with the Premier League's Rachel Nightingale t...

25/03/2026

SVG Students To Watch: Kyle Maier, St. Bonaventure University

The senior from Upstate New York is manning the mic while also interning for the athletic department's sports-information team In the live-sports-video ind...

25/03/2026

NAB 2026: Synamedia Launches Edge Watermarking Solution, Marks 10 Years of ContentArmor

Synamedia has announced ContentArmor Edge Watermarking, a server-side solution t...

25/03/2026

SES Taps K2 Space to Build meoSphere MEO Satellite Network

SES has announced meoSphere, a medium Earth orbit (MEO) satellite network targeted for operation by 2030. The first phase will pair SES-developed software-defin...

25/03/2026

Reuters and TVU Networks Begin Satellite-to-IP Migration for Live News Distribution

TVU Networks is working with Reuters on a phased migration from satellite to a c...

25/03/2026

Nielsen Names Three Senior Hires in Sports, Advertising, and Publishing Roles

Nielsen has announced three senior appointments. Seth Ladetsky has been named Head of Global Sports. Trevor Fellows will lead Nielsen's advertiser and agenc...

25/03/2026

Anoki and Amagi Bring Scene-Level Intelligence to In-Content CTV Ads

Anoki and Amagi have launched In-Scene Ads powered by Anoki ContextIQ across Amagi's portfolio of in-content ad formats for Free Ad-supported Streaming TV (...

25/03/2026

NAB 2026: Arkona to Unveil BLADE//planner and Platform Updates

Arkona Technologies will announce a series of enhancements to its BLADE//runner platform at NAB 2026 (Booth C.1808). The updates focus on usability and workflow...

25/03/2026

San Diego Padres Partners With Daktronics to Enhance Petco Park

Daktronics has installed two tower displays and a video wall in the Lexus Club at Petco Park in San Diego ahead of the 2026 season. Continuing to improve the ...

25/03/2026

NAB 2026: MultiDyne Marks 50th Anniversary

MultiDyne Video & Fiber Optic Systems is celebrating its 50th anniversary as NAB Show 2026 approaches. The company was founded in 1976 by Vincent Jachetta, an N...

25/03/2026

NAB 2026: IPC to Debut with One Connect Intercom Platform and New One Link Keypanels

IPC, a provider of integrated communication solutions, will make its NAB 2026 de...

25/03/2026

ESPN Tops 2026 Sports Emmy Nominations With 63 Nods

Live production categories were led by NBC, FOX, and ESPN's NFL coverage...

25/03/2026

Atlanta Braves and Spectrum Reach Multiyear Distribution Agreement for BravesVision

The Atlanta Braves and Spectrum have announced a multiyear distribution agreemen...

25/03/2026

The AI Doc Asks the Question No One Wants to Answer

(L-R) Charlie Tyrell and Daniel Roher attend The AI Doc: Or How I Became An Apocaloptimist Premiere during the 2026 Sundance Film Festival at The Ray Theatre ...

25/03/2026

Kelsey Lu and Savanah Leaf Lean Into the Emotional Core of Running To Pain' in Episode Three of Directed By'

Directed By, Spotify's documentary-style series that pulls back the curtain ...

25/03/2026

BTS and Spotify Bring ARIRANG' to Top Fans in New York City

BTS is so back., This week, the global pop superstars took the stage at New York City's Pier 17 for their first U.S. performance in four years. Part of Spo...

25/03/2026

Step Into Sound at Our New Spotify Listening Lounge in London

How you listen can shape what you hear. That's the idea behind the new Spotify Listening Lounge, an acoustic space at our London headquarters purpose-built ...

25/03/2026

Iconic Instruments launch Transport Vintage Tape

Tape effects taken to the extreme The latest release from New York-based developer Iconic Instruments is said to accurately recreate the saturation and comp...

25/03/2026

Sonuscore introduce Fantasy Vocal Phrases

Launched alongside new Vocal Phrases bundle Sonuscore's latest release has been designed specifically for composers working on fantasy TV, film and game...

25/03/2026

Steinberg unveil Nuendo 15

Latest update now live The latest version of Steinberg's post-production-focused DAW has just arrived, and comes packed with new dialogue editing, sound...

25/03/2026

Rohde & Schwarz joins FormFactor's MeasureOne partner program

Rohde & Schwarz joins FormFactor's MeasureOne partner program FormFactor and Rohde & Schwarz advance their partnership for on-wafer RF component character...

25/03/2026

L3Harris, RFTEQ Sign Agreement to Advance Sovereign Electronic Warfare Capability in Australia

L3Harris Technologies and RFTEQ Pty Ltd signed a memorandum of understanding to ...

25/03/2026

L3Harris to Provide Autonomous Underwater Capability for US Navy Submarines

L3Harris delivers combat-ready Torpedo Tube Launch and Recovery system, which deploys and retrieves Iver4 900 autonomous underwater vehicles through submarine t...

25/03/2026

Nielsen Names New Senior Leaders Supporting Sports, Advertising and Publishing Clients

The company expands leadership team under Chief Revenue Officer Amilcar Perez S...

25/03/2026

Stable TV Viewership in Poland in February as Warner Bros. Discovery Retains Top Spot

Winter Olympic Games Opening Ceremony features in top 10 programmes of the month...

25/03/2026

Mediaproxy to Show Upgrades to LogServer at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

25/03/2026

Hitomi transforms production synchronisation with the lau...

Providing wide view timing visibility across the entire production chain...

25/03/2026

Bitfocus showcases complete control at NAB Show 2026

Continuing development drives advances in security, availability, access and connectivity...

25/03/2026

Caudalie Paris HQ elevates brand experience with INFiLED...

Caudalie, the renowned French cosmetics brand, has unveiled a state-of-the-art 200-seat auditorium at its new headquarters in the historic Marais district of ce...

25/03/2026

Telestream Unlocks Adobe-Centric Media Pipeline and Strea...

Telestream, a global leader in media workflow technologies, today announced expanded integration with Adobe Premiere, Adobe Media Encoder (AME), and Frame.io, d...

25/03/2026

Marshall Electronics Showcases New Feature Rich CV320 and...

Marshall Electronics is expanding its lineup of high-performance POV cameras designed for broadcast, live production and professional AV applications with the d...

25/03/2026

OOONA Achieves TPN Gold Star Shield - the Highest Level o...

OOONA, a global provider of professional management and production tools for the media localization industry, announced today that it has been awarded the TPN G...

25/03/2026

Gray Media to Simulcast 2026 Atlanta Braves Home Opener

Share Copy link Facebook X Linkedin Bluesky Email...

25/03/2026

2026 NAB Show Exhibitor Insight: Appear

Share Copy link Facebook X Linkedin Bluesky Email...

25/03/2026

Deepfakes Vulnerable to AI Fingerprint Hacks, Study Finds

Share Copy link Facebook X Linkedin Bluesky Email...

25/03/2026

SipMX launches at NAB Show 2026 to democratize media orch...

SipRadius, specialists in secure, low-latency media transport, will drive innovation and interoperability still further with the launch of the SipMX Alliance at...