Sony Pixel Power calrec Sony

AI Can Be Leveraged to Simplify, Enhance STT Services

01/07/2020

AI Can Be Leveraged to Simplify, Enhance STT Services

Author:Guy Finley Artificial intelligence (AI) can be used by media and entertainment companies to simplify and enhance all of their subtitling, translation and transcription (STT) services in the cloud, according to M&E technology firm Digital Nirvana.

Digital Nirvana's Russell Wise, SVP of sales and marketing, and Ed Hauber, its business development manager, used the June 24 webinar Leveraging AI for Speed & Efficiency in M&E STT to detail how Trance - the company's enterprise-level, cloud-based closed captioning and translation solution - can simplify the process, as a managed or self-service STT tool.

Bloomberg, Turner and other major media organizations are already using the plug-and-play, AI-powered offering to produce captions at record speed, improving productivity by 50% and more, according to Digital Nirvana. The workflow can be used across the industry, with media, post and caption service providers all able to take advantage.

Trance is a cloud-based, enterprise-level Software-as-a-Service (SaaS) platform that is used to generate automated transcripts, to create closed captions, to translate those captions into alternate languages and also to export captioned files in all known industry-supported formats, Hauber pointed out.

Trance is also fully web-based, he noted, adding: It's accessible via a LAN, WAN or even a basic Internet connection. As an enterprise tool, Trance is fully configurable for an unlimited number of users, groups and roles.

Administrators, meanwhile, can manage multiple projects, they can create manage users, define roles and permissions, as well as establish system presets, he said, while giving viewers a demonstration of Trance.

The Manage Presets section gives users the ability to define caption attributes, such as the number of lines, the line length and the total number of characters, he pointed out during the demo.

To get media into Trance, we have a tool that we use called Media Services Portal and, like Trance, Media Services Portal - also called MSP - [is] a cloud-based platform, which allows users to ingest any number of common audio and video file formats into Trance, he said. MSP can directly integrate with both FTP and Amazon S3, he also noted.

Digital Nirvana also offers an open application programming interface (API) to integrate Media Services Portal directly into large enterprise media systems, he pointed out. Using our API, those operators don't need to create a secondary workflow process to move media into and out of Trance - and this is a really big time-saving and productivity advantage of Trance, he said.

The Trance speech-to-text engine has created a highly-accurate transcript of the media that we just imported, he also showed during the demo, noting that eliminates the necessity of doing the manual transcribing of content and delivers huge productivity gains over conventional transcription methods. It is also highly accurate - between roughly 90 to 95 percent accurate - based on good good-quality content, he noted.

The transcript interface includes text on the right side of the screen and a media player on the left with intuitive controls to play back audio and video, he demonstrated. Also featured are tools that help provide fast text editing, including an auto highlight of potentially misspelled words and spell check, he showed. Users can also create captions in more than one language, he noted.

During the Q&A, he said: Unlike other providers, we're not limited to one specific speech-to-text engine. In fact, we, by design, do not operate that way. We constantly evaluate and measure the performance of all the best speech-to-text engines that exist in the marketplace today. And so, we're not limited to just one. And the reason that that's important is this technology is progressing and developing and advancing very quickly and so being tied to one or the other is inherently limiting. We would rather take the approach of using them all and continually measuring and evaluating them.

So, as an example, if we detect that Engine A' is performing better in scenarios - say where there is sports content, and we can even be more specific: domestic American basketball - we see that speech-to-text Engine A' is performing better in this application, we automatically in the background route that content based on machine learning capability to say we're going to route this client's content through this speech-to-text engine because we see it now as performing better than the other options, he explained.

There is a great degree of accuracy that we can accomplish by using that process, he noted.

Although Trance is currently not a live captioning solution, he was quick to say: It is on our roadmap and it is something that we're actively developing. So, live captioning with the ability to run our speech-to-text engine, to collapse the time of that speech-to-text process down to near real-time, or essentially real-time, giving an operator the ability to make very quick edits within a few seconds of live and be able to do that on the fly. That's something that we're evaluating and we're working towards as the technology matures and there's a degree of reliability and consistency that we can bring to the market that is on the roadmap for sure. Not today - but coming soon.

He went on to point out: We're constantly developing the product . This company really adheres to a philosophy and a down-to-earth principle in being very, very agile. And, as much as this is an enterprise tool, the product operates on a very agile basis, meaning it's able to take and respond to customer requests very, very quickly.

There is a long history at Digital Nirvana of continual development an
LINK: https://digital-nirvana.com/ai-can-be-leveraged-to-simplify-enhance-st...
See more stories from digitalnirvana

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

21/04/2026

Live From NAB 2026: BitFires Colin Bonzey on Growing Spark Platform for Live Cloud-Based Productions

Cloud-based production isnt going anywhere, and BitFire is doubling down by prov...

21/04/2026

Live From NAB 2026: AWSs Jason Dvorkin, Regina Rossi on Driving Innovation With Al-Based Workflows

The topic of artificial intelligence has a stranglehold on the sports-video-prod...

21/04/2026

Live From NAB 2026: T-Mobile for Business' Jason Schnellbacher on Enhancing 5G for Sports Fans, Broadcasters

5G is still a hot topic in live event production, and this workflow continues to...

21/04/2026

Live From NAB 2026: Appears Ed McGivern on Fox Sports Deal, New XM Platform, and VX Software Debut

At the 2026 NAB Show, Ed McGivern, GM and President of Appear US, discusses the ...

21/04/2026

NAB 2026: Studio Network Solutions launches on-premise AI suite for media production workflows

Studio Network Solutions (SNS) has announced an on-premise AI suite designed for...

21/04/2026

NAB 2026: Suite Studios integrates file-streaming technology into Frame.io Drive

Suite Studios has integrated its file-streaming technology into the newly announced Frame.io Drive, a desktop application from Adobe company Frame.io. The colla...

21/04/2026

NAB 2026: Net Insight integrates InSync FrameFormer into Nimbra Edge for media processing

Net Insight has integrated InSync Technology's FrameFormer into the Nimbra E...

21/04/2026

NAB 2026: Fox Sports selects Appear X Platform for live production infrastructure

Fox Sports has selected Appear as a technology partner to support the next phase...

21/04/2026

NAB 2026: Diversified appoints Tyler Affolter as Chief Revenue Officer

Diversified has appointed Tyler Affolter as Chief Revenue Officer (CRO) to lead the company's commercial organisation. The appointment follows the firm'...

21/04/2026

NAB 2026: Layercake integrates Bitmovin into Streamcake platform for end-to-end media orchestration

Layercake has formalised the integration of Bitmovin's video streaming infra...

21/04/2026

NAB 2026: International Judo Federation extends global content distribution partnership with SES

The International Judo Federation (IJF) has extended its distribution partnershi...

21/04/2026

NAB 2026: Glookast integrates Cinnafilm Tachyon plugin to enable GPU-accelerated video processing

Glookast has launched the Cinnafilm Tachyon plugin for its Media Producer and Me...

21/04/2026

NAB 2026: Cadena Tres selects Eutelsat for television signal distribution in Mexico

Eutelsat has entered into an agreement with Cadena Tres, a division of Grupo Ima...

21/04/2026

NAB 2026: Dolby and TV Azteca deploy Dolby Atmos for free-to-air broadcast

Dolby Laboratories and TV Azteca have partnered to introduce Dolby Atmos immersive audio to free-to-air television broadcasts. The implementation utilises the A...

21/04/2026

Verizon and FOX Entertainment leverage 5G and AI for remote production of Extracted

FOX Entertainment partnered with Verizon to overcome significant production hurd...

21/04/2026

NAB 2026: Osprey Video to showcase expanded IP infrastructure and orchestration at NAB Show 2026

Osprey Video has announced its technology showcase for the NAB Show 2026, highli...

21/04/2026

NAB 2026: Riedel introduces IP-based production updates including multiviewer, commentary control and audio connectivity solutions

Riedel Communications (Booth C4908) introduced a range of new solutions at NAB S...

21/04/2026

SportsTechBuzz at NAB 2026, Day 3: Live Reports From the Show Floor in Vegas

The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...

21/04/2026

NAB 2026: Blackmagic Design Announces URSA Cine Immersive 100G and URSA Cine Live Encoder

Blackmagic Design has announced the URSA Cine Immersive 100G, an immersive cinem...

21/04/2026

Live From NAB 2026: Clark Wire & Cables David McCarthy Showcases New Connectivity, Enclosure Solutions for Modern Broadcast Workflows at NAB Show 2026

Clark Wire & Cable is continuing its evolution from cable supplier to full-scale solutions partner for broadcast and live production. At the 2026 NAB Show, we s...

21/04/2026

Ricky Sensitively Portrays a Post-Incarceration Coming-of-Age

Rashad Frett attends the 2025 Sundance Film Festival premiere of Ricky at Eccles Theatre on January 24, 2025, in Park City, UT. (Photo by George Pimentel/Shut...

21/04/2026

5 Years of Spotify in Pakistan: The Trends Shaping the Country's Music Scene

Five years ago, Spotify arrived in Pakistan, opening a new chapter in the country's music scene. Since then, local listeners have explored across genres, ge...

21/04/2026

The Next Wave of RADAR Spain Artists Arrives for 2026

Since its launch in 2020, RADAR has been our program for spotlighting emerging artists around the world. This year marks the sixth edition of RADAR Spain, our o...

21/04/2026

EverSync SP-10 from Cloudvocal

Offers compact wireless solution for pedalboards Taiwanese audio brand Cloudvocal have announced the availability of a new pedalboard-friendly wireless syst...

21/04/2026

Arturia launch Augmented Persia

Latest hybrid sampling/synthesis instrument arrives Arturia's Augmented series offerings rely on a mixture of sampling and synthesis, allowing users to ...

21/04/2026

Rohde & Schwarz to host Power Electronics Online Conference From Design to Validation in May

Rohde & Schwarz to host Power Electronics Online Conference From Design to Vali...

21/04/2026

MAS and Lockheed Martin Announce F-35 Sustainment Partnership in Quebec

MAS and Lockheed Martin partner to establish an F-35 depot in Canada, enabling in-country sustainment and creating high-skilled aerospace jobs....

21/04/2026

Nielsen data shows Australian outdoor and sport retailers are changing how they advertise to win over outdoor enthusiasts

Advertising strategies shift as competition grows for a large, active and qualit...

21/04/2026

ATSC Celebrates 3.0's Global Expansion

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Cinematic Feel Makes Survivor' Built to Last

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Live Event Technology Expands Fan Engagement

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

MS NOW Uses Community to Build Up Its Brand

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Why Broadcast Is Well-Positioned to Safeguard Freedom of Speech

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

AWS Demos AI Tools to Deliver Vertical Video

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Video Podcasting Leaps in Popularity

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Audio Systems Get Boost From Cloud and AI

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration with Streamcake

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration w...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse...

21/04/2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026 Melanie Ciotti April 21, 2026 0 Comments Unlimited processing, no cloud depe...

21/04/2026

IBC appoints Tim Banham as Chief Commercial Officer to dr...

London, 21 April 2026 IBC today announced the appointment of Tim Banham as its first Chief Commercial Officer (CCO), a newly created role that reflects the or...

21/04/2026

Motion Design Tools - April 2026

Motion Design Tools - April 2026 Roland Kahlenberg April 21, 2026 0 Comments Within 2 days, Maxon and Canva announced pro-level motion design apps - A...

21/04/2026

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and XR

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and...

21/04/2026

Diversified Appoints Tyler Affolter Chief Revenue Officer

Share Copy link Facebook X Linkedin Bluesky Email...