Now You're Speaking My Language: NVIDIA Riva Sets New Bar for Fully Customizable Speech AI

21/09/2022

Whether for virtual assistants, transcriptions or contact centers, voice AI services are turning words and conversations into bits and bytes of business magic.

At GTC this week, NVIDIA announced new additions to NVIDIA Riva, a GPU-accelerated software development kit for building and deploying speech AI applications.

Riva's pretrained models are now offered in seven languages, including French and Hindi. Additional languages on the horizon: Arabic, Italian, Japanese, Korean and Portuguese. Riva also brings improvements in accuracy for English, German, Mandarin, Russian and Spanish. Additionally, it adds capabilities like word-level confidence scores and speaker diarization - the process of identifying speakers in audio streams.
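To make the two new capabilities concrete, here is a minimal sketch of how an application might consume word-level confidence scores together with diarization labels. The `Word` record, its field names, the `speaker_turns` helper and the 0.5 threshold are all hypothetical and chosen for illustration; this is not the Riva API, only one plausible shape for the data it describes.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass
class Word:
    """Hypothetical per-word result: not an actual Riva type."""
    text: str
    confidence: float  # assumed to lie in the range 0.0 to 1.0
    speaker: int       # assumed diarization label for the word


def speaker_turns(words, min_confidence=0.5):
    """Group consecutive words by speaker and mark low-confidence words."""
    turns = []
    for speaker, group in groupby(words, key=lambda w: w.speaker):
        tokens = [
            w.text if w.confidence >= min_confidence else f"[{w.text}?]"
            for w in group
        ]
        turns.append((speaker, " ".join(tokens)))
    return turns


# Made-up sample data, purely for illustration.
words = [
    Word("hello", 0.98, 0),
    Word("there", 0.91, 0),
    Word("hi", 0.95, 1),
    Word("bill", 0.42, 1),
    Word("question", 0.88, 1),
]

for speaker, text in speaker_turns(words):
    print(f"Speaker {speaker}: {text}")
```

In a contact-center setting, a transcript marked up this way could route uncertain words to a human reviewer instead of treating every word as equally reliable.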

Riva is built to be fully customizable at every stage of the speech AI pipeline to help solve unique problems efficiently. Developers can also deploy it where they want their data to be: on premises, for hybrid multiclouds, at the edge or in embedded devices. It's used by enterprises to bolster services, efficiency and competitive advantage.

While AI for voice services has been in high demand, development tools have lagged. More people are working and learning from home, shopping online and seeking remote customer support, which strains call centers and pushes voice applications to their limits. Customer service wait times have recently tripled as staffing shortages have hit call centers hard, according to a 2022 Bloomberg report.

Advances in speech AI offer the way forward. NVIDIA Riva enables companies to explore larger deep learning models and develop more nuanced voice systems. Speech AI applications built on Riva provide an accelerated path to better services, promising improved customer experiences and engagement.

Rising Demand for Voice AI Applications

The worldwide market for contact center software reached about $27 billion in 2021, a figure expected to nearly triple to $79 billion by 2029, according to Fortune Business Insights.

This increase is due to the benefits that customized voice applications offer businesses of any size, in almost every industry - from global enterprises, to original equipment manufacturers delivering speech AI-based systems and cloud services, to systems integrators and independent software vendors.

Riva SDK Accelerates AI Workflows

NVIDIA Riva includes pretrained language models that can be used as is or fine-tuned using transfer learning from the NVIDIA TAO Toolkit, which allows for custom datasets in a no-code environment. Riva automated speech recognition (ASR) and text-to-speech (TTS) models can be optimized, exported and deployed as speech services.

Voice AI is making its way into ever more types of applications, such as customer support virtual assistants and chatbots, video conferencing systems, drive-thru convenience food orders, retail by phone, and media and entertainment. Global organizations have adopted Riva to drive voice AI efforts, including T-Mobile, Deloitte, HPE, Interactions, 1-800-Flowers.com, Quantiphi and Kore.ai.

T-Mobile adopted Riva for its T-Mobile Expert Assist - a custom-built call center application that uses AI to transcribe real-time customer conversations and recommend solutions - for 17,000 customer service agents. T-Mobile plans to deploy Riva worldwide soon.

Hewlett Packard Enterprise offers HPE ProLiant servers that include NVIDIA GPUs and NVIDIA Riva software in a system capable of developing and running challenging speech AI and natural language processing workloads that can easily turn audio into insights. HPE ProLiant systems and NVIDIA Riva form a world-class, full-stack solution for running financial services and other industry applications.

"To deliver the capabilities of NVIDIA Riva, HPE offers a Kubernetes-based NLP reference architecture based on HPE Ezmeral software," said Scott Ramsay, vice president of HPE GreenLake solutions at HPE. "Delivered through the HPE GreenLake cloud platform, this system enables developers to accelerate the development and deployment of next-generation speech AI applications."

Deloitte supports clients looking to deploy ASR and TTS use cases, such as for order-taking systems in some of the world's largest quick-order restaurants. It's also developing chatbot services for healthcare providers that will enable accurate and efficient transcriptions for patient questions and chat summarizations.

"Advances in natural language processing make it possible to design cost-efficient experiences that enable purposeful, simple and natural customer conversations," said Christine Ahn, principal at Deloitte US. "Our clients are looking for a streamlined path to conversational AI deployment, and NVIDIA Riva supports that path."

Interactions has integrated Riva with its Curo software platform to create seamless, personalized engagements for customers in a broad range of industries that include telecommunications, as well as for companies such as 1-800-Flowers.com, which has deployed a speech AI order-taking system.

Kore.ai is integrating Riva with its SmartAssist speech AI contact-center-as-a-service, which powers its BankAssist, HealthAssist, AgentAssist, HR Assist and IT Assist products. Proofs of concept with NVIDIA Riva are in progress.

Quantiphi is a solution-delivery partner that is developing closed-captioning solutions using Riva for customers in media and entertainment, including Fox News. It's also developing digital avatars with Riva for telecommunications and other industries.

Complex Speech AI Pipelines, Easier Solutions

Speech AI pipelines can be complex and require coordination across multiple services. Microservices are required to run at scale with ASR models, natural language understanding, TTS and domain-specific apps. NVIDIA GPUs are ideal for acceleration of these types of specialized tasks.
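The coordination described above can be pictured as a chain of three stages. The sketch below is a toy illustration with stub functions standing in for the ASR, natural language understanding and TTS services. Every name in it (`transcribe`, `understand`, `synthesize`, `handle_request`) is hypothetical, and the stubs do no real speech processing; in a real deployment each stage would be a call to a separate service.

```python
def transcribe(audio: bytes) -> str:
    """Stub ASR stage: pretends the audio bytes are already UTF-8 text."""
    return audio.decode("utf-8")


def understand(text: str) -> dict:
    """Stub NLU stage: a keyword match stands in for intent classification."""
    intent = "order" if "order" in text.lower() else "unknown"
    return {"intent": intent, "text": text}


def synthesize(reply: str) -> bytes:
    """Stub TTS stage: returns the reply as bytes instead of audio."""
    return reply.encode("utf-8")


def handle_request(audio: bytes, asr=transcribe, nlu=understand, tts=synthesize) -> bytes:
    """Run one request through ASR, then NLU, then domain logic, then TTS."""
    text = asr(audio)
    result = nlu(text)
    # Domain-specific logic sits between understanding and speech output.
    if result["intent"] == "order":
        reply = "Taking your order."
    else:
        reply = "Sorry, could you repeat that?"
    return tts(reply)


print(handle_request(b"I would like to order flowers"))
```

Passing the stages in as parameters keeps each one independently replaceable, which mirrors the reason such pipelines are usually split into microservices.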

Riva offers softwar
LINK: https://blogs.nvidia.com/blog/2022/09/21/riva-speech-ai/...