
Now You're Speaking My Language: NVIDIA Riva Sets New Bar for Fully Customizable Speech AI

21/09/2022

Whether for virtual assistants, transcriptions or contact centers, voice AI services are turning words and conversations into bits and bytes of business magic.

At GTC this week, NVIDIA announced new additions to NVIDIA Riva, a GPU-accelerated software development kit for building and deploying speech AI applications.

Riva's pretrained models are now offered in seven languages, including French and Hindi. Additional languages on the horizon: Arabic, Italian, Japanese, Korean and Portuguese. Riva also brings improvements in accuracy for English, German, Mandarin, Russian and Spanish. Additionally, it adds capabilities like word-level confidence scores and speaker diarization - the process of identifying speakers in audio streams.

Riva is built to be fully customizable at every stage of the speech AI pipeline to help solve unique problems efficiently. Developers can also deploy it where they want their data to be: on premises, for hybrid multiclouds, at the edge or in embedded devices. It's used by enterprises to bolster services, efficiency and competitive advantage.

While AI for voice services has been in high demand, development tools have lagged. More people are working and learning from home, shopping online and seeking remote customer support, which strains call centers and pushes voice applications to their limits. Customer service wait times have recently tripled as staffing shortages have hit call centers hard, according to a 2022 Bloomberg report.

Advances in speech AI offer the way forward. NVIDIA Riva enables companies to explore larger deep learning models and develop more nuanced voice systems. Speech AI applications built on Riva provide an accelerated path to better services, promising improved customer experiences and engagement.

Rising Demand for Voice AI Applications

The worldwide market for contact center software reached about $27 billion in 2021, a figure expected to nearly triple to $79 billion by 2029, according to Fortune Business Insights.

This increase is due to the benefits that customized voice applications offer businesses of any size, in almost every industry - from global enterprises, to original equipment manufacturers delivering speech AI-based systems and cloud services, to systems integrators and independent software vendors.

Riva SDK Accelerates AI Workflows

NVIDIA Riva includes pretrained language models that can be used as is or fine-tuned using transfer learning from the NVIDIA TAO Toolkit, which allows for custom datasets in a no-code environment. Riva automated speech recognition (ASR) and text-to-speech (TTS) models can be optimized, exported and deployed as speech services.

Voice AI is making its way into ever more types of applications, such as customer support virtual assistants and chatbots, video conferencing systems, drive-thru convenience food orders, retail by phone, and media and entertainment. Global organizations have adopted Riva to drive voice AI efforts, including T-Mobile, Deloitte, HPE, Interactions, 1-800-Flowers.com, Quantiphi and Kore.ai.

T-Mobile adopted Riva for its T-Mobile Expert Assist - a custom-built call center application that uses AI to transcribe real-time customer conversations and recommend solutions - for 17,000 customer service agents. T-Mobile plans to deploy Riva worldwide soon.

Hewlett Packard Enterprise offers HPE ProLiant servers that include NVIDIA GPUs and NVIDIA Riva software in a system capable of developing and running challenging speech AI and natural language processing workloads that can easily turn audio into insights. HPE ProLiant systems and NVIDIA Riva form a world-class, full-stack solution for running financial services and other industry applications.

"To deliver the capabilities of NVIDIA Riva, HPE offers a Kubernetes-based NLP reference architecture based on HPE Ezmeral software," said Scott Ramsay, vice president of HPE GreenLake solutions at HPE. "Delivered through the HPE GreenLake cloud platform, this system enables developers to accelerate the development and deployment of next-generation speech AI applications."

Deloitte supports clients looking to deploy ASR and TTS use cases, such as for order-taking systems in some of the world's largest quick-order restaurants. It's also developing chatbot services for healthcare providers that will enable accurate and efficient transcriptions for patient questions and chat summarizations.

"Advances in natural language processing make it possible to design cost-efficient experiences that enable purposeful, simple and natural customer conversations," said Christine Ahn, principal at Deloitte US. "Our clients are looking for a streamlined path to conversational AI deployment, and NVIDIA Riva supports that path."

Interactions has integrated Riva with its Curo software platform to create seamless, personalized engagements for customers in a broad range of industries that include telecommunications, as well as for companies such as 1-800-Flowers.com, which has deployed a speech AI order-taking system.

Kore.ai is integrating Riva with its SmartAssist speech AI contact-center-as-a-service, which powers its BankAssist, HealthAssist, AgentAssist, HR Assist and IT Assist products. Proofs of concept with NVIDIA Riva are in progress.

Quantiphi is a solution-delivery partner that is developing closed-captioning solutions using Riva for customers in media and entertainment, including Fox News. It's also developing digital avatars with Riva for telecommunications and other industries.

Complex Speech AI Pipelines, Easier Solutions

Speech AI pipelines can be complex and require coordination across multiple services. Microservices are required to run at scale with ASR models, natural language understanding, TTS and domain-specific apps. NVIDIA GPUs are ideal for acceleration of these types of specialized tasks.

Riva offers softwar
LINK: https://blogs.nvidia.com/blog/2022/09/21/riva-speech-ai/...