Now You're Speaking My Language: NVIDIA Riva Sets New Bar for Fully Customizable Speech AI

21/09/2022

Whether for virtual assistants, transcriptions or contact centers, voice AI services are turning words and conversations into bits and bytes of business magic.

At GTC this week, NVIDIA announced new additions to NVIDIA Riva, a GPU-accelerated software development kit for building and deploying speech AI applications.

Riva's pretrained models are now offered in seven languages, including French and Hindi. Additional languages on the horizon: Arabic, Italian, Japanese, Korean and Portuguese. Riva also brings improvements in accuracy for English, German, Mandarin, Russian and Spanish. Additionally, it adds capabilities like word-level confidence scores and speaker diarization - the process of identifying speakers in audio streams.

Riva is built to be fully customizable at every stage of the speech AI pipeline to help solve unique problems efficiently. Developers can also deploy it where they want their data to be: on premises, in hybrid multicloud environments, at the edge or in embedded devices. Enterprises use it to bolster services, efficiency and competitive advantage.

While AI for voice services has been in high demand, development tools have lagged. More people are working and learning from home, shopping online and seeking remote customer support, which strains call centers and pushes voice applications to their limits. Customer service wait times have recently tripled as staffing shortages have hit call centers hard, according to a 2022 Bloomberg report.

Advances in speech AI offer the way forward. NVIDIA Riva enables companies to explore larger deep learning models and develop more nuanced voice systems. Speech AI applications built on Riva provide an accelerated path to better services, promising improved customer experiences and engagement.

Rising Demand for Voice AI Applications

The worldwide market for contact center software reached about $27 billion in 2021, a figure expected to nearly triple to $79 billion by 2029, according to Fortune Business Insights.

This increase is due to the benefits that customized voice applications offer businesses of any size, in almost every industry - from global enterprises, to original equipment manufacturers delivering speech AI-based systems and cloud services, to systems integrators and independent software vendors.

Riva SDK Accelerates AI Workflows

NVIDIA Riva includes pretrained language models that can be used as is or fine-tuned using transfer learning from the NVIDIA TAO Toolkit, which allows for custom datasets in a no-code environment. Riva automated speech recognition (ASR) and text-to-speech (TTS) models can be optimized, exported and deployed as speech services.

Voice AI is making its way into ever more types of applications, such as customer support virtual assistants and chatbots, video conferencing systems, drive-thru convenience food orders, retail by phone, and media and entertainment. Global organizations have adopted Riva to drive voice AI efforts, including T-Mobile, Deloitte, HPE, Interactions, 1-800-Flowers.com, Quantiphi and Kore.ai.

T-Mobile adopted Riva for its T-Mobile Expert Assist - a custom-built call center application that uses AI to transcribe real-time customer conversations and recommend solutions - for 17,000 customer service agents. T-Mobile plans to deploy Riva worldwide soon.

Hewlett Packard Enterprise offers HPE ProLiant servers that include NVIDIA GPUs and NVIDIA Riva software in a system capable of developing and running challenging speech AI and natural language processing workloads that can easily turn audio into insights. HPE ProLiant systems and NVIDIA Riva form a world-class, full-stack solution for running financial services and other industry applications.

"To deliver the capabilities of NVIDIA Riva, HPE offers a Kubernetes-based NLP reference architecture based on HPE Ezmeral software," said Scott Ramsay, vice president of HPE GreenLake solutions at HPE. "Delivered through the HPE GreenLake cloud platform, this system enables developers to accelerate the development and deployment of next-generation speech AI applications."

Deloitte supports clients looking to deploy ASR and TTS use cases, such as for order-taking systems in some of the world's largest quick-order restaurants. It's also developing chatbot services for healthcare providers that will enable accurate and efficient transcriptions for patient questions and chat summarizations.

"Advances in natural language processing make it possible to design cost-efficient experiences that enable purposeful, simple and natural customer conversations," said Christine Ahn, principal at Deloitte US. "Our clients are looking for a streamlined path to conversational AI deployment, and NVIDIA Riva supports that path."

Interactions has integrated Riva with its Curo software platform to create seamless, personalized engagements for customers in a broad range of industries that include telecommunications, as well as for companies such as 1-800-Flowers.com, which has deployed a speech AI order-taking system.

Kore.ai is integrating Riva with its SmartAssist speech AI contact-center-as-a-service, which powers its BankAssist, HealthAssist, AgentAssist, HR Assist and IT Assist products. Proofs of concept with NVIDIA Riva are in progress.

Quantiphi is a solution-delivery partner that is developing closed-captioning solutions using Riva for customers in media and entertainment, including Fox News. It's also developing digital avatars with Riva for telecommunications and other industries.

Complex Speech AI Pipelines, Easier Solutions

Speech AI pipelines can be complex and require coordination across multiple services. Microservices are required to run at scale with ASR models, natural language understanding, TTS and domain-specific apps. NVIDIA GPUs are ideal for acceleration of these types of specialized tasks.

Riva offers softwar
LINK: https://blogs.nvidia.com/blog/2022/09/21/riva-speech-ai/...