
More than 75 million people speak Telugu, predominantly in India's southern regions, making it one of the most widely spoken languages in the country.
Despite such prevalence, Telugu is considered a low-resource language when it comes to speech AI. This means there aren't enough hours' worth of speech datasets to easily and accurately create AI models for automatic speech recognition (ASR) in Telugu.
And that means billions of people are left out of using ASR to improve transcription, translation and additional speech AI applications in Telugu and other low-resource languages.
To build an ASR model for Telugu, the NVIDIA speech AI team turned to the NVIDIA NeMo framework for developing and training state-of-the-art conversational AI models. The model won first place in a competition conducted in October by IIIT-Hyderabad, one of India's most prestigious institutes for research and higher education.
NVIDIA placed first in accuracy for both tracks of the Telugu ASR Challenge, which was held in collaboration with the Technology Development for Indian Languages program and India's Ministry of Electronics and Information Technology as a part of its National Language Translation Mission.
For the closed track, participants had to use around 2,000 hours of a Telugu-only training dataset provided by the competition organizers. And for the open track, participants could use any datasets and pretrained AI models to build the Telugu ASR model.
NVIDIA NeMo-powered models topped the leaderboards with a word error rate of approximately 13% and 12% for the closed and open tracks, respectively, outperforming by a large margin all models built on popular ASR frameworks like ESPnet, Kaldi, SpeechBrain and others.
What sets NVIDIA NeMo apart is that we open source all of the models we have - so people can easily fine-tune the models and do transfer learning on them for their use cases, said Nithin Koluguri, a senior research scientist on the conversational AI team at NVIDIA. NeMo is also one of the only toolkits that supports scaling training to multi-GPU systems and multi-node clusters.
Building the Telugu ASR Model The first step in creating the award-winning model, Koluguri said, was to preprocess the data.
Koluguri and his colleague Megh Makwana, an applied deep learning solution architect manager at NVIDIA, removed invalid letters and punctuation marks from the speech dataset that was provided for the closed track of the competition.
Our biggest challenge was dealing with the noisy data, Koluguri said. This is when the audio and the transcript don't match - in this case you cannot guarantee the accuracy of the ground-truth transcript you're training on.
The team cleaned up the audio clips by cutting them to be less than 20 seconds, chopped out clips of less than 1 second and removed sentences with a greater-than-30 character rate, which measures characters spoken per second.
Makwana then used NeMo to train the ASR model for 160 epochs, or full cycles through the dataset, which had 120 million parameters.
For the competition's open track, the team used models pretrained with 36,000 hours of data on all 40 languages spoken in India. Fine-tuning this model for the Telugu language took around three days using an NVIDIA DGX system, according to Makwana.
Inference test results were then shared with the competition organizers. NVIDIA won with around 2% better word error rates than the second-place participant. This is a huge margin for speech AI, according to Koluguri.
The impact of ASR model development is very high, especially for low-resource languages, he added. If a company comes forward and sets a baseline model, as we did for this competition, people can build on top of it with the NeMo toolkit to make transcription, translation and other ASR applications more accessible for languages where speech AI is not yet prevalent.
NVIDIA Expands Speech AI for Low-Resource Languages ASR is gaining a lot of momentum in India majorly because it will allow digital platforms to onboard and engage with billions of citizens through voice-assistance services, Makwana said.
And the process for building the Telugu model, as outlined above, is a technique that can be replicated for any language.
Of around 7,000 world languages, 90% are considered to be low resource for speech AI - representing 3 billion speakers. This doesn't include dialects, pidgins and accents.
Open sourcing all of its models on the NeMo toolkit is one way NVIDIA is improving linguistic inclusion in the field of speech AI.
In addition, pretrained models for speech AI, as part of the NVIDIA Riva software development kit, are now available in 10 languages - with many additions planned for the future.
And NVIDIA last month hosted its inaugural Speech AI Summit, featuring speakers from Google, Meta, Mozilla Common Voice and more. Learn more about Unlocking Speech AI Technology for Global Language Users by watching the presentation on demand.
Get started building and training state-of-the-art conversational AI models with NVIDIA NeMo.
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
01/11/2025
NEW YORK ITN and the sell-side advertising company Magnite have announced the launch of what they are billing as the industrys first Local Linear TV Private Mar...
31/10/2025
FanDuel Sports Network To Deliver Selected Live NBA, NHL Games to Major Streamin...
31/10/2025
NBC Jumps Out of the Gate in Extended Breeder's Cup Deal With Dual Drones, J...
31/10/2025
FOR IMMEDIATE RELEASE
30 October 2025
It is with great sadness that we mourn the passing of Segomotso Keorapetse, an award- winning South African television d...
31/10/2025
IRVING, Texas As station groups move into an era that promises rapid tech, regulatory and economic changes, Nexstar Media Group said its board has extended chai...
31/10/2025
While some analysts have questioned the ongoing economic viability of broacast-TV late night shows amid ongoing declines in linear viewing, new data from Tubula...
31/10/2025
The contentious contract negotiations between The Walt Disney Co. and YouTube TV have resulted in a blackout of Disney-owned programming on the pay TV operator....
31/10/2025
CINCINNATI Video conversion and AV signal distribution specialist tvONE and Matrox Video have struck a strategic partnership, combining CALICO PRO's video p...
31/10/2025
NEW YORK The Interactive Advertising Bureau (IAB) today released a new industry guide that discusses the urgency of adopting new standards that will help advert...
31/10/2025
While some analysts have questioned the ongoing economic viability of late night shows on broadcast TV amid ongoing declines in linear viewing, new data from Tu...
31/10/2025
Berklee Celebrates the Inauguration of President Jim Lucchese In his inaugural address, Lucchese shared an optimistic vision for Berklee's future as a for...
31/10/2025
Back to All News
Family, Food, and Films: Netflix's Dining with the Kapoors...
31/10/2025
The review highlights DPA 4055 Kick Drum Microphone for its compact design, ease of placement, and authentic tone that captures the true character of the drum p...
31/10/2025
The RT Raidi na Gaeltachta Award 2025 will be presented to journalist P il n N Chiar in at the Oireachtas na Samhna in Belfast tomorrow, Saturday 1 November,...
31/10/2025
RT lyric fm is calling for choirs across Ireland to share their festive music-m...
31/10/2025
Three awards were presented to RT Raidi na Gaeltachta broadcasters at the Oire...
31/10/2025
RT continues its proud tradition of championing Ireland's vibrant arts and cultural landscape through its RT Supporting the Arts initiative. This November...
31/10/2025
RT selects Irish independent production company to produce Christian Worship on...
31/10/2025
Amidst Gyeongju, South Korea's ancient temples and modern skylines, Jensen H...
30/10/2025
Midwich has signed a UK and Ireland distribution deal with X2O Media, a worldwid...
30/10/2025
SVG Students To Watch: Sam Newitt, Kansas State UniversityThe South Dakota native thrives in many roles behind the scenes at K-StateHD.TVBy Brandon Costa, Direc...
30/10/2025
SVG Sit-Down: Swerve Sports' Christy Tanner Explores the Young FAST Channel&...
30/10/2025
SVG Campus Shot Callers: Andy Liebsch, Senior Director, Video Services, Kansas S...
30/10/2025
Diversified Names Paul Lidsky CEO, Expanding Leadership Role After Serving as Bo...
30/10/2025
NBA, Cosm Enter Long-Term Partnership for Shared Reality Production, Distributio...
30/10/2025
SVG New Sponsor Spotlight: FanConnect's Brett Crossley on Reimagining the Ga...
30/10/2025
FanDuel Sports Network to Deliver Select Live NBA, NHL Games to Major Streaming ...
30/10/2025
As the year comes to a close, we can feel the invigorating wind sweeping in for ...
30/10/2025
By Bailey Pennick
One of the most exciting things about the Sundance Film Festi...
30/10/2025
The SGL Carbon site in Bonn has a long tradition of training. For many years, young talent has been successfully trained here, regularly achieving excellent exa...
30/10/2025
SBS, NITV and Screen Australia announce 2025 Digital Originals Shortlist
29 October, 2025
Media releases
SBS, NITV and Screen Australia are excited to unve...
30/10/2025
Jon Rambeau, President of Integrated Mission Systems at L3Harris Technologies, speaks about industrial collaboration at the Asia-Pacific Economic Cooperation (A...
30/10/2025
MELBOURNE, Fla., October 30, 2025 - L3Harris Technologies (NYSE: LHX) reports th...
30/10/2025
WASHINGTON Federal Communications Commission Chair Brendan Carr said he has circulated a proposal for the agency to auction additional midband spectrum in the U...
30/10/2025
PLANO, Texas Technology solutions provider Diversified has named Paul Lidsky as CEO, tasked with guiding the company's next stage of growth, driving market ...
30/10/2025
CUPERTINO, Calif. Interra Systems today unveiled ORION stream recording support and seamless integration with BATON Media Player, a combination that lets broadc...
30/10/2025
WILMINGTON, Del. InterDigital today announced the acquisition of Deep Render, an artificial intelligence startup with a team of AI experts focused on video code...
30/10/2025
NEW YORK TAG Video Systems has earned a higher-rated Digital Product Passport (DPP) Committed to Sustainability badge and the Aclymate Climate Wise Silver Tier ...
30/10/2025
IRVING, Texas As station groups move into an era that promises rapid tech, regulatory and economic changes, the Nexstar Media Group, Inc. has announced that its...
30/10/2025
Television viewers are spending more time watching streaming content than linear TV, but sports continues to be a bright spot for broadcasters, according to the...
30/10/2025
NEW YORK Advertising technology company Operative Media has named Mike Napadano as its new CEO....
30/10/2025
Walmart Inc. has chosen Marshall Electronics cameras for use across its brand-new corporate campus studios and event center. The installation includes Marshall ...
30/10/2025
NETGEAR, Inc. (NASDAQ: NTGR), a global leader in intelligent networking solutions designed to power extraordinary experiences, today announced the launch of its...
30/10/2025
Clear-Com recently contributed its award-winning Gen-IC virtual intercom solution to power real-time communications for On-Air Student TV, a 24-hour global st...
30/10/2025
Maxon, maker of powerful, approachable software solutions for creators working in 2D and 3D design, motion graphics, visual effects, and more, today announced t...
30/10/2025
Studio Technologies, a leading manufacturer of high-quality audio, video, and fiber-optic solutions, announces that its new Model 394 GPI Interface and Model 39...
30/10/2025
Broadpeak , a leader in streaming and monetization at scale, has been selected by leading Malaysian content and entertainment company Astro to enable two major ...
30/10/2025
Riedel Communications is pleased to announce that Ulrich Voigt has joined the company as Director Live Production Solutions, taking over the SimplyLive business...
30/10/2025
LiveU, the global leader in live IP-video contribution, production, and distribution, today announced a new partnership with Kinetiq, the AI-powered platform un...