
Thanks to their work driving AI forward, Akshit Arora and Rafael Valle could someday speak to their spouses' families in their native languages.
Arora and Valle - along with colleagues Sungwon Kim and Rohan Badlani - won the LIMMITS '24 challenge which asks contestants to recreate in real time a speaker's voice in English or any of six languages spoken in India with the appropriate accent. Their novel AI model only required a three-second speech sample.
The NVIDIA team advanced the state of the art in an emerging field of personalized voice interfaces for more than a billion native speakers of Bengali, Chhattisgarhi, Hindi, Kannada, Marathi and Telugu.
Making Voice Interfaces Realistic The technology for personalized text-to-speech translation is a work in progress. Existing services sometimes fail to accurately reflect the accents of the target language or nuances of the speaker's voice.
The challenge judged entries by listening for the naturalness of models' resulting speech and its similarity to the original speaker's voice.
The latest improvements promise personalized, realistic conversations and experiences that break language barriers. Broadcasters, telcos, universities, as well as e-commerce and online gaming services are eager to deploy such technology to create multilingual movies, lectures and virtual agents.
We demonstrated we can do this at a scale not previously seen, said Arora, who has two uses close to his heart.
Breaking Down Linguistic Barriers A senior data scientist who supports one of NVIDIA's biggest customers, Arora speaks Punjabi, while his wife and her family are native Tamil speakers.
It's a gulf he's long wanted to bridge for himself and others. I had classmates who knew their native languages much better than the Hindi and English used in school, so they struggled to understand class material, he said.
The gulf crosses continents for Valle, a native of Brazil whose wife and family speak Gujarati, a language popular in west India.
It's a problem I face every day, said Valle, an AI researcher with degrees in computer music and machine listening and improvisation. We've tried many products to help us have clearer conversations.
Badlani, an AI researcher, said living in seven different Indian states, each with its own popular language, inspired him to work in the field.
A Race to the Finish Line The initiative started nearly two years ago when Arora and Badlani formed the four-person team to work on the very different version of the challenge that would be held in 2023.
Their efforts generated a working code base for the so-called Indic languages. But getting to the win announced in January required a full-on sprint because the 2024 challenge didn't get on the team's radar until 15 days before the deadline.
Luckily, Kim, a deep learning researcher in NVIDIA's Seoul office, had been working for some time on an AI model well suited to the challenge.
A specialist in text-to-speech voice synthesis, Kim was designing a so-called P-Flow model prior to starting his second internship at NVIDIA in 2023. P-Flow models borrow the technique large language models employ of using short voice samples as prompts so they can respond to new inputs without retraining.
I created the model for English, but we were able to generalize it for any language, he said.
We were talking and texting about this model even before he started at NVIDIA, said Valle, who mentored Kim in two internships before he joined full time in January.
Giving Others a Voice P-Flow will soon be part of NVIDIA Riva, a framework for building multilingual speech and translation AI software, included in the NVIDIA AI Enterprise software platform.
The new capability will let users deploy the technology inside their data centers, on personal systems or in public or private cloud services. Today, voice translation services typically run on public cloud services.
I hope our customers are inspired to try this technology, Arora said. I enjoy being able to showcase in challenges like this one the work we do every day.
The contest is part of an initiative to develop open-source datasets and AI models for nine languages most widely spoken in India.
Hear Arora and Badlani share their experiences in a session at GTC next month.
And listen to the results of the team's model below, starting with a three-second sample of a native Kannada speaker:
https://blogs.nvidia.com/wp-content/uploads/2024/02/pr_kannada_f_indictts_prompt_3s-1.mp3
Here's a similar-sounding synthesized voice reading the first sentence of this blog in Hindi:
https://blogs.nvidia.com/wp-content/uploads/2024/02/pr_kannada_f_indictts_speaking_hindi_3-2.mp3
And then in English:
https://blogs.nvidia.com/wp-content/uploads/2024/02/pr_kannada_f_indictts_speaking_english-1.mp3 See notice regarding software product information.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
15/01/2026
Back to All News
Nah Yung-suk Presents Take a Hike!' - A Snowy Reality Adv...
15/01/2026
Back to All News
Firebreak Premieres on Netflix February 20
Entertainment
15 January 2026
GlobalSpain
Link copied to clipboard
DOWNLOAD THE FIRST LOOK IMA...
15/01/2026
Back to All News
The Variety, Voices, and Vision Shaping What's Next on Net...
15/01/2026
Back to All News
Netflix and Sony Pictures Entertainment Enter New Pay-1 Deal W...
15/01/2026
The Hollywood Professional Association (HPA) today announced the nominees for th...
15/01/2026
Award-winning production solutions bridge traditional and next-generation workflows
FOR-A MixBoard
FOR-A IMPULSE
viztrick AiDi
MFR-3100EX...
15/01/2026
Arvato Systems Named Launch Partner for AWS European Sovereign Cloud
As a launch partner for the AWS European Sovereign Cloud, Arvato Systems enables customer...
15/01/2026
NVIDIA kicked off the year at CES, where the crowd buzzed about the latest gaming announcements - including the native GeForce NOW app for Linux and Amazon Fire...
14/01/2026
Staines-upon-Thames, UK, 13th January, 2026 ITV, one of the UKs leading broadcasters, has selected Yospace, the global leader in Dynamic Ad Insertion (DAI), to ...
14/01/2026
Tech Focus: Audio Consoles, Part 2 - New Options for Virtual MixingA variety of solutions offer both technical and economic benefitsBy Dan Daley, Audio Editor
...
14/01/2026
Tech Focus: Audio Consoles, Part 1 - Key Component Evolves Toward the Totally Vi...
14/01/2026
SVG Summit 2025: Audio from Monday Workshops Now AvailableListen to sessions from Live Production Innovation, AI Production Tools, Cloud Production, Content Wor...
14/01/2026
The L3Harris large T7 robotic systems will provide U.S. Navy and U.S. Marines wi...
14/01/2026
Steiger Media's adoption of Calrec's compact Argo M console not only makes its innovative new hybrid truck faster, more efficient, and agile, but also e...
14/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
14/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
14/01/2026
Share Share by:
Copy link
Facebook
X
Whatsapp
Pinterest
Flipboard...
14/01/2026
Press Release: The Boston Globe Names Cartesian a Top Place to Work in 2025
January 14, 2026
News
Cartesian - January 14, 2026 - EINPresswire.com - Sp...
14/01/2026
Comscore and Marcus Theatres Announce Five-Year Extension for Cinema ACE and Ent...
14/01/2026
Comscore and Santikos Entertainment Announce Five-Year Circuit Wide Commitment t...
14/01/2026
January 14th, 2026
TRIBECA ANNOUNCES BEST NEW YORK SHORT AWARD FOR 25TH ANNIVERSARY FESTIVAL
In Celebration of Its 25th Anniversary, Tribeca Introduces a N...
14/01/2026
Wednesday 14 January 2026
Sky News announces Cathy Newman to lead flagship new political programme
Sky News today announces that award-winning journalist and ...
14/01/2026
Back to All News
State of Fear, The First Spin-Off of a Netflix Brazil Producti...
14/01/2026
The first stamp of An Post's 2026 Stamp Programme, marking 100 Years of Broadcasting, was unveiled at the GPO by Patrick O'Donovan TD, Minister for Cult...
14/01/2026
It's official! Beverley Callard has landed in Carrigstown. The beloved actor, known for her unforgettable roles and iconic screen presence, is joining the c...
13/01/2026
Independent media in Brazil and Colombia is facing an urgent crisis of traditional business models alongside a deteriorating security environment, according to ...
13/01/2026
NHL Situation Room 2.0: How Sony Hawk-Eye Powers Centralized Officiating, Player...
13/01/2026
NBC Sports Ices the Audio for the 2026 Prevagen U.S. Figure Skating Championship...
13/01/2026
DMF and MXL in practice: Which vendors are adopting it, and how fast is the ecos...
13/01/2026
CES 2026: Five Important Sports-Tech BuzzwordsThe terms highlight innovations for sports production at the showBy Daniel Frankel, SVG Contributor
Tuesday, Jan...
13/01/2026
For TGL Season 2, Unity 6 Boosts Virtual-Graphic Quality; COSM 360 Cameras Impro...
13/01/2026
Resetting Expectations? The State of the Sports Industry with Devoncroft's J...
13/01/2026
Top Row L-R: Ana Katz, Natalia Almada, Bao Nguyen, Tatiana Maslany, A.V. Rockwell, Dr. Heather Berlin
Second Row L-R: Sophie Barthes, Azazel Jacobs, Janicza Br...
13/01/2026
DoW to invest $1B in planned independently traded Missile Solutions business...
13/01/2026
L3Harris Chairman and CEO Christopher Kubasik and Under Secretary of War for Acq...
13/01/2026
April 10, 2025
First Gulf has taken a significant step in its U.S. expansion with the launch of its first industrial development in the country.
First Westla...
13/01/2026
April 11, 2025
Canadian footwear retailer SoftMoc has signed a lease for 145,600 square feet at 901 Hopkins Street in Whitby, where the space will serve as a w...
13/01/2026
April 14, 2025
First Gulf is proud to announce that 25 Ontario has officially received its occupancy permit, marking the transition from an active construction...
13/01/2026
April 28, 2025
First Gulf has been awarded a design-build lease for a new 350,000 square foot office and warehouse facility for Sherwin-Williams. This project ...
13/01/2026
August 13, 2025
First Gulf Expands U.S. Industrial Footprint with First Savanna...