Sony Pixel Power calrec Sony

Tongues Untied: Dataset Starts Global Dialogue in Conversational AI

09/07/2021

A startup in East Africa is harnessing conversational AI to get the word out about a third wave of COVID-19 passing through the region. It hopes its Mbaza AI Chatbot will lead to partnerships that use the technology to tackle other concerns across the continent's many languages.

COVID is here to stay, unfortunately, and it's a volatile topic with measures that tighten and loosen from week to week, so it's important for people to have access to the latest information, said Audace Niyonkuru, founder and CEO of Digital Umuganda, the startup developing the software.

Based in Rwanda's capital of Kigali, his team aims to deploy a basic voice service in August. It will follow up with a version by year's end that can interpret and respond to spoken questions.

Conversational AI Gets the Word Out Ours is a more oral culture where there are still barriers to access because it's easier for people to talk than write, Niyonkuru said of the primarily rural country where three-quarters of the 12 million population are literate.

It's a challenge shared widely across Africa, home to more than 2,000 languages and dialects. But Niyonkuru, a lifelong entrepreneur, prefers to see the glass as half full.

There's a huge opportunity globally because conversational AI is a bridge over barriers to access - people can use their phones to get all sorts of medical or legal information, he said.

Giving AI a Common Voice To train a conversational AI model, you need an extremely large dataset of voice samples, something that takes lots of time to build or lots of money to buy. The startup trained its models on Mozilla Common Voice, a free and publicly available multilingual platform and dataset created by Mozilla and supported by NVIDIA. The Common Voice dataset was built through contributions from thousands of contributors across the world.

Digital Umuganda is Africa's largest contributor to the platform. To date, it's organized contributors to create 2,200 hours of Kinyarwanda, the language spoken by 40 million people in and around Rwanda. It's the largest dataset after English in Common Voice today.

To create the dataset, the startup tapped into Rwanda's tradition where neighbors gather on the last Saturday of each month to work on a community project. The startup embraced and extended the practice called umuganda.

The spirit of open source software is embedded in Rwanda's culture, so we just applied it to the digital world and datasets, he said.

Donations Shared with All Digital Umuganda started collecting data with student gatherings at universities, then went to the countryside to make sure the dataset represented people of all ages.

The beautiful thing is because it's in the open we see researchers around the world working with it, said Niyonkuru.

Two branches of the Rwandan government have expressed interest in using the startup's technology, and at least one third party has already created a conversational AI model using the dataset.

The COVID project got its start last spring when government call centers were overwhelmed by peaks of more than 10,000 calls for information about the pandemic. The Mbaza chatbot will be deployed on existing government healthcare lines as a 24/7 information service.

It's one example of how Common Voice is democratizing access to conversational AI around the globe, both for companies that develop the technology and consumers who use it.

Giving More Languages a Voice First launched in 2017, the Common Voice dataset gets an updated release twice a year. It focuses on expanding support in underrepresented languages, filling wide gaps left by commercial voice projects that typically focus on a handful of the most popular American, Asian and European languages.

Common Voice currently packs more than 10,000 hours of recorded voice samples, collected and validated by volunteers. It's a treasure trove for startups, researchers and small- to medium-sized developers who don't have the time or money to collect or purchase datasets of their own.

The next release, coming at the end of July, provides data from 75 languages, 15 of them debuting in Common Voice for the first time. They include Urdu, spoken by 70 million people in south Asia; Hausa, the language of 60 million Africans; as well as Azerbaijani, Armenian, Serbian and Uighur - none of which are supported by major commercial AI services.

It will be the first release since NVIDIA became a partner with Mozilla in April 2021, supporting Common Voice as part of a shared vision of making conversational AI available for everyone.

How You Can Help We created the NVIDIA Jarvis framework to give developers state-of-the-art pre-trained deep learning models and software tools to create interactive conversational AI services. Now we're helping make this rich, open dataset available, too.

Everyone is invited to join the global effort to make this technology available to all developers in all languages by going to Common Voice and contributing or validating voice samples as part of a dataset anyone can use freely.

Above: Digital Umuganda co-founder Ali Nyiringabo (right) with volunteers at an event in Kigali collecting and validating samples for Common Voice.
LINK: https://blogs.nvidia.com/blog/2021/07/09/common-voice-conversational-a...
See more stories from nvidia

Most recent headlines

04/08/2024

Dalet Appoints Santiago Solanas as CEO to Lead Next Era of Growth and Innovation

Dalet, a leading technology and service provider for media-rich organizations, is excited to announce Santiago Solanas as its new Chief Executive Officer (CEO)....

03/06/2024

Dalet and Veritone Reach Agreement to Distribute, Transact and Monetize Media Archives

Dalet, a leading technology and service provider for media-rich organizations, a...

07/05/2024

Disney Streaming DTC Operations Produce Their First Profits

BURBANK, Calif. The Walt Disney Company has finally delivered some from profits from its hefty streaming investments, with the second quarter of its fiscal year...

07/05/2024

Bonnie Hammer, Jen Psaki Share Books and Workplace Lessons at 92nd Street Y

Bonnie Hammer, vice chair of NBCUniversal, and Jen Psaki, MSNBC host, read from their new books at the 92nd Street Y in Manhattan Wednesday, May 8. Hammer's...

07/05/2024

Paramount Plus Orders Tracy Morgan Comedy Set in World of The Neighborhood'

Paramount Plus has ordered the series Crutch, a comedy that is a spinoff of CBS comedy The Neighborhood. Tracy Morgan stars....

07/05/2024

Daytime Emmys Announce Lifetime Achievement Recipients

Edward J. and Melody Thomas Scott and Lidia Bastianich will receive Lifetime Achievement honors at the 51st annual Daytime Emmy Awards in June, the National Aca...

07/05/2024

Al Roker, Wendy McMahon, Stephen A. Smith Set for Giants of Broadcasting Honors

The 2024 Giants of Broadcasting & Electronic Arts luncheon and awards ceremony happens in New York November 12, and the honorees are Al Roker, weather and featu...

07/05/2024

Dabl Debuts New Weekday Schedule

Dabl Network has added The Wayans Bros., The Jamie Foxx Show, Living Single and Everybody Hates Chris to its weekday and weekend lineups. Those shows join the l...

07/05/2024

Disney Entertainment DTC Business Gets Out of the Red in Q2

After winning a proxy fight, The Walt Disney Co. said its entertainment business turned a profit and added subscribers in its fiscal second quarter....

07/05/2024

Victor Wembanyama, Top NBA Rookie, Featured On Pass The Rock'

NBA Entertainment is closing out the second season of its series Pass the Rock with a look at Victor Wembanyama of the San Antonio Spurs, the league's rooki...

07/05/2024

Syncbak Rebrands as Zeam Media After Streaming Platform Rollout

Syncbak, which provides stations with streaming capabilities, said it is rebranding as Zeam Media....

07/05/2024

Magnolia Pictures Licenses Content To Stream on Samsung TV Plus

Samsung has made a deal with Magnolia Pictures that will bring titles from Magnolia to the Samsung TV Plus free streaming platform....

07/05/2024

Amazon Rolling Out Interactive Commercial Formats for Prime Video (Upfronts)

Amazon, which has added commercials to Amazon Prime Video, is rolling out new interactive ad formats that will enable advertisers to engage streamers and sell s...

07/05/2024

Syncbak Now Zeam Media

Syncbak, a 15 year-old provider of streaming tech for local broadcast stations has rebranded itself, adopting the name of its streaming service launched in Febr...

07/05/2024

Maximising resources: Keys to POST Luxembourg's success in the evolving media landscape

POST Luxembourgs journey with TAG Video underscores how finding a vendor who und...

07/05/2024

Survey: Amazon's Push into Ad-Supported Streaming Is Working

PORTSMOUTH, N.H. New findings from Hub Entertainment Research provides extensive data showing that the majority of consumers will opt for lower cost ad-supporte...

07/05/2024

The Library of American Broadcasting Foundation Unveils the 2024 Award Recipients

NEW YORK The Library of American Broadcasting Foundation (LABF) has announced th...

07/05/2024

Lindsey Reiser Joins CBS News 24/7 as Anchor and Correspondent

CBS News has named Lindsey Reiser an anchor and correspondent for CBS News 24/7, the network's live, streaming news service. Reiser, who was most recently a...

07/05/2024

Tablet Shipments Show Signs of Recovery in Q1

NEEDHAM, Mass. After more than two years of decline, worldwide tablet shipments posted modest year-over-year growth of 0.5% in the first quarter of 2024 (1Q24),...

07/05/2024

ESPN Pulls in Highest April Prime Time Audiences on Record

ESPN is reporting that April was a record-setting month as the network delivered its best April prime time audience on record, dating back more than 30 years....

07/05/2024

Kirsten Donaldson Joins NAB as VP of Public Policy

WASHINGTON, D.C. The National Association of Broadcasters (NAB) has announced that Kirsten Donaldson has joined NAB as vice president of public policy. Donaldso...

07/05/2024

Don't miss Bark in the Park, Margaritaville Night & Bull Sharks Night this week at the DBAP

The Bulls are back home again this week from May 7-12! Don't miss out on any...

07/05/2024

Taylor Swift's Eras Tour arrives to shake up Europe

Taylor Swift's Eras Tour arrives to shake up EuropeHaving shaken four continents, Taylor Swift's Eras Tour finally brings the biggest pop culture icon o...

07/05/2024

Isiphetho: Destiny' beats Scandal!'

Isiphetho: Destiny' beats Scandal!'E.tv's latest telenovela Isiphetho: Destiny' beat the channel's longest running soapie, Scandal!' ...

07/05/2024

Now there's HELP for ex-prisoners to find work in South Africa

Now there's HELP for ex-prisoners to find work in South AfricaNew initiative for ex-prisoners to find work in South Africa proves it's never too late fo...

07/05/2024

Tonight on Scandal: Cohen is pulled into keeping a secret

Tonight on Scandal: Cohen is pulled into keeping a secretDon't miss Tuesday, 7 May's riveting episode of South African soapie Scandal! on e.tv on DStv c...

07/05/2024

Skeem Saam: Monday's episode, 6 May 2024 [video]

Skeem Saam: Monday's episode, 6 May 2024 [video]Missed an episode of Skeem Saam? No problem! Watch the latest episode of your favourite South African soapie...

06/05/2024

Gathering Is a Call to Action: A Letter From Ilyse McKimmie

By Ilyse McKimmie Now, more than ever That's a phrase so often used in the last few years that I've come to dread seeing it in notes like this one. A...

06/05/2024

From Petabytes To Exabytes: The Future Of Shared Storage

alt= class=wp-image-12099 data-lazy-src=/wp-content/uploads/2024/05/Blog-Exabyte-Storage-Demand-960x540-1.jpg/> Demand for storage solutions has reached unprece...

06/05/2024

Spotify Uplifts Bold, Emerging Artists in Honor of Asian and Pacific Islander Heritage Month

Around the world, Asian and Pacific Islander (API) artists continue to impact mu...

06/05/2024

Never Miss a New Release With Countdown Pages for Audiobooks

Spotify is making it easier for booklovers to count down the days, hours, minutes, and seconds until a new audiobook releases. With Countdown Pages for audioboo...

06/05/2024

Get Ready to join Dan Hong as he hits the streets in his ultimate culinary journey

Get Ready to join Dan Hong as he hits the streets in his ultimate culinary journ...

06/05/2024

Lighting a Day-Interior Caf With LEE Filters

In this video, cinematographer Simon Rowling welcomes viewers behind the scenes as he lights a daytime-interior scene inside a coffee shop. Shooting on Panavisi...

06/05/2024

Technology for the Next Generation of Special Forces

L3Harris is well positioned to support the complex and multifaceted nature of special operations forces in all domains through our agile and responsive technolo...

06/05/2024

Canada Plans May 8 Public Alert System Test

OAKVILLE, Ontario As part of Emergency Preparedness Week, Alert Ready, Canadas national public alerting system, will be distributing a test alert to Canadians i...

06/05/2024

Survey: Pay TV Penetration Falls to 40% in U.S. Hispanic Homes

NEW ROCHELLE, N.Y. Horowitz Research has released a new study on the viewing and media habits of U.S. Hispanic/Latine audiences that shows a dramatic decline in...

06/05/2024

RE:Vision Effects Autograph 2024.4 released! 50% Off Through May 9th

RE:Vision Effects Autograph 2024.4 released! 50% Off Through May 9th Brie Clayton May 6, 2024 0 Comments New game-changing motion graphics & VFX featu...

06/05/2024

NBC Orders More Night Court'

NBC has renewed Night Court for a third season. The courtroom comedy was on the network from 1984 to 1992, and NBC rebooted it in early 2023....

06/05/2024

ABC Shares Summer Premiere Dates

ABC has revealed its summer schedule. The Bachelorette gets going Monday, July 8, with Jenn Tran the star in season 21. Celebrity Family Feud starts up Tuesday,...

06/05/2024

Holly Springs Salamanders Home Opener Less Than Three Weeks Away, Tickets on Sale Now

It's almost go-time for the Holly Springs Salamanders! The season opener is ...

06/05/2024

Tonight on Scandal: Deception is rife as a man places his trust in the unfaithful

Tonight on Scandal: Deception is rife as a man places his trust in the unfaithfu...

06/05/2024

Rap beef between Drake and Kendrick Lamar turns NASTY

Rap beef between Drake and Kendrick Lamar turns NASTYA long-simmering feud between rap titans Drake and Kendrick Lamar has exploded into allegations of pedophil...

06/05/2024

INRED and SES to Provide High-Throughput Connectivity Across Colombia's Amazonas

The Amazonas Digital initiative will see INRED leverage SES's MEO satellites...

06/05/2024

Tonight on Skeem Saam: Kobus busts Pretty in a compromising position

Tonight on Skeem Saam: Kobus busts Pretty in a compromising positionDon't miss Monday, 6 May's riveting episode of South African soapie Skeem Saam on SA...

06/05/2024

Tonight on House of Zwide: Bra Zakes and Sphamandla agree to leave Tembisa and never come back

Tonight on House of Zwide: Bra Zakes and Sphamandla agree to leave Tembisa and n...

06/05/2024

RT Announces Series of Major TV Debates as part of its European Parliament Election Coverage

RT has announced details of its coverage across digital, TV and Radio in the ru...

06/05/2024

The Pros and Cons of Cloud, Hybrid, or On-Premises Radio Operations

For years, radio stations have used on-premises servers to broadcast content and manage automation, traffic, and billing systems. As technology continues to adv...

06/05/2024

AI and Big Data Take the Centre Stage in Central Asia at Beetech 2024 Hosted by Beeline Kazakhstan and QazCode

06 May 2024 AI and Big Data Take the Centre Stage in Central Asia at Beetech 20...