Sony Pixel Power calrec Sony

Microsoft uses AI to transform smartphones into...

18/09/2019

MIcrosoft is presenting a research paper this week at Interspeech 2019 in Austria entitled Meeting Transcription Using Asynchronous Distant Microphones, which shows the potential to allow meeting participants to use smartphones, laptops, and tablets, which are already equipped with microphones, instead of specially designed mics.

The full details are posted on a blog on the Microsoft website.

The central idea is to use any internet-connected devices, such as the laptops and smart phones that attendees typically bring to meetings, and form an ad-hoc microphone array in the cloud. With this approach, teams would be able to choose to use the smartphones, laptops, and tablets they already have to enable high-accuracy transcription without needing special-purpose hardware.

While the idea sounds simple, it requires overcoming many technical challenges to be effective. The audio quality of devices varies significantly. The speech signals captured by different microphones are not aligned with each other. The number of devices and their relative positions are unknown. For these reasons and others, consolidating the information streams from multiple independent devices in a coherent way is much more complicated than it may seem. In fact, although the concept of ad hoc microphone arrays dates back to the beginning of this century, to our knowledge it has not been realized as a product or public prototype so far. Meanwhile, techniques for combining multiple information streams were developed in different research areas. At the same time, general advances in speech recognition, especially via the use of neural network models, have helped bring transcription accuracy closer to usable levels.

The diagram shown above depicts the resulting processing pipeline. It starts with aligning signals from different microphones, followed by blind beamforming. The term blind refers to the fact that beamforming is achieved without any knowledge about the microphones and their locations. This is achieved by using neural networks optimised to recover input features for acoustic models, as we reported previously. This beamformer generates multiple signals so that the downstream modules (speech recognition and speaker diarisation) can still leverage the acoustic diversity offered by the random microphone placement. After speech recognition and speaker diarisation, the speaker-annotated transcripts from multiple streams are consolidated by combining confusion networks that encode both word and speaker hypotheses and they are sent back to the meeting attendees. After the meeting, the attendees can choose to keep the transcripts available only to themselves or share them with specified people.

The work published at Interspeech 2019 is part of a longer focused effort, codenamed Project Denmark.
LINK: http://www.inavateonthenet.net/news/article/microsoft-uses-ai-to-trans...
See more stories from teracue

Most recent headlines

09/11/2025

Dalet Unveils Agentic AI Media Workflows at IBC2025

Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...

24/10/2025

Spectrum Reach Has Deployed More Than 15,000 AI-Powered Ad Campaigns

NEW YORK Charters Spectrum Reach has announced that its clients have used Waymark's AI-driven ad creation platform to create more than 15,000 ads since Spec...

24/10/2025

Avid Releases Pro Tools 2025.10

BURLINGTON, Mass. Avid has today announced the release of Pro Tools 2025.10, a feature-rich update that the company said offers notable advances in immersive mu...

24/10/2025

Comcast Advertising Unveils Programmatic Solution for Linear TV

NEW YORK In a major change for the ad industry, Comcast Advertising will unveil technology that enables agencies and brands to buy targetable, biddable ads on l...

24/10/2025

ATSC Expands Its Influence with Growing International Ties

WASHINGTON The ATSC broadcast standards group has outlined a growing list of international activities that the group said is expanding its influence and solidif...

23/10/2025

Unlocking Character: Sportcast on Executing the Bundesliga and Bundesliga 2 New Season Production

Unlocking character: Sportcast on executing the Bundesliga and Bundesliga 2 new ...

23/10/2025

Clear Coordination: Juggling the New Bundesliga Rights Cycle Requirements and Pushing Innovation Forward at Sportcast

Clear coordination: Juggling the new Bundesliga rights cycle requirements and pu...

23/10/2025

Analysis: Is Piracy Just the Cost of Doing Business?

Analysis: Is piracy just the cost of doing business? By Callum McCarthy, Editor-at-Large Tuesday, October 21, 2025 - 09:58 Print This Story It's high ...

23/10/2025

ESPN's Adam Whitlock on Driving Real-World Innovation Across the Video-Transmission Industry

ESPN's Adam Whitlock on Driving Real-World Innovation Across the Video-Trans...

23/10/2025

SVG TranSPORT 2025 Unites 300+ Industry Leaders in New York for Deep Dive Into Live Transmission Technology

SVG TranSPORT 2025 Unites 300+ Industry Leaders in New York for Deep Dive Into L...

23/10/2025

NBA Tip-Off: League Starts Season With Two New Broadcast Partners, In-House NBA TV/NBA App Ops

NBA Tip-Off: League Starts Season With Two New Broadcast Partners, In-House NBA ...

23/10/2025

NFL Deepens Business Partnership with EA Sports; More Madden Casts to Come?

NFL Deepens Business Partnership with EA Sports; More Madden Casts to Come?EA Sports will remain the exclusive producer and distributor of Madden NFL video game...

23/10/2025

NFL Moves Pro Bowl Games Indoors and to Super Bowl Week; Leans Into a Made-for-TV Presentation

NFL Moves Pro Bowl Games Indoors and to Super Bowl Week; Leans Into a Made-for-T...

23/10/2025

Together in Time: Alan Domnguez on the Common Themes in his Films and Sundance Institute's Support

By Alan Dominguez Recently I have been thinking about the intersection of two e...

23/10/2025

Coexistence, My Ass! Dares Peacemaking to Not Be So Serious

(L-R) Amber Fares and Noam Shuster Eliassi attend the 2025 Sundance Film Festival premiere of Coexistence, My Ass! at the Egyptian Theatre on January 26, 2025...

23/10/2025

A Force Multiplier for High-Frequency Communications: The L3Harris ARGUS-HF

The new solution is industry's first multi-channel receiver available for L3Harris's resilient tactical high-frequency data waveforms....

23/10/2025

Survey: Americans Concerned' About AI's Impact on Journalism

NEW YORK During a high-profile session at NAB Show New York, new survey data was shared that revealed significant public concern over artificial intelligence (A...

23/10/2025

Fox Weather Taps T-Mobile's SuperMobile for Extreme Weather Coverage

BELLEVUE, Wash. and NEW YORK Fox Weather has tapped T-Mobile as its preferred communications provider and said all of its reporters will be equipped with SuperM...

23/10/2025

Mike Wright Joins Lawo as VP, Sales, North America

RASTATT, Germany Broadcast and media workflow technology vendor Lawo has tapped Mike Wright as VP of sales, North America....

23/10/2025

European Broadcaster ARTE Taps Grass Valley for IP Transition

MONTREAL European cultural broadcaster ARTE has selected Grass Valley LDX 135 cameras and Creative Grading solution as part of its move from SDI/1080i to a nati...

23/10/2025

Scripps Names Daniel Parsons Chief Information Security Officer

CINCINNATI The E.W. Scripps Company has named Daniel Parsons as its new chief information security officer, effective Oct. 20....

23/10/2025

WWTV Completes IP Studio Upgrade

ALAMEDA, Calif. Northern Michigan broadcaster WWTV recently completed a major IP-based upgrade that connects its new Traverse City studio with its control room ...

23/10/2025

Verizon Fios TV, Nexstar Blackout Looms as Contract Ends on Oct. 24

A deadline is looming for a new carriage deal between Verizon's Fios TV and Nexstar, with both Verizon and the pay TV-backed American Television Alliance bl...

23/10/2025

Survey: Americans 'Concerned' About AI's Impact on Journalism

NEW YORK During a high-profile session at NAB Show New York, new survey data was shared that revealed significant public concern over artificial intelligence (A...

23/10/2025

Fox Weather Taps T-Mobile's Supermobile for Extreme-Weather Coverage

BELLEVUE, Wash. and NEW YORK Fox Weather has tapped T-Mobile has as its preferred communications provider and announced that all Fox Weather reporters are being...

23/10/2025

PBS Taps Amazon Bedrock to Improve Search on Digital Platforms

PBS will use generative AI from Amazon Web Services to provide enhanced search results to viewers on the PBS App and PBS LearningMedia platforms, the network an...

23/10/2025

Actor Jessica Barden joins Becoming Victoria Wood - U&GOLD's feature-length documentary celebrating the life of Victoria Wood

The 90-minute film is produced by Rogan Scotland, part of BAFTA-winning Rogan Pr...

23/10/2025

The Resurrected' Marks First Chinese-Language Series to Launch Netflix Profile Icons

Back to All News The Resurrected' Marks First Chinese-Language Series to L...

23/10/2025

RT publishes Register of External Activities for Q2/2025 (statistical summary)

RT is today publishing a statistical summary from the Register of External Activities for the second quarter of 2025. The RT Register of External Activities ...

23/10/2025

THE BOYS ARE BACK IN TOWN THE 2 JOHNNIES LATE NIGHT LOCK IN RETURNS FOR SERIES 3

Series three of the award winning, hit comedy entertainment series The 2 Johnnies Late Night Lock In is back on your screens, celebrating the very best of all t...

23/10/2025

Fleadh Cheoil, presented by Dith S and Muireann Nic Amhlaoibh returns to RT

Performances by Michael Flatley, Andy Irvine, Cuckoo's Nest, Foster and Allen and more Friday 24 October, 8pm on RT One and RT Player Fleadh Cheoil re...

23/10/2025

Fangs Out, Frames Up: Vampire: The Masquerade - Bloodlines 2' Leads a Killer GFN Thursday

The nights grow longer and the shadows get bolder with Vampire The Masquerade: B...

22/10/2025

ITE Singapore Officially Opens Next-Generation Hybrid Learning Space with X2O Media's OneRoom

MONTR AL - October 2, 2025 - The Institute of Technical Education (ITE) last mon...

22/10/2025

Prime Video Inks Deal To Present NFL Black Friday Game Worldwide

Prime Video Inks Deal To Present NFL Black Friday Game Worldwide By SVG Staff Wednesday, October 22, 2025 - 10:06 am Print This Story | Subscribe Story ...

22/10/2025

NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REMCO Shows

NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REM...

22/10/2025

FloSports Empowers Division II, III Athletic Departments With Turnkey Production Suite for Livestreaming Production

FloSports Empowers Division II, III Athletic Departments With Turnkey Production...

22/10/2025

Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Video Executives in NYC

Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Vide...

22/10/2025

Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ballgame - and Has Amazing Audio

Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ba...

22/10/2025

Celebrate the Anniversaries of Shakira's Landmark Albums With Spotify-Exclusive EP and Video Special

In 1995, a young Colombian artist released an album that would change Latin pop ...

22/10/2025

SGL Carbon expands sustainable energy supply and invests in photovoltaic system at Meitingen site

Over the past few months, a photovoltaic system has been installed on a three-he...

22/10/2025

Orion Meets SLS: L3Harris Technology Ready to go to the Moon

The Orion spacecraft for NASA's Artemis II mission is stacked on the Space Launch System (SLS) rocket in High Bay 3 of the Vehicle Assembly Building at Kenn...

22/10/2025

Hybrid SATCOM: Delivering Resilient and Agile Connectivity Today

L3Harris' Hybrid SATCOM is resilient by design, offering path diversity that eliminates vulnerabilities by routing data across the best available networks i...

22/10/2025

The 2025 NAB Show New York Opens With More Than 12,000 Attendees Expected

WASHINGTON, D.C. Organizers of NAB Show New York said they are expecting more than 12,000 registered attendees from about 100 countries along with 260 exhibitor...

22/10/2025

The 2025 NAB Show New York Set to Open with More Than 12,000 Attendees Expected

WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...

22/10/2025

Masque Sound and Jaffe Holden Create Transformative Perfo...

Masque Sound, a leading theatrical sound reinforcement, installation and design company, supplied an extensive gear package of professional-grade equipment for ...

22/10/2025

Lightware UCX-3x3-TPX-RX20 sets new standard for connecte...

Lightware, a global leader in signal management and AV connectivity solutions, is seeing strong market momentum for the UCX-3x3-TPX-RX20, a compact transmitter-...

22/10/2025

Chyron Releases PAINT 10.2 Telestration Platform

MELVILLE, N.Y. Chyron has released PAINT 10.2, the latest update for its telestration platform, adding support for SMPTE ST 2110 IP workflows, expanding brandin...

22/10/2025

NBCUniversal Invests in ATSC 3.0 Authority Behind Run3TV

WASHINGTON Run3TV today said NBCUniversal is joining as an investor in the ATSC 3.0 Framework Authority, which develops the Run3TV NextGen TV application platfo...

22/10/2025

swXtch.io to Feature SRT-X Gateway, groundSwXtch at NAB Show New York

ATLANTA swXtch.io will feature two new networking solutions extending the company's reach across more cloud and on-prem workflows at NAB Show New York, set ...

22/10/2025

HBO Max Increases Prices for All Tiers

The Warner Bros. Discoverys HBO Max streaming services has increased prices for all its streaming tiers effectively immediately for new customers. Existing cust...