
MIcrosoft is presenting a research paper this week at Interspeech 2019 in Austria entitled Meeting Transcription Using Asynchronous Distant Microphones, which shows the potential to allow meeting participants to use smartphones, laptops, and tablets, which are already equipped with microphones, instead of specially designed mics.
The full details are posted on a blog on the Microsoft website.
The central idea is to use any internet-connected devices, such as the laptops and smart phones that attendees typically bring to meetings, and form an ad-hoc microphone array in the cloud. With this approach, teams would be able to choose to use the smartphones, laptops, and tablets they already have to enable high-accuracy transcription without needing special-purpose hardware.
While the idea sounds simple, it requires overcoming many technical challenges to be effective. The audio quality of devices varies significantly. The speech signals captured by different microphones are not aligned with each other. The number of devices and their relative positions are unknown. For these reasons and others, consolidating the information streams from multiple independent devices in a coherent way is much more complicated than it may seem. In fact, although the concept of ad hoc microphone arrays dates back to the beginning of this century, to our knowledge it has not been realized as a product or public prototype so far. Meanwhile, techniques for combining multiple information streams were developed in different research areas. At the same time, general advances in speech recognition, especially via the use of neural network models, have helped bring transcription accuracy closer to usable levels.
The diagram shown above depicts the resulting processing pipeline. It starts with aligning signals from different microphones, followed by blind beamforming. The term blind refers to the fact that beamforming is achieved without any knowledge about the microphones and their locations. This is achieved by using neural networks optimised to recover input features for acoustic models, as we reported previously. This beamformer generates multiple signals so that the downstream modules (speech recognition and speaker diarisation) can still leverage the acoustic diversity offered by the random microphone placement. After speech recognition and speaker diarisation, the speaker-annotated transcripts from multiple streams are consolidated by combining confusion networks that encode both word and speaker hypotheses and they are sent back to the meeting attendees. After the meeting, the attendees can choose to keep the transcripts available only to themselves or share them with specified people.
The work published at Interspeech 2019 is part of a longer focused effort, codenamed Project Denmark.
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
24/10/2025
NEW YORK Charters Spectrum Reach has announced that its clients have used Waymark's AI-driven ad creation platform to create more than 15,000 ads since Spec...
24/10/2025
BURLINGTON, Mass. Avid has today announced the release of Pro Tools 2025.10, a feature-rich update that the company said offers notable advances in immersive mu...
24/10/2025
NEW YORK In a major change for the ad industry, Comcast Advertising will unveil technology that enables agencies and brands to buy targetable, biddable ads on l...
24/10/2025
WASHINGTON The ATSC broadcast standards group has outlined a growing list of international activities that the group said is expanding its influence and solidif...
23/10/2025
Unlocking character: Sportcast on executing the Bundesliga and Bundesliga 2 new ...
23/10/2025
Clear coordination: Juggling the new Bundesliga rights cycle requirements and pu...
23/10/2025
Analysis: Is piracy just the cost of doing business? By Callum McCarthy, Editor-at-Large
Tuesday, October 21, 2025 - 09:58
Print This Story
It's high ...
23/10/2025
ESPN's Adam Whitlock on Driving Real-World Innovation Across the Video-Trans...
23/10/2025
SVG TranSPORT 2025 Unites 300+ Industry Leaders in New York for Deep Dive Into L...
23/10/2025
NBA Tip-Off: League Starts Season With Two New Broadcast Partners, In-House NBA ...
23/10/2025
NFL Deepens Business Partnership with EA Sports; More Madden Casts to Come?EA Sports will remain the exclusive producer and distributor of Madden NFL video game...
23/10/2025
NFL Moves Pro Bowl Games Indoors and to Super Bowl Week; Leans Into a Made-for-T...
23/10/2025
By Alan Dominguez
Recently I have been thinking about the intersection of two e...
23/10/2025
(L-R) Amber Fares and Noam Shuster Eliassi attend the 2025 Sundance Film Festival premiere of Coexistence, My Ass! at the Egyptian Theatre on January 26, 2025...
23/10/2025
The new solution is industry's first multi-channel receiver available for L3Harris's resilient tactical high-frequency data waveforms....
23/10/2025
NEW YORK During a high-profile session at NAB Show New York, new survey data was shared that revealed significant public concern over artificial intelligence (A...
23/10/2025
BELLEVUE, Wash. and NEW YORK Fox Weather has tapped T-Mobile as its preferred communications provider and said all of its reporters will be equipped with SuperM...
23/10/2025
RASTATT, Germany Broadcast and media workflow technology vendor Lawo has tapped Mike Wright as VP of sales, North America....
23/10/2025
MONTREAL European cultural broadcaster ARTE has selected Grass Valley LDX 135 cameras and Creative Grading solution as part of its move from SDI/1080i to a nati...
23/10/2025
CINCINNATI The E.W. Scripps Company has named Daniel Parsons as its new chief information security officer, effective Oct. 20....
23/10/2025
ALAMEDA, Calif. Northern Michigan broadcaster WWTV recently completed a major IP-based upgrade that connects its new Traverse City studio with its control room ...
23/10/2025
A deadline is looming for a new carriage deal between Verizon's Fios TV and Nexstar, with both Verizon and the pay TV-backed American Television Alliance bl...
23/10/2025
NEW YORK During a high-profile session at NAB Show New York, new survey data was shared that revealed significant public concern over artificial intelligence (A...
23/10/2025
BELLEVUE, Wash. and NEW YORK Fox Weather has tapped T-Mobile has as its preferred communications provider and announced that all Fox Weather reporters are being...
23/10/2025
PBS will use generative AI from Amazon Web Services to provide enhanced search results to viewers on the PBS App and PBS LearningMedia platforms, the network an...
23/10/2025
The 90-minute film is produced by Rogan Scotland, part of BAFTA-winning Rogan Pr...
23/10/2025
Back to All News
The Resurrected' Marks First Chinese-Language Series to L...
23/10/2025
RT is today publishing a statistical summary from the Register of External Activities for the second quarter of 2025.
The RT Register of External Activities ...
23/10/2025
Series three of the award winning, hit comedy entertainment series The 2 Johnnies Late Night Lock In is back on your screens, celebrating the very best of all t...
23/10/2025
Performances by Michael Flatley, Andy Irvine, Cuckoo's Nest, Foster and Allen and more
Friday 24 October, 8pm on RT One and RT Player
Fleadh Cheoil re...
23/10/2025
The nights grow longer and the shadows get bolder with Vampire The Masquerade: B...
22/10/2025
MONTR AL - October 2, 2025 - The Institute of Technical Education (ITE) last mon...
22/10/2025
Prime Video Inks Deal To Present NFL Black Friday Game Worldwide By SVG Staff
Wednesday, October 22, 2025 - 10:06 am
Print This Story | Subscribe
Story ...
22/10/2025
NBA Tip-Off: ESPN Goes 1080p HDR End-to-End, Flipping HDR Switch on REMI and REM...
22/10/2025
FloSports Empowers Division II, III Athletic Departments With Turnkey Production...
22/10/2025
Wall Street Video Summit Debuts, Bringing Together 200 Financial Enterprise Vide...
22/10/2025
Dueling Pianos: International Chopin Piano Competition Is as Competitive as a Ba...
22/10/2025
In 1995, a young Colombian artist released an album that would change Latin pop ...
22/10/2025
Over the past few months, a photovoltaic system has been installed on a three-he...
22/10/2025
The Orion spacecraft for NASA's Artemis II mission is stacked on the Space Launch System (SLS) rocket in High Bay 3 of the Vehicle Assembly Building at Kenn...
22/10/2025
L3Harris' Hybrid SATCOM is resilient by design, offering path diversity that eliminates vulnerabilities by routing data across the best available networks i...
22/10/2025
WASHINGTON, D.C. Organizers of NAB Show New York said they are expecting more than 12,000 registered attendees from about 100 countries along with 260 exhibitor...
22/10/2025
WASHINGTON, D.C The organizers of The 2025 NAB Show New York have announced that they are expecting more than 12,000 registered attendees from about 100 countr...
22/10/2025
Masque Sound, a leading theatrical sound reinforcement, installation and design company, supplied an extensive gear package of professional-grade equipment for ...
22/10/2025
Lightware, a global leader in signal management and AV connectivity solutions, is seeing strong market momentum for the UCX-3x3-TPX-RX20, a compact transmitter-...
22/10/2025
MELVILLE, N.Y. Chyron has released PAINT 10.2, the latest update for its telestration platform, adding support for SMPTE ST 2110 IP workflows, expanding brandin...
22/10/2025
WASHINGTON Run3TV today said NBCUniversal is joining as an investor in the ATSC 3.0 Framework Authority, which develops the Run3TV NextGen TV application platfo...
22/10/2025
ATLANTA swXtch.io will feature two new networking solutions extending the company's reach across more cloud and on-prem workflows at NAB Show New York, set ...
22/10/2025
The Warner Bros. Discoverys HBO Max streaming services has increased prices for all its streaming tiers effectively immediately for new customers. Existing cust...