Sony Pixel Power calrec Sony

University converts audio clips into lip-synched videos of...

17/07/2017

University of Washington researchers have developed new algorithms that can turn audio clips into a realistic, lip-synced video of the person speaking those words.

As detailed in a paper to be presented August 2 at SIGGRAPH 2017 in L.A., the team successfully generated realistic video of former president Barack Obama talking about terrorism, fatherhood, job creation and other topics using audio clips of those speeches and existing weekly video addresses that were originally on a different topic.

Ira Kemelmacher-Shlizerman, an assistant professor at the UW's Paul G. Allen School of Computer Science & Engineering said, Realistic audio-to-video conversion has practical applications like improving video conferencing for meetings, as well as futuristic ones such as being able to hold a conversation with a historical figure in virtual reality by creating visuals just from audio.

In a visual form of lip-syncing, the system converts audio files of an individual's speech into realistic mouth shapes, which are then grafted onto and blended with the head of that person from another existing video.

In the future video, chat tools like Skype or Messenger will enable anyone to collect videos that could be used to train computer models, Kemelmacher-Shlizerman said.

Because streaming audio over the internet takes up far less bandwidth than video, the new system has the potential to end video chats that are constantly timing out from poor connections.

When you watch Skype or Google Hangouts, often the connection is stuttery and low-resolution and really unpleasant, but often the audio is pretty good, said co-author and Allen School professor Steve Seitz. So if you could use the audio to produce much higher-quality video, that would be terrific.

By reversing the process feeding video into the network instead of just audio the team could also potentially develop algorithms that could detect whether a video is real or manufactured.

The new machine learning tool makes significant progress in overcoming what's known as the uncanny valley problem, which has dogged efforts to create realistic video from audio. When synthesised human likenesses appear to be almost real but still manage to somehow miss the mark people find them creepy or off-putting.

People are particularly sensitive to any areas of your mouth that don't look realistic, said lead author Supasorn Suwajanakorn, a recent doctoral graduate in the Allen School. If you don't render teeth right or the chin moves at the wrong time, people can spot it right away and it's going to look fake. So you have to render the mouth region perfectly to get beyond the uncanny valley.

A neural network first converts the sounds from an audio file into basic mouth shapes. Then the system grafts and blends those mouth shapes onto an existing target video and adjusts the timing to create a new realistic, lip-synced video.

Previously, audio-to-video conversion processes have involved filming multiple people in a studio saying the same sentences over and over to try to capture how a particular sound correlates to different mouth shapes, which is expensive, tedious and time-consuming. By contrast, Suwajanakorn developed algorithms that can learn from videos that exist in the wild on the internet or elsewhere.

There are millions of hours of video that already exist from interviews, video chats, movies, television programs and other sources. And these deep learning algorithms are very data hungry, so it's a good match to do it this way, Suwajanakorn said.

Rather than synthesising the final video directly from audio, the team tackled the problem in two steps. The first involved training a neural network to watch videos of an individual and translate different audio sounds into basic mouth shapes.

By combining previous research from the UW Graphics and Image Laboratory team with a new mouth synthesis technique, they were then able to realistically superimpose and blend those mouth shapes and textures on an existing reference video of that person. Another key insight was to allow a small time shift to enable the neural network to anticipate what the speaker is going to say next.

The new lip-syncing process enabled the researchers to create realistic videos of Obama speaking in the White House, using words he spoke on a television talk show or during an interview decades ago.

Currently, the neural network is designed to learn on one individual at a time, meaning that Obama's voice speaking words he actually uttered is the only information used to drive the synthesised video. Future steps, however, include helping the algorithms generalise across situations to recognise a person's voice and speech patterns with less data with only an hour of video to learn from, for instance, instead of 14 hours.

The research was funded by Samsung, Google, Facebook, Intel and the UW Animation Research Labs.

A neural network first converts the sounds from an audio file into basic mouth shapes. Then the system grafts and blends those mouth shapes onto an existing target video and adjusts the timing to create a new realistic, lip-synced video.
LINK: http://www.inavateonthenet.net/news/article/university-converts-audio-...
See more stories from teracue

Most recent headlines

04/08/2024

Dalet Appoints Santiago Solanas as CEO to Lead Next Era of Growth and Innovation

Dalet, a leading technology and service provider for media-rich organizations, is excited to announce Santiago Solanas as its new Chief Executive Officer (CEO)....

03/06/2024

Dalet and Veritone Reach Agreement to Distribute, Transact and Monetize Media Archives

Dalet, a leading technology and service provider for media-rich organizations, a...

28/05/2024

OBS Taps Alibaba Cloud for AI-Enhanced MultiCamera Replays at Paris 2024

LONDON Olympic Broadcasting Services recently tested AI-enhanced multcamera replay tech from Alibaba Cloud at the Olympic Qualifier Series in Shanghai in prepar...

28/05/2024

Dune Part 2 and Avatar colourists to take part in DaVinci Resolve Live Tour

The events are for filmmakers, editors, colourists, and visual effects artists, whether theyre beginners and experienced users By Jenny Priestley Published: ...

28/05/2024

Meet the head of sound

1185 Films Mark Hodgkin explains his journey from studying classic guitar and piano to working on the sound of TV adverts, films and documentaries By TVBEurope...

28/05/2024

Alfalite presents its LED displays at InfoComm 2024

Alfalite, the European LED display manufacturer, returns for the second consecutive year to InfoComm with its LED displays for the rental, fixed installation an...

28/05/2024

Leader Electronics Corporation Appoints AV Group Technolo...

Leader Electronics Corporation, globally active innovator of broadcast-quality test and measurement instrumentation, announces the appointment of Sydney-based A...

28/05/2024

Advanced 3D qualifier in DaVinci Resolve

Advanced 3D qualifier in DaVinci Resolve Kasia Jarco May 27, 2024 0 Comments In today's advanced tutorial, I want to show you how and why to use 3...

28/05/2024

Deadline Approaching for 2024 Emerging Leaders Intern Program

Applications for CBC & Leadership Triangle Due May 31 Only a few days remain for HBCU students in the Triangle to apply for the 2024 Emerging Leadership Intern...

28/05/2024

Thales' FlytEDGE digitally remasters the inflight entertainment experience

Facebook Twitter LinkedIn Live personalization for a journey filled with unique experiences Instantly stream favorites and never miss a beat, continue wa...

28/05/2024

AI-backbone for FCAS operational

AI-backbone for FCAS operational The HIS consortium and partners provide BAAINBw and industry with a cross-sectional AI development platform for FCAS (AI-back...

27/05/2024

Experience inspiration. Master challenges. With SIGRAFLEX and SIGRAFINE at ACHEMA 2024

It will soon be that time again: ACHEMA, the worlds most important trade show fo...

27/05/2024

How L3Harris Evolved into Canada's Trusted Tanker Aircraft In-Service Support Provider

L3Harris has been maintaining Canada's CC-150 Polaris fleet for over a decad...

27/05/2024

Tech Lifestyle Influencer Shelby Church Uses Blackmagic Cloud Storage with DaVinci Resolve Studio

Tech Lifestyle Influencer Shelby Church Uses Blackmagic Cloud Storage with DaVin...

27/05/2024

Bridge Technologies Introduce StreamOverview to the VB330

Bridge Technologies Introduce StreamOverview to the VB330 Brie Clayton May 27, 2024 0 Comments Single page diagnostics overview gives first-line engin...

27/05/2024

Intelligent Video Effects from Film Impact

Intelligent Video Effects from Film Impact Colin Smith May 27, 2024 0 Comments Take an incredible trip though the many unbelievable transitions from F...

27/05/2024

Midwest Regional Broadcasters Clinic Announces Agenda

The Midwest Regional Broadcasters Clinic (MRBC) announced its agenda for the clinic being held Tuesday, Sept. 10, and Wednesday, Sept. 11, in Middleton, Wis....

27/05/2024

NVIDIA Scoops Up Wins at COMPUTEX Best Choice Awards

Building on more than a dozen years of stacking wins at the COMPUTEX trade show's annual Best Choice Awards, NVIDIA was today honored with BCAs for its late...

27/05/2024

Live From NCAA Men's Lacrosse National Championship: ESPN Travels Down I-95 to Familiar Lincoln Financial Field

Live From NCAA Men's Lacrosse National Championship: ESPN Travels Down I-95 ...

27/05/2024

Rohde & Schwarz presents its solutions for next generation wide bandgap device test and debug at PCIM Europe

Rohde & Schwarz presents its solutions for next generation wide bandgap device t...

27/05/2024

Hierarchy' Trailer Teases a Dark Scandal and Social Upheaval at Jooshin High School

Back to All News Hierarchy' Trailer Teases a Dark Scandal and Social Uphea...

27/05/2024

SKY Perfect JSAT selects Thales Alenia Space to build a new cutting-edge software-defined satellite JSAT-31

Facebook Twitter LinkedIn Tokyo / Cannes, May 27th 2024 - Asia's large...

26/05/2024

Vizrt to showcase state-of-the-art proAV solutions at Inf...

Vizrt, the leader in real-time graphics and live production solutions for content creators, will be present at InfoComm for the first time since unifying with N...

26/05/2024

Alfredo Valdes Named Noticiero Telemundo Arizona' Meteorologist

Alfredo Valdes has been named meteorologist for Noticiero Telemundo Arizona weekday morning newscasts, which run on KTAZ Phoenix and KHRR Tucson. Both stations ...

26/05/2024

Paramount, Charter Reach Carriage Deal That Includes Linear Networks, TV Stations and Streaming Services

Paramount Global and Charter Communications said they reached a new carriage agr...

26/05/2024

Daytime Emmys To Again Be Hosted by ET's Kevin Frazier, Nischelle Turner

Entertainment Tonight's Kevin Frazier and Nischelle Turner are returning to host the 51st annual Daytime Emmys, CBS and the National Academy of Television A...

25/05/2024

Get to Know This Summer's Filmmakers Through These 12 Sundance Films

(L-R) Writer-director Hannah Pearl Utt and co-writer Jen Tullock star as sisters in Before You Know It, which premiered at the 2019 Sundance Film Festival....

25/05/2024

Study: Digital Media Ad Spend Grew 18% in Q1 24

NEW YORK A new study from Guideline indicates that In Q1 2024, large US advertisers expanded their overall ad spend by 7% compared to the year prior and that di...

25/05/2024

Accedo Helps ITV Expand ITVX to Sony PlayStation 4 and 5

STOCKHOLM Global video solutions provider, Accedo has announced that it worked with ITV in the U.S. to expand the reach of the broadcasters streaming service, I...

25/05/2024

Broadband Forum Celebrates 20th Anniversary of TR-069 Standard

Broadband Forum has announced that it is celebrating the 20-year anniversary of its groundbreaking TR-069 standard that has paved the way for the open standards...

25/05/2024

TV Tech Weekly Tech Wrap-Up

Missed any of our coverage of new products, services and deployments during your busy week? The TV Tech weekly wrap-up provides links to all of our product cove...

25/05/2024

HBO Original Series 30 Coins Season Two Finished with DaVinci Resolve Studio

HBO Original Series 30 Coins Season Two Finished with DaVinci Resolve Studio Brie Clayton May 24, 2024 0 Comments Blackmagic Design announced today th...

25/05/2024

VideoProc Converter AI: Your Answer to Video Format Challenges and Quality Enhancement

VideoProc Converter AI: Your Answer to Video Format Challenges and Quality Enhan...

24/05/2024

The Hives Celebrate 50 Years of Sweden's Global Music Success With Spotify Singles Cover

On April 6, 1974, the Swedish pop quartet ABBA won the Eurovision Song Contest w...

24/05/2024

The U.K. Holds Firm in the Fight for Fair Competition With the DMCC Act, But It's Not Over Yet

For more than a year, the U.K. government has been working to redefine how the i...

24/05/2024

Alone Australia continues to build as it moves towards finale

Alone Australia continues to build as it moves towards finale 23 May, 2024 Media releases The program continues to deliver for SBS with significant uplifts...

24/05/2024

EditShare Introduces Expanded Product Line-Up at BroadcastAsia

EditShare Introduces Expanded Product Line-Up at BroadcastAsia Transforming innovations in workflow, server and delivery from storyboard to screen Boston, MA...

24/05/2024

ZEISS CinCraft Scenario Camera Tracking Now Compatible wi...

Scenario 2.0 introduces pre-calibrated lens templates and the Lens Template Finetuner, increasing flexibility and compability while also saving a great amount o...

24/05/2024

Chyron Unlocks a Complete Newsroom in the Cloud With News...

Based on a long-term, coordinated development effort, Chyron today announced sweeping improvements across its news workflow portfolio that empower broadcasters ...

24/05/2024

Cobalt Expands its Reach into the AV Market with Plans to...

Cobalt Digital, known for its vast array of signal processing products, is strengthening its position in the Pro AV market by exhibiting at InfoComm 2024 for th...

24/05/2024

IHSE USA Earns Coveted Awards at the 2024 NAB Show

HSE USA today announced that the company s JPEG-XS IP Core for KVM and kvm-tec Scalable Pro Line 5K were honored with three awards at this year's NAB Show i...

24/05/2024

Aputure Gears Up for the 2024 Cine Gear Expo

Aputure, creators of LED lighting for filmmakers, is excited to showcase its award-winning lineup of professional lighting solutions at the upcoming Cine Gear E...

24/05/2024

EVERTZAV JOINS GPA GLOBAL PARTNER PROGRAM

EvertzAV (https://av.evertz.com), a division of Evertz, the global leader in providing professional A/V over IP solutions, is proud to announce its partnership ...

24/05/2024

Metropolis Studios Upgrades To Prism Sound Dream ADA-128...

With 25 Prism Sound ADA-8XR multichannel converters already in use across its five studios, the internationally acclaimed Metropolis Studios in London is no str...

24/05/2024

Leader Expands LVB440 IP Analyzer with New Measurement To...

Leader Electronics Corporation, globally active innovator of broadcast-quality test and measurement instrumentation, announces an expansion to the capabilities ...

24/05/2024

Digital Alert Systems and Inovonics Partner on Joint Solu...

Digital Alert Systems, the global leader in emergency communications solutions for video services providers, and broadcast equipment provider Inovonics today an...

24/05/2024

Time in Pixels Updates OmniScope with New Twin Peaks Scope for Professional Colorists and DITs

Time in Pixels Updates OmniScope with New Twin Peaks Scope for Professional Co...

24/05/2024

ZEISS CinCraft Scenario Camera Tracking Now Compatible with Most Popular Lens Brands

ZEISS CinCraft Scenario Camera Tracking Now Compatible with Most Popular Lens Br...

24/05/2024

Inaugural Gnome Opener in Greenville TONIGHT

Tonight's the night! The Greenville Yard Gnomes open their inaugural home season at Guy Smith Stadium! Join the Gnomes for all the fun and excitement of t...

24/05/2024

Boston Conservatory Plays a Leading Role at the 41st Annual Elliot Norton Awards

Boston Conservatory Plays a Leading Role at the 41st Annual Elliot Norton Awards With six wins and eight additional nominations, the Conservatory's theate...