Sony Pixel Power calrec Sony

More Than Meets the AI: How GANs Research Is Reshaping Video Conferencing

24/06/2021

Roll out of bed, fire up the laptop, turn on the webcam - and look picture-perfect in every video call, with the help of AI developed by NVIDIA researchers.

Vid2Vid Cameo, one of the deep learning models behind the NVIDIA Maxine software development kit for video conferencing, uses generative adversarial networks (known as GANs) to synthesize realistic talking-head videos using a single 2D image of a person.

To use it, participants submit a reference image - which could be either a real photo of themselves or a cartoon avatar - before joining a video call. During the meeting, the AI model will capture each individual's real-time motion and apply it to the previously uploaded still image.

That means that by uploading a photo of themselves in formal attire, meeting attendees with mussed hair and pajamas can appear on a call in work-appropriate attire, with AI mapping the user's facial movements to the reference photo. If the subject is turned to the left, the technology can adjust the viewpoint so the attendee appears to be directly facing the webcam.

Besides helping meeting attendees look their best, this AI technique also shrinks the bandwidth needed for video conferencing by up to 10x, avoiding jitter and lag. It'll soon be available in the NVIDIA Video Codec SDK as the AI Face Codec.

Many people have limited internet bandwidth, but still want to have a smooth video call with friends and family, said NVIDIA researcher Ming-Yu Liu, co-author on the project. In addition to helping them, the underlying technology could also be used to assist the work of animators, photo editors and game developers.

Vid2Vid Cameo was presented this week at the prestigious Conference on Computer Vision and Pattern Recognition - one of 28 NVIDIA papers at the virtual event. It's also available on the AI Playground, where anyone can experience our research demos firsthand.

AI Steals the Show In a nod to classic heist movies (and a hit Netflix show), NVIDIA researchers put their talking-head GAN model through its paces for a virtual meeting. The demo highlights key features of Vid2Vid Cameo, including facial redirection, animated avatars and data compression.

These capabilities are coming soon to the NVIDIA Maxine SDK, which gives developers optimized pretrained models for video, audio and augmented reality effects in video conferencing and live streaming.

Developers can already adopt Maxine AI effects including intelligent noise removal, video upscaling and body pose estimation. The free-to-download SDK can also be paired with the NVIDIA Jarvis platform for conversational AI applications, including transcription and translation.

Hello from the AI Side Vid2Vid Cameo requires just two elements to create a realistic AI talking head for video conferencing: a single shot of the person's appearance and a video stream that dictates how that image should be animated.

Developed on NVIDIA DGX systems, the model was trained using a dataset of 180,000 high-quality talking head videos. The network learned to identify 20 key points that can be used to model facial motion without human annotations. The points encode the location of features including the eyes, mouth and nose.

It then extracts these key points from a reference image of the caller, which could be sent to other video conference participants ahead of time or re-used from previous meetings. This way, instead of sending bulky live video streams from one participant to the other, video conferencing platforms can simply send data on how the speaker's key facial points are moving.

On the receiver's side, the GAN model uses this information to synthesize a video that mimics the appearance of the reference image.

By compressing and sending just the head position and key points back and forth, instead of full video streams, this technique can reduce bandwidth needs for video conferences by 10x, providing a smoother user experience. The model can be adjusted to transmit a differing number of key points to adapt to different bandwidth environments without compromising visual quality.

The viewpoint of the resulting talking head video can also be freely adjusted to show the user from a side profile or straight on, as well as from lower or higher camera angles. This feature could also be applied by photo editors working with still images.

NVIDIA researchers found that Vid2Vid Cameo outperforms state-of-the-art models by producing more realistic and sharper results - whether the reference image and the video are from the same person, or when the AI is tasked with transferring movement from one person onto a reference image of another.

The latter feature can be used to apply the facial motions of a speaker to animate a digital avatar in a video conference, or even lend realistic expression and movement to a video game or cartoon character.

The paper behind Vid2Vid Cameo was authored by NVIDIA researchers Ting-Chun Wang, Arun Mallya and Ming-Yu Liu. The NVIDIA Research team consists of more than 200 scientists around the globe, focusing on areas such as AI, computer vision, self-driving cars, robotics and graphics.

Our thanks to actor Edan Moses, who performed the English voiceover of The Professor on La Casa De Papel/Money Heist on Netflix, for his contribution to the video above featuring our latest AI research.
LINK: https://blogs.nvidia.com/blog/2021/06/24/vid2vid-cameo-ai-research-vid...
See more stories from nvidia

Most recent headlines

04/08/2024

Dalet Appoints Santiago Solanas as CEO to Lead Next Era of Growth and Innovation

Dalet, a leading technology and service provider for media-rich organizations, is excited to announce Santiago Solanas as its new Chief Executive Officer (CEO)....

03/06/2024

Dalet and Veritone Reach Agreement to Distribute, Transact and Monetize Media Archives

Dalet, a leading technology and service provider for media-rich organizations, a...

02/05/2024

Reality Slips Away in the Eerie I Saw the TV Glow

PARK CITY, UTAH - JANUARY 18: Jane Schoenbrun introduces the 2024 Sundance Film Festival I Saw the TV Glow premiere at the Library Center Theatre in Park City...

02/05/2024

Meet the 10 Hampton University Students Receiving Spotify NextGen's Scholarship

Spotify is committed to amplifying the voices of underrepresented groups, and th...

02/05/2024

Advertisers and Creators Come Together at Our First-Ever Spotify Sparks in London

With more than 600 million users around the world tuned in to Spotify, there'...

02/05/2024

Gear up for the 2024 Cycling Grand Tours on Australia's Home of Cycling, SBS

Gear up for the 2024 Cycling Grand Tours on Australia's Home of Cycling, SBS Media releases All of the rivalry begins with the Giro d'Italia this we...

02/05/2024

New SBS documentary dives into one of Australia's greatest underdog stories of all time!

New SBS documentary dives into one of Australia's greatest underdog stories ...

02/05/2024

Alone Australia cements its place as one of Australia's biggest hits of 2024

Alone Australia cements its place as one of Australia's biggest hits of 2024 2 May, 2024 Media releases Alone Australia has cemented its place as one o...

02/05/2024

The Making of All of Us Strangers

I always like to base my lighting choices in reality, says cinematographer Jamie D. Ramsay, SASC. For his recent collaboration with writer-director Andrew Haig...

02/05/2024

Hiltron Introduces Field-Upgradable Motorisation Kit for CPI 2385 Satcom Antenna

Accompanying image shows Hiltron's new HMAM-based field-upgradable motorisation kit for the CPI 2385 satellite communications antenna. Backnang, Germany,...

02/05/2024

Millennium Space Systems Selects L3Harris to Build Space Development Agency Electro-Optical Infrared Payloads

MELBOURNE, Fla., May 2, 2024 - L3Harris Technologies (NYSE:LHX) has received a c...

02/05/2024

Samba TV Launches New Generative AI Ad Solution

NEW YORK Samba TV is debuting new capabilities for Samba AI, the company's suite of generative AI technologies. Samba AI's new capabilities provide a re...

02/05/2024

Television Academy Foundation Names Anne Vasquez Executive Director

LOS ANGELES The Television Academy Foundation has announced the appointment of Anne Vasquez as its executive director effective May 13....

02/05/2024

SAG-AFTRA to License Nielsen Streaming Data

NEW YORK SAG-AFTRA has inked a deal with Nielsen to become its third-party provider of streaming content measurement and has announced that it will use the Niel...

02/05/2024

Cineverse Unveils Public Beta of AI-Powered cineSearch

LOS ANGELES Cineverse has debuted cineSearch, its previously announced new content search and discovery service in public Beta....

02/05/2024

Can AI find a way to reduce the broadcast industry's energy consumption?

OpenDrives Trevor Morgan asks if AI has the answer to making data centres for the media and entertainment industry more sustainable By Contributor Published:...

02/05/2024

Bill Baggelaar Asks the Questions: An Unfiltered Look at Media Supply Chain Transformation

Bill Baggelaar Asks the Questions: An Unfiltered Look at Media Supply Chain Tran...

02/05/2024

Two Boston Conservatory at Berklee Alums Nominated for Tony Awards

Two Boston Conservatory at Berklee Alums Nominated for Tony Awards An additional eight alums and two current students performed in nominated productions. By...

02/05/2024

Viewers Call Finding New TV Content Frustrating' in Comcast Advertising Report

A majority of viewers 51% said the difficulty in finding new content on TV c...

02/05/2024

Programming Legend Art Moore Retiring After 53 Years With ABC Stations

Art Moore, who headed production of long-running syndicated series, including Live, said he plans to retire in September as VP of programming for WABC New York....

02/05/2024

Nexstar Will Move The CW Affiliation to WGN Chicago

Nexstar Media Group, which owns The CW, said the network's affiliation will be moving to Nexstar-owned WGN Chicago....

02/05/2024

Future Today Puts First Original Shows on Fawesome Channel (NewFronts)

Future Today said that it plans to launch the first original shows on its Fawesome streaming channel....

02/05/2024

Behind the Music' Returns on Paramount Plus

Behind the Music is back on Paramount Plus with new episodes May 1. Those profiled in the season two episodes are Bell Biv DeVoe, Trace Adkins and Wolfgang Van ...

02/05/2024

All American: Homecoming' Returns on The CW July 8

All American: Homecoming starts season three on The CW Monday, July 8, while season two of 61st Street kicks off Monday, July 22....

02/05/2024

Jerry Seinfeld's Pop-Tart Movie Starts on Netflix May 3

Unfrosted, a movie about the race to create a game-changing breakfast pastry, such as, say, the Pop-Tart, debuts on Netflix May 3. Jerry Seinfeld directs, his f...

02/05/2024

Former BT Sport COO Jamie Hindhaugh joins EMG / Gravity Media

Charlie Cubbon has also been appointed chief operating officer By Matthew Corrigan Published: May 2, 2024 Charlie Cubbon has also been appointed chief ope...

02/05/2024

Durable Goods Signs Lionel Coleman

Durable Goods Signs Lionel Coleman Brie Clayton May 2, 2024 0 Comments Durable Goods has signed multi-hyphenate director Lionel Coleman for commercial...

02/05/2024

Hitsujibungaku's Music Video GO!!! Shot by Kyotaro Hayashi with Blackmagic Cinema Camera 6K

Hitsujibungaku's Music Video GO!!! Shot by Kyotaro Hayashi with Blackmagic C...

02/05/2024

Diamond Sports RSNs Go Dark On Comcast Systems

Bally Sports Regional Networks were taken off Comcast's systems on April 30 when their existing distribution agreement expired and Diamond Sports Group was ...

02/05/2024

Diamond Sports Group, DirecTV Renew Distribution Deal

SOUTHPORT, Conn. and EL SEGUNDO, Calif. Diamond Sports Group ( Diamond or the Company ) and DirecTV have announced that they have reached a multi-year renewal...

02/05/2024

Three Nexstar Stations to Become CW Affiliates

IRVING, Texas Nexstar Media Group has announced that its owned and operated television stations in Chicago, Illinois (DMA #3), Norfolk, Virginia (DMA #43), and ...

02/05/2024

FCC, FTC Ink Agreement to Cooperate on Net Neutrality Enforcement

WASHINGTON, D.C. The Federal Communications Commission and Federal Trade Commission have signed a Memorandum of Understanding to coordinate consumer protection ...

02/05/2024

Haivision Celebrates 20th Anniversary

MONTREAL Haivision Systems Inc. is marking its 20th anniversary by detailing some of the accomplishments and developments that have helped the company become a ...

02/05/2024

Agora Introduces Adaptive Video Optimization Technology

SANTA CLARA, Calif. Agora today unveiled its Adaptive Video Optimization (AVO) technology that uses machine learning to adjust parameters dynamically at every s...

02/05/2024

Study: Streaming Market Is Saturated But Subscriptions Continue to Grow

NEW YORK Kantar has released a new study showing the U.S. streaming market has hit a saturation point, with the household penetration rate stagnating and at nea...

02/05/2024

Samba TV To Spotlight New Capabilities For Generative AI Ad Solution

NEW YORK Samba TV is debuting new capabilities for Samba AI, the company's suite of generative AI technologies, at the 2024 IAB (Interactive Advertising Bur...

02/05/2024

GSTV Pumps Up Research, Programming at NewFront

GSTV, the network that programs screens at gas stations, will be talking about new research and new programming at its NewFront presentation Wednesday....

02/05/2024

EMG / Gravity Media Taps Jamie Hindhaugh To Head Up UK, US, Australia, and Middle East; Names Charlie Cubbon as COO

EMG / Gravity Media appoints Charlie Cubbon COO and Jamie Hindhaugh regional CEO...

02/05/2024

Watch SVG NEXT Conversations, Ep. 2: How XR and Other Emerging Technologies Are Transforming the Landscape of M&E'

Watch SVG NEXT Conversations, Ep. 2: How XR and Other Emerging Technologies Are...

02/05/2024

SVG Sit-Down: Program Productions' Bob Carzoli, Integrum's Kathy Reiland on the Strategy Behind the New Alliance

SVG Sit-Down: Program Productions' Bob Carzoli, Integrum's Kathy Reiland...

02/05/2024

YES Network App Logs Record Usership With New Expanded Pick-N-Play Live Interactivity, YES Rewards

YES Network App Logs Record Usership With New Expanded Pick-N-Play Live Interact...

02/05/2024

Rohde & Schwarz presents its test solutions at CCW 2024 that enable a successful migration to mission-critical broadband

Rohde & Schwarz presents its test solutions at CCW 2024 that enable a successful...

02/05/2024

Baselight training. Paris. Les Lapins Bleus. 27-31 May 2024

Baselight accredited training partner, Les Lapins Bleus, is conducting a five day Baselight training course in Paris in May. Location: Paris Dates: 27-31 May ...

02/05/2024

Skeem Saam: Wednesday's episode, 1 May 2024 [video]

Skeem Saam: Wednesday's episode, 1 May 2024 [video]Missed an episode of Skeem Saam? No problem! Watch the latest episode of your favourite South African soa...

02/05/2024

Prison Journalism: It's alright not to have

Prison Journalism: It's alright not to haveWesley Leong was incarcerated at the age of 15 in 1996 at Pollsmoor Prison. He is currently part of Restore's...

02/05/2024

Tonight on Smoke and Mirrors: Thandiswa's quest to reclaim Caesar's house intensifies

Tonight on Smoke and Mirrors: Lulu navigates the complexities of her relationshi...

02/05/2024

GeForce NOW Delivers 24 A-May-zing Games This Month

GeForce NOW brings 24 new games for members this month. Ninja Theory's highly anticipated Senua's Saga: Hellblade II will be coming to the cloud soon -...

02/05/2024

NVIDIA AI Microservices for Drug Discovery, Digital Health Now Integrated With AWS

Harnessing optimized AI models for healthcare is easier than ever as NVIDIA NIM,...

02/05/2024

ARRI announces the ALEXA 35 Live - Multicam System

ARRI announces the ALEXA 35 Live - Multicam System posted: 02/05/2024 Arri Alexa 35 Live - Multicam System ARRI announces the ALEXA 35 Live - Multicam Sy...

02/05/2024

The Women': Author shines light on forgotten women in war

The Women': Author shines light on forgotten women in warThe bestselling author of The Nightingale', Kristin Hannah, has outdone herself with her new h...