Sony Pixel Power calrec Sony

More Than Meets the AI: How GANs Research Is Reshaping Video Conferencing

24/06/2021

Roll out of bed, fire up the laptop, turn on the webcam - and look picture-perfect in every video call, with the help of AI developed by NVIDIA researchers.

Vid2Vid Cameo, one of the deep learning models behind the NVIDIA Maxine software development kit for video conferencing, uses generative adversarial networks (known as GANs) to synthesize realistic talking-head videos using a single 2D image of a person.

To use it, participants submit a reference image - which could be either a real photo of themselves or a cartoon avatar - before joining a video call. During the meeting, the AI model will capture each individual's real-time motion and apply it to the previously uploaded still image.

That means that by uploading a photo of themselves in formal attire, meeting attendees with mussed hair and pajamas can appear on a call in work-appropriate attire, with AI mapping the user's facial movements to the reference photo. If the subject is turned to the left, the technology can adjust the viewpoint so the attendee appears to be directly facing the webcam.

Besides helping meeting attendees look their best, this AI technique also shrinks the bandwidth needed for video conferencing by up to 10x, avoiding jitter and lag. It'll soon be available in the NVIDIA Video Codec SDK as the AI Face Codec.

Many people have limited internet bandwidth, but still want to have a smooth video call with friends and family, said NVIDIA researcher Ming-Yu Liu, co-author on the project. In addition to helping them, the underlying technology could also be used to assist the work of animators, photo editors and game developers.

Vid2Vid Cameo was presented this week at the prestigious Conference on Computer Vision and Pattern Recognition - one of 28 NVIDIA papers at the virtual event. It's also available on the AI Playground, where anyone can experience our research demos firsthand.

AI Steals the Show In a nod to classic heist movies (and a hit Netflix show), NVIDIA researchers put their talking-head GAN model through its paces for a virtual meeting. The demo highlights key features of Vid2Vid Cameo, including facial redirection, animated avatars and data compression.

These capabilities are coming soon to the NVIDIA Maxine SDK, which gives developers optimized pretrained models for video, audio and augmented reality effects in video conferencing and live streaming.

Developers can already adopt Maxine AI effects including intelligent noise removal, video upscaling and body pose estimation. The free-to-download SDK can also be paired with the NVIDIA Jarvis platform for conversational AI applications, including transcription and translation.

Hello from the AI Side Vid2Vid Cameo requires just two elements to create a realistic AI talking head for video conferencing: a single shot of the person's appearance and a video stream that dictates how that image should be animated.

Developed on NVIDIA DGX systems, the model was trained using a dataset of 180,000 high-quality talking head videos. The network learned to identify 20 key points that can be used to model facial motion without human annotations. The points encode the location of features including the eyes, mouth and nose.

It then extracts these key points from a reference image of the caller, which could be sent to other video conference participants ahead of time or re-used from previous meetings. This way, instead of sending bulky live video streams from one participant to the other, video conferencing platforms can simply send data on how the speaker's key facial points are moving.

On the receiver's side, the GAN model uses this information to synthesize a video that mimics the appearance of the reference image.

By compressing and sending just the head position and key points back and forth, instead of full video streams, this technique can reduce bandwidth needs for video conferences by 10x, providing a smoother user experience. The model can be adjusted to transmit a differing number of key points to adapt to different bandwidth environments without compromising visual quality.

The viewpoint of the resulting talking head video can also be freely adjusted to show the user from a side profile or straight on, as well as from lower or higher camera angles. This feature could also be applied by photo editors working with still images.

NVIDIA researchers found that Vid2Vid Cameo outperforms state-of-the-art models by producing more realistic and sharper results - whether the reference image and the video are from the same person, or when the AI is tasked with transferring movement from one person onto a reference image of another.

The latter feature can be used to apply the facial motions of a speaker to animate a digital avatar in a video conference, or even lend realistic expression and movement to a video game or cartoon character.

The paper behind Vid2Vid Cameo was authored by NVIDIA researchers Ting-Chun Wang, Arun Mallya and Ming-Yu Liu. The NVIDIA Research team consists of more than 200 scientists around the globe, focusing on areas such as AI, computer vision, self-driving cars, robotics and graphics.

Our thanks to actor Edan Moses, who performed the English voiceover of The Professor on La Casa De Papel/Money Heist on Netflix, for his contribution to the video above featuring our latest AI research.
LINK: https://blogs.nvidia.com/blog/2021/06/24/vid2vid-cameo-ai-research-vid...
See more stories from nvidia

Most recent headlines

23/12/2025

Nielsen, Roku Expand Measurement Partnership

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

23/12/2025

PwC: Streaming Market Shifting to 'Scale and Sustainability'

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

23/12/2025

Inside the Gray Innovation Lab

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

23/12/2025

ESPN Renews Deal for Heisman Trophy Coverage

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

23/12/2025

Gray Media to Acquire WBBJ from Bahakel Communications

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

23/12/2025

Taking the Stage at Carnegie Hall-On a Global Scale

Taking the Stage at Carnegie Hall-On a Global Scale Boston Conservatory Orchestra students reflect on their epic concert marking the 80th session of the UN Gene...

22/12/2025

SVG New Sponsor Spotlight: Presidio's Neerav Shah on the Role of Its Captivate and Resonate Platforms in Sports Production

SVG New Sponsor Spotlight: Presidio's Neerav Shah on the Role of Its Captiva...

22/12/2025

Hitting the Bullseye: Sky Sports Readies Itself for the Biggest PDC World Darts Championship to Hit Ally Pally Yet

Hitting the bullseye: Sky Sports readies itself for the biggest PDC World Darts ...

22/12/2025

Unique Skillset: Bringing New Directors to the World of Darts at The Worlds with Sky Sports

Unique skillset: Bringing new directors to the world of darts at The Worlds with...

22/12/2025

Gravity Media Prepares for a Flight of Fancy With the PDC World Darts Championship 2025 for Sky Sports

Gravity Media prepares for a flight of fancy with the PDC World Darts Championsh...

22/12/2025

One Hundred and Eighty: Gravity Media on Hitting the Production Bullseye at the World Darts Championship 2025

One hundred and eighty: Gravity Media on hitting the production bullseye at the ...

22/12/2025

The Famous Group's Jon Slusser on Fascinating Fans Through Immersive Content Experiences

The Famous Group's Jon Slusser on Fascinating Fans Through Immersive Content...

22/12/2025

ESPN's Meg Aronowitz on Continuing High-Quality Broadcasts of Collegiate Sports, Expanding Growth of Internal Production Team

ESPN's Meg Aronowitz on Continuing High-Quality Broadcasts of Collegiate Spo...

22/12/2025

ESPN Takes Data-Driven Storytelling to New Heights with MNF Playbook with Next Gen Stats' NFL Altcasts

ESPN Takes Data-Driven Storytelling to New Heights with MNF Playbook with Next ...

22/12/2025

A Decade of Giving: Fest & Flauschig' Christmas Circus Celebrates Record Turnout and Generosity

For a decade, popular German podcast Fest & Flauschig has hosted an annual Chris...

22/12/2025

Paramount and Netflix Boast Double-Digit Gains in Nielsen's November Media Distributor Gauge

Paramount Scores Largest Share Increase Among Distributors as Paramount and CBS...

22/12/2025

Nielsen and Roku Expand Strategic Measurement Partnership

New multi-year deal integrates Roku's data to fuel Nielsen's measurement suite Roku gains access to Nielsen's streaming ratings, showing The Roku C...

22/12/2025

Allen Media Group to Deploy Infillion TrueX for Streaming Services

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

22/12/2025

Berklee Wrapped 2025: Our Top News and Stories

Berklee Wrapped 2025: Our Top News and Stories A look back at a year highlighted by faculty milestones, major film and television projects, Bob Dylan's ho...

22/12/2025

Marine Biological Laboratory Explores Human Memory With AI and Virtual Reality

The works of Plato state that when humans have an experience, some level of change occurs in their brain, which is powered by memory - specifically long-term me...

22/12/2025

Space42 and LatConnect 60 Expand Access to Advanced Geospatial Intelligence

Partnership integrates complementary satellite data and AI analytics to enhance security, infrastructure, and environmental monitoring solutions for global cust...

22/12/2025

Simplify Playlist Management with Workflows in WO Automation for Radio

Workflows allow you to create a sequence of planned events which may be added to your template(s) or inserted directly into your sequential or background playli...

22/12/2025

Sky extends PGA TOUR partnership until 2029, as Sky Sports remains the unrivalled home for golf fans in the UK and Ireland

Monday 22 December 2025 Sky extends PGA TOUR partnership until 2029, as Sky Spo...

22/12/2025

Global Anime Hits and New Releases Take Center Stage at Jump Festa 2026

Back to All News Global Anime Hits and New Releases Take Center Stage at Jump Festa 2026 Entertainment 22 December 2025 GlobalJapan Link copied to clipboar...

22/12/2025

Christmas with Oliver Callan on RT

Siobh n McSweeney, Rory McIlroy, Elon Musk, Catherine Connolly, Jim Gavin, Ivan Yates and Traitor Paudie Moloney lead new characters for Callan Kicks the Year 2...

22/12/2025

Monaghan's McKenna family crowned Ireland's Fittest Family 2025

Winner announced in the picturesque surroundings of Wicklow's Avondale Tower and Treetop Walk Andrew Trimble wins the show in his first series as coach Th...

22/12/2025

RT lyric fm Choirs for Christmas 2025 Winners Announced

The 2025 winners have been announced today, Sunday 21 December, for Ireland's largest choral competition Choirs for Christmas hosted by RT lyric fm. Ove...

21/12/2025

Legoshi and Haru's Story Reaches Its Finale: BEASTARS Final Season Part 2' Premieres March 2026, Main Trailer Out Now

Back to All News Legoshi and Haru's Story Reaches Its Finale: BEASTARS Fin...

21/12/2025

Rory McIlroy caps stellar year by winning the RT Sport Sportsperson of the Year 2025

John Shortt named Young Sportsperson of the Year Kerry are the Team of the Year ...

20/12/2025

Atomos Updates Ninja TX GO-Ninja TX With ProRes RAW and C...

Atomos announced the immediate availability of a new firmware update for its Ninja TX GO and Ninja TX monitor-recorders, unlocking ProRes RAW recording from the...

20/12/2025

CJP Broadcast Completes Digitisation of European Gymnasti...

CJP Broadcast has completed the digitisation of the European Gymnastics tape archive, converting 328 tapes containing more than forty years of recorded material...

20/12/2025

Bitmovin Launches Stream Lab MCP Server

Bitmovin, the leading provider of video streaming solutions, today announced the launch of the Stream Lab MCP Server, to give AI agents and large language model...

20/12/2025

Gracenote Unveils New Immersive Features for Sports Hubs

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

20/12/2025

Morgan Murphy Media Promotes Jill Shiroma to VP of Digital Strategy

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

20/12/2025

Samsung Makes GameBreaks Ad Format Available Programmatically

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

20/12/2025

FCC Extends Deadline for Comments on Upper C-Band Proposals

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

20/12/2025

Scripps Sports Inks Deals for Soccer and Professional Cheerleading

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

20/12/2025

Atomos Unveils Firmware Update For Ninja TX and TX GO

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

20/12/2025

Barack Obama Includes Laufey on His 2025 Favorite Music List

Barack Obama Includes Laufey on His 2025 Favorite Music List The former presidents roundup of books, music, and movies includes a song from the Berklee alums ...

20/12/2025

December 19, 2025

Study reveals a key hormonal circuit in the kidneys Scripps Research scientists identify the protein that helps kidney cells regulate renin, providing foundatio...

19/12/2025

With Playout Release 2025.4, ToolsOnAir continues to push professional playout workflows forward on macOS.

With Playout Release 2025.4, ToolsOnAir continues to push professional playout w...

19/12/2025

SVG Sit-Down: Diversified's Jared Timmins on AI for Broadcast Sports and Creating the Smart Venue'

SVG Sit-Down: Diversified's Jared Timmins on AI for Broadcast Sports and Cre...

19/12/2025

2025 SVG Summit Audio Recap: Say What?

2025 SVG Summit Audio Recap: Say What?The Audio Production and Distribution Workshop at the SVG Summit 20 took on issues including speech intelligibility, Next-...

19/12/2025

Gamified Fun: Channel 5 on its NFL Big Game Night Ambitions with Hungry Bear Media

Gamified fun: Channel 5 on its NFL Big Game Night ambitions with Hungry Bear Med...

19/12/2025

College Football Playoff Preview: For ESPN, Round 1 is a Fantastic Yet Familiar Saturday of Production

College Football Playoff Preview: For ESPN, Round 1 is a Fantastic Yet Familia...

19/12/2025

AWS's Jason Dvorkin on Developing Partnerships With the NBA and PGA Tour, Embracing the Use of Agentic AI

AWS's Jason Dvorkin on Developing Partnerships With the NBA and PGA Tour, Em...

19/12/2025

Netflix Kicks Off Packed Sports Week with Paul-Joshua Fight Before Shifting to NFL Christmas Doubleheader

Netflix Kicks Off Packed Sports Week with Paul-Joshua Fight Before Shifting to N...

19/12/2025

SVG New Sponsor Spotlight: Presidio's Nareev Shah on the Role of Its Captivate and Resonate Platforms in Sports Production

SVG New Sponsor Spotlight: Presidio's Nareev Shah on the Role of Its Captiva...

19/12/2025

US Marine Corps Increases Affordable Mass Demonstrations with Successful Red Wolf Low-Altitude Live Fire

Mounted to the pylon of an AH-1Z Viper helicopter, a Red Wolf vehicle successful...