More Than Meets the AI: How GANs Research Is Reshaping Video Conferencing
24/06/2021
Vid2Vid Cameo, one of the deep learning models behind the NVIDIA Maxine software development kit for video conferencing, uses generative adversarial networks (known as GANs) to synthesize realistic talking-head videos using a single 2D image of a person.
To use it, participants submit a reference image - which could be either a real photo of themselves or a cartoon avatar - before joining a video call. During the meeting, the AI model will capture each individual's real-time motion and apply it to the previously uploaded still image.
That means that by uploading a photo of themselves in formal attire, meeting attendees with mussed hair and pajamas can appear on a call in work-appropriate attire, with AI mapping the user's facial movements to the reference photo. If the subject is turned to the left, the technology can adjust the viewpoint so the attendee appears to be directly facing the webcam.
Besides helping meeting attendees look their best, this AI technique also shrinks the bandwidth needed for video conferencing by up to 10x, avoiding jitter and lag. It'll soon be available in the NVIDIA Video Codec SDK as the AI Face Codec.
"Many people have limited internet bandwidth, but still want to have a smooth video call with friends and family," said NVIDIA researcher Ming-Yu Liu, co-author on the project. In addition to helping them, the underlying technology could also be used to assist the work of animators, photo editors and game developers.
Vid2Vid Cameo was presented this week at the prestigious Conference on Computer Vision and Pattern Recognition - one of 28 NVIDIA papers at the virtual event. It's also available on the AI Playground, where anyone can experience our research demos firsthand.
AI Steals the Show
In a nod to classic heist movies (and a hit Netflix show), NVIDIA researchers put their talking-head GAN model through its paces for a virtual meeting. The demo highlights key features of Vid2Vid Cameo, including facial redirection, animated avatars and data compression.
These capabilities are coming soon to the NVIDIA Maxine SDK, which gives developers optimized pretrained models for video, audio and augmented reality effects in video conferencing and live streaming.
Developers can already adopt Maxine AI effects including intelligent noise removal, video upscaling and body pose estimation. The free-to-download SDK can also be paired with the NVIDIA Jarvis platform for conversational AI applications, including transcription and translation.
Hello from the AI Side
Vid2Vid Cameo requires just two elements to create a realistic AI talking head for video conferencing: a single shot of the person's appearance and a video stream that dictates how that image should be animated.
Developed on NVIDIA DGX systems, the model was trained using a dataset of 180,000 high-quality talking head videos. The network learned to identify 20 key points that can be used to model facial motion without human annotations. The points encode the location of features including the eyes, mouth and nose.
It then extracts these key points from a reference image of the caller, which could be sent to other video conference participants ahead of time or re-used from previous meetings. This way, instead of sending bulky live video streams from one participant to the other, video conferencing platforms can simply send data on how the speaker's key facial points are moving.
On the receiver's side, the GAN model uses this information to synthesize a video that mimics the appearance of the reference image.
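To give a feel for how small such a per-frame message can be, here is a minimal Python sketch of a sender packing head pose and key points into bytes and a receiver unpacking them. Only the count of 20 key points comes from the article. The three coordinates per point, the six pose values, the 32-bit float encoding and the names pack_frame and unpack_frame are assumptions made for illustration, not the format Vid2Vid Cameo or Maxine actually uses.

```python
import struct

NUM_KEYPOINTS = 20      # from the article
COORDS_PER_POINT = 3    # assumption: (x, y, z) per key point
POSE_VALUES = 6         # assumption: 3 rotation angles + 3 translation values

# Little-endian 32-bit floats: pose values first, then flattened key points.
FRAME_FORMAT = "<" + "f" * (POSE_VALUES + NUM_KEYPOINTS * COORDS_PER_POINT)


def pack_frame(pose, keypoints):
    """Sender side: flatten one frame's pose and key points into bytes."""
    flat = list(pose) + [c for point in keypoints for c in point]
    return struct.pack(FRAME_FORMAT, *flat)


def unpack_frame(payload):
    """Receiver side: recover the pose and key points from the payload."""
    values = struct.unpack(FRAME_FORMAT, payload)
    pose = values[:POSE_VALUES]
    coords = values[POSE_VALUES:]
    keypoints = [tuple(coords[i:i + COORDS_PER_POINT])
                 for i in range(0, len(coords), COORDS_PER_POINT)]
    return pose, keypoints


# Hypothetical values standing in for one frame of detected motion.
pose = (0.0, 0.25, 0.0, 0.0, 0.0, 0.0)
keypoints = [(i * 0.5, i * 0.25, 0.0) for i in range(NUM_KEYPOINTS)]

payload = pack_frame(pose, keypoints)
print(len(payload), "bytes per frame")
```

Under these assumptions each frame costs 264 bytes before any further compression. The receiver would hand the unpacked values, together with the stored reference image, to the generator.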
By compressing and sending just the head position and key points back and forth, instead of full video streams, this technique can reduce bandwidth needs for video conferences by up to 10x, providing a smoother user experience. The model can be adjusted to transmit a differing number of key points to adapt to different bandwidth environments without compromising visual quality.
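As a rough illustration of adapting the number of key points to the available bandwidth, the Python sketch below computes how many points fit in a per-frame byte budget. The function name keypoints_for_budget, the 12 bytes per key point and the 24 bytes of per-frame overhead are hypothetical values chosen for the example. The article does not describe how the model actually selects the count.

```python
def keypoints_for_budget(budget_bits_per_second, frames_per_second,
                         bytes_per_keypoint=12, overhead_bytes=24,
                         max_keypoints=20):
    """Return how many key points per frame fit in a bandwidth budget.

    All sizes are illustrative assumptions: 12 bytes per key point,
    24 bytes of per-frame overhead (for example head pose), and the
    article's 20 key points as the upper limit.
    """
    bytes_per_frame = budget_bits_per_second / 8 / frames_per_second
    available = bytes_per_frame - overhead_bytes
    if available <= 0:
        return 0  # the budget cannot even cover the per-frame overhead
    return min(max_keypoints, int(available // bytes_per_keypoint))


# Hypothetical link speeds at 30 frames per second.
for bits_per_second in (64_000, 32_000, 5_000):
    print(bits_per_second, keypoints_for_budget(bits_per_second, 30))
```

With these numbers, a 64 kbit/s budget carries all 20 key points, a 32 kbit/s budget carries 9, and a 5 kbit/s budget carries none.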
The viewpoint of the resulting talking head video can also be freely adjusted to show the user from a side profile or straight on, as well as from lower or higher camera angles. This feature could also be applied by photo editors working with still images.
NVIDIA researchers found that Vid2Vid Cameo outperforms state-of-the-art models by producing more realistic and sharper results - whether the reference image and the video are from the same person, or when the AI is tasked with transferring movement from one person onto a reference image of another.
The latter feature can be used to apply the facial motions of a speaker to animate a digital avatar in a video conference, or even lend realistic expression and movement to a video game or cartoon character.
The paper behind Vid2Vid Cameo was authored by NVIDIA researchers Ting-Chun Wang, Arun Mallya and Ming-Yu Liu. The NVIDIA Research team consists of more than 200 scientists around the globe, focusing on areas such as AI, computer vision, self-driving cars, robotics and graphics.
Our thanks to actor Edan Moses, who performed the English voiceover of The Professor on La Casa De Papel/Money Heist on Netflix, for his contribution to the video above featuring our latest AI research.
LINK: | https://blogs.nvidia.com/blog/2021/06/24/vid2vid-cameo-ai-research-vid... |