
NVIDIA researchers are collaborating with academic centers worldwide to advance generative AI, robotics and the natural sciences - and more than a dozen of these projects will be shared at NeurIPS, one of the world's top AI conferences.
Set for Dec. 10-16 in New Orleans, NeurIPS brings together experts in generative AI, machine learning, computer vision and more. Among the innovations NVIDIA Research will present are new techniques for transforming text to images, photos to 3D avatars, and specialized robots into multi-talented machines.
"NVIDIA Research continues to drive progress across the field - including generative AI models that transform text to images or speech, autonomous AI agents that learn new tasks faster, and neural networks that calculate complex physics," said Jan Kautz, vice president of learning and perception research at NVIDIA. "These projects, often done in collaboration with leading minds in academia, will help accelerate developers of virtual worlds, simulations and autonomous machines."
Picture This: Improving Text-to-Image Diffusion Models
Diffusion models have become the most popular type of generative AI models to turn text into realistic imagery. NVIDIA researchers have collaborated with universities on multiple projects advancing diffusion models that will be presented at NeurIPS.
A paper accepted as an oral presentation focuses on improving generative AI models' ability to understand the link between modifier words and main entities in text prompts. While existing text-to-image models asked to depict a yellow tomato and a red lemon may incorrectly generate images of yellow lemons and red tomatoes, the new model analyzes the syntax of a user's prompt, encouraging a bond between an entity and its modifiers to deliver a more faithful visual depiction of the prompt.
SceneScape, a new framework using diffusion models to create long videos of 3D scenes from text prompts, will be presented as a poster. The project combines a text-to-image model with a depth prediction model that helps the videos maintain plausible-looking scenes with consistency between the frames - generating videos of art museums, haunted houses and ice castles (pictured above).
Another poster describes work that improves how text-to-image models generate concepts rarely seen in training data. Attempts to generate such images usually result in low-quality visuals that aren't an exact match to the user's prompt. The new method uses a small set of example images that help the model identify good seeds - random number sequences that guide the AI to generate images from the specified rare classes.
A third poster shows how a text-to-image diffusion model can use the text description of an incomplete point cloud to generate missing parts and create a complete 3D model of the object. This could help complete point cloud data collected by lidar scanners and other depth sensors for robotics and autonomous vehicle AI applications. Collected imagery is often incomplete because objects are scanned from a specific angle - for example, a lidar sensor mounted to a vehicle would only scan one side of each building as the car drives down a street.
Character Development: Advancements in AI Avatars
AI avatars combine multiple generative AI models to create and animate virtual characters, produce text and convert it to speech. Two NVIDIA posters at NeurIPS present new ways to make these tasks more efficient.
One poster describes a new method to turn a single portrait image into a 3D head avatar while capturing details including hairstyles and accessories. Unlike current methods that require multiple images and a time-consuming optimization process, this model achieves high-fidelity 3D reconstruction without additional optimization during inference. The avatars can be animated either with blendshapes, which are 3D mesh representations of different facial expressions, or with a reference video clip where a person's facial expressions and motion are applied to the avatar.
Another poster by NVIDIA researchers and university collaborators advances zero-shot text-to-speech synthesis with P-Flow, a generative AI model that can rapidly synthesize high-quality personalized speech given a three-second reference prompt. P-Flow features better pronunciation, human likeness and speaker similarity compared to recent state-of-the-art counterparts. The model can near-instantly convert text to speech on a single NVIDIA A100 Tensor Core GPU.
Research Breakthroughs in Reinforcement Learning, Robotics
In the fields of reinforcement learning and robotics, NVIDIA researchers will present two posters highlighting innovations that improve the generalizability of AI across different tasks and environments.
The first proposes a framework for developing reinforcement learning algorithms that can adapt to new tasks while avoiding the common pitfalls of gradient bias and data inefficiency. The researchers showed that their method - which features a novel meta-algorithm that can create a robust version of any meta-reinforcement learning model - performed well on multiple benchmark tasks.
The second, by an NVIDIA researcher and university collaborators, tackles the challenge of object manipulation in robotics. Prior AI models that help robotic hands pick up and interact with objects can handle specific shapes but struggle with objects unseen in the training data. The researchers introduce a new framework that estimates how objects across different categories are geometrically alike - such as drawers and pot lids that have similar handles - enabling the model to more quickly generalize to new shapes.
Supercharging Science: AI-Accelerated Physics, Climate, Healthcare
NVIDIA researchers at NeurIPS will also present papers across the natural sciences - covering physics simulations, climate models and AI fo