
Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.
These are among more than 60+ talks, posters and workshops with NVIDIA authors being presented at the NeurIPs conference, taking place this week in New Orleans and next week online.
Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.
AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents, said Jan Kautz, vice president of learning and perception research at NVIDIA. In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations.
Reimagining the Design of Diffusion-Based Generative Models Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models, proposing improvements that can dramatically improve the efficiency and quality of these models.
The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.
Training Generalist AI Agents in a Minecraft-Based Simulation Suite While researchers have long trained autonomous AI agents in video-game environments such as Starcraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.
Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.
As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.
Creating Complex 3D Shapes to Populate Virtual Worlds Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.
Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.
The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.
Improving Inverse Rendering Pipelines With Control Over Materials, Lighting At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.
The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.
The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.
Enhancing Factual Accuracy of Language Models' Generated Text Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.
Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, which is necessary before such models can be deployed for real-world applications.
The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.
There are more th
North America Stories
27/05/2026
Telestream has announced that its Board of Directors has appointed Benjamin Desbois as Chief Executive Officer, effective July 1, 2026. Desbois, currently Teles...
27/05/2026
ESPN garnered 10 awards; NBC's Sunday Night Football received the Outstandin...
27/05/2026
Matrox Video is celebrating its 50th anniversary, marking five decades of operations from its headquarters in Montreal, Canada. Founded in 1976, the company has...
27/05/2026
Major League Baseball has announced a series of initiatives tied to America's Semiquincentennial, including a national marketing campaign, Fourth of July br...
27/05/2026
Advanced Systems Group (ASG) has announced that Brian Gross has joined the company as an Account Manager on its Audio team, based in the Burbank office. He will...
27/05/2026
Nielsen has released new research on soccer fandom ahead of the FIFA World Cup 2...
27/05/2026
ESL FACEIT Group (EFG) has unveiled a new partnership with TikTok to bring broad...
27/05/2026
FIFA's Oscar Sanchez gives a deeper look to how this tournament will be cove...
27/05/2026
The soon-to-be senior from Charlottesville is building her skills in replay, TD, and even creative content for HokieVision and its ACC Network productions
In t...
27/05/2026
FOX Sports' Mike Davies breaks down the vision for this summer's showcas...
27/05/2026
HBS's Paul King, FIFA's Oscar Sanchez preview how the masses at home wil...
27/05/2026
FOX's MLB coverage dominated the night at the 47th Annual Sports Emmy Awards...
27/05/2026
One of the most memorable Postseasons in baseball history would have had no memo...
27/05/2026
NBC's Sunday Night Football is among the most decorated and most watched programs in the history of television. It added to its jam-packed trophy case on Tu...
27/05/2026
The 2026 Sports Emmys marked a watershed moment for Prime Video Sports. After bu...
27/05/2026
With the Opening Match just over two weeks away, the entire sports-production-te...
27/05/2026
The XL Converge 300P radio system emerges with a groundbreaking feature set enhancing the mission-critical communications of public safety, federal and critica...
27/05/2026
Pairing Two47 MCX software with existing LTE networks means tailored system upgrades that can save time, money and lives....
27/05/2026
PAC-3 MSE offers improved range, speed, and maneuverability, making it an effect...
27/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
27/05/2026
Star Trek VFX: Recreating John Knoll's Iconic Warp Stars without a Slitscan ...
27/05/2026
Adventure World Uses Blackmagic Replay for Marine Live
Brie Clayton May 27, 2026
0 Comments
Large screen displays and slow motion replays dynamically ...
27/05/2026
Berklee Alumna and Assistant Professor Olivia P rez-Collellmir to Premiere Origi...
27/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
27/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
27/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
27/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
27/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
27/05/2026
Co-founder Dan Castles to transition to Executive Chair; internal promotion reinforces continuity and long-term growth
Telestream, a global leader in media wor...
27/05/2026
Big Blue Marble today announced that its Nakolos platform is the first end-to-end 5G Broadcast solution worldwide to implement the complete feature set introduc...
27/05/2026
Lightware recently hosted the Girls' Day event in April at its headquarters in Budapest, welcoming students for an interactive introduction to engineering a...
27/05/2026
May 27th, 2026 May 27, 2026
Press Materials Available Here
TRIBECA X Announc...
26/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/05/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
26/05/2026
Cobalt Digital to Showcase End-to-End IPMX Ecosystem at InfoComm 2026, Making ST 2110 Easy for Pro AV
blueCORE standalone processors headline solutions designe...
26/05/2026
Matrox Video today celebrates its 50th anniversary, marking five decades of innovation, engineering excellence, and customer-focused evolution from its headquar...
26/05/2026
ITV Studios, the production arm of the UKs largest commercial broadcaster, has deployed the Cuez live production platform to unify the management of three back-...
26/05/2026
CETA Software releases Morpheus, an AI tool for real-time post-production projec...
26/05/2026
In a move set to redefine motorsports coverage across the Asia Pacific region, Ikegami Electronics announces that Two Wheels Motor Racing Sdn Bhd (TWMR), a lead...
26/05/2026
Back to All News
Netflix Releases Official Trailer and Poster for The Root Of T...
26/05/2026
Back to All News
MED, Production Begins on Netflixs First Medical Drama From Br...
26/05/2026
Back to All News
Netflix Announces Five New Brazilian Productions and Expands I...
26/05/2026
Back to All News
Netflix Celebrates the Release of the New Animated Series Due ...
26/05/2026
Back to All News
Made In New Mexico: Building The Boroughs' From the Ground Up
A photo from The Boroughs.' (Courtesy of Netflix 2026)
Entertainme...
26/05/2026
Smart Production Control. Total Confidence.
Tyngsborough, MA, May 27, 2026 - Broadcast Pix today announced the ONix Pro Control Panel, its most advanced hard...
26/05/2026
The shift to agentic AI creates a new CPU requirement for the AI factory: fast cores, massive memory bandwidth and the ability to sustain high performance when ...
25/05/2026
The former University of Wyoming wrestler is essential in helping the rapidly growing streamer delivers more than 50,000 live events per year
The sports-produc...
25/05/2026
Image courtesy of GA-ASI...