
Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.
These are among more than 60 talks, posters and workshops with NVIDIA authors being presented at the NeurIPS conference, taking place this week in New Orleans and next week online.
Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.
"AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents," said Jan Kautz, vice president of learning and perception research at NVIDIA. "In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations."
Reimagining the Design of Diffusion-Based Generative Models
Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models and proposes changes that can dramatically improve the efficiency and quality of these models.
The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.
Training Generalist AI Agents in a Minecraft-Based Simulation Suite
While researchers have long trained autonomous AI agents in video-game environments such as StarCraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.
Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.
As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.
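One common way to teach a model to associate video clips with their transcripts is a contrastive objective over paired embeddings. The sketch below is a minimal illustration of that general idea, not MineCLIP's actual training code: the function names, the toy two-dimensional embeddings and the temperature value are all assumptions made for this example.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def cross_entropy_on_diagonal(logits):
    """Mean cross-entropy where row i's correct class is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        row_max = max(row)  # subtract the max for numerical stability
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)


def contrastive_loss(video_embeddings, text_embeddings, temperature=0.1):
    """Symmetric contrastive loss over a batch of (video, transcript) pairs.

    Hypothetical helper: video_embeddings[i] and text_embeddings[i] are
    assumed to come from the same clip. The temperature is an assumed value.
    """
    logits = [
        [cosine_similarity(v, t) / temperature for t in text_embeddings]
        for v in video_embeddings
    ]
    transposed = [list(column) for column in zip(*logits)]
    # Average the video-to-text and text-to-video directions.
    return 0.5 * (
        cross_entropy_on_diagonal(logits) + cross_entropy_on_diagonal(transposed)
    )


# Toy embeddings: each clip matches its own transcript in the aligned case.
video = [[1.0, 0.0], [0.0, 1.0]]
aligned_text = [[1.0, 0.0], [0.0, 1.0]]
swapped_text = [[0.0, 1.0], [1.0, 0.0]]

aligned_loss = contrastive_loss(video, aligned_text)
swapped_loss = contrastive_loss(video, swapped_text)
```

In this toy setup the loss is much lower when each clip is paired with its own transcript than when the transcripts are swapped, which is the signal such an objective uses to pull matching footage and narration together.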
Creating Complex 3D Shapes to Populate Virtual Worlds
Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.
Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.
The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.
Improving Inverse Rendering Pipelines With Control Over Materials, Lighting
At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.
The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.
The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.
Enhancing Factual Accuracy of Language Models' Generated Text
Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.
Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, a necessary step before such models can be deployed in real-world applications.
The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.
There are more th