Sony Pixel Power calrec Sony

NVIDIA Wins NeurIPS Awards for Research on Generative AI, Generalist AI Agents

28/11/2022

Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.

These are among more than 60+ talks, posters and workshops with NVIDIA authors being presented at the NeurIPs conference, taking place this week in New Orleans and next week online.

Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.

AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents, said Jan Kautz, vice president of learning and perception research at NVIDIA. In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations.

Reimagining the Design of Diffusion-Based Generative Models Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models, proposing improvements that can dramatically improve the efficiency and quality of these models.

The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.

Training Generalist AI Agents in a Minecraft-Based Simulation Suite While researchers have long trained autonomous AI agents in video-game environments such as Starcraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.

Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.

As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.

Creating Complex 3D Shapes to Populate Virtual Worlds Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.

Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.

The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.

Improving Inverse Rendering Pipelines With Control Over Materials, Lighting At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.

The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.

The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.

Enhancing Factual Accuracy of Language Models' Generated Text Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.

Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, which is necessary before such models can be deployed for real-world applications.

The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.

There are more th
LINK: https://blogs.nvidia.com/blog/2022/11/28/nvidia-neurips-research/...
See more stories from nvidia

North America Stories

18/11/2025

NBA Debuts New Comms Infrastructure and Systems for Referees

NBA Debuts New Comms Infrastructure and Systems for RefereesTwo-phase rollout is intended to improve game flow, enhance officiating accuracyBy Dan Daley, Audio ...

18/11/2025

Platinum White Paper: How Aggreko Delivers Certainty for Broadcasters on the World Stage

Platinum White Paper: How Aggreko Delivers Certainty For Broadcasters On The Wor...

18/11/2025

Kiswe Extends DTC Products With Kiswe Core Cloud-Based Tool for Distributing Content to Any Platform

Kiswe Extends DTC Products With Kiswe Core Cloud-Based Tool for Distributing Con...

18/11/2025

Spanish Basketball Federation Partners with ScorePlay to Power Digital Transformation

Spanish Basketball Federation partners with ScorePlay to power digital transform...

18/11/2025

HBS Selects BBright Encoders and Decoders to Connect ST 2110 Live Production Workflows to the Cloud via SRT

HBS selects BBright encoders and decoders to connect ST 2110 live production wor...

18/11/2025

Ashes to Ashes: Inside TNT Sports' Hybrid and Maverick' Production Plan for Coverage of Cricket's Oldest Rivalry

Ashes to Ashes: Inside TNT Sports' hybrid and maverick' production plan...

18/11/2025

SVG All-Stars: Mimi Fotopoulos, Director, Talent and Production Operations, Tennis Channel

SVG All-Stars: Mimi Fotopoulos, Director, Talent and Production Operations, Tenn...

18/11/2025

Sports and Entertainment Convergence: Productions Share Tech, Infrastructure

SVG LIVE! Conference Explores the Tech Side of Sports and Entertainment ConvergenceLive sports and music productions increasingly share gear and infrastructureB...

18/11/2025

LPGA Ups Its Production Game in 2026 With 50% More Cameras, SSMO's & Drones, and Additional Mics

LPGA Ups Its Production Game in 2026 With 50% More Cameras, SSMO's & Drones,...

18/11/2025

ESPN's Dunk the Halls Real-Time Animated NBA Game Set to Return for Christmas Day

ESPN's Dunk the Halls Real-Time Animated NBA Game Set to Return for Christma...

18/11/2025

L3Harris Unveils Sovereign Software Router for Australia

CANBERRA, Australia, Nov. 18, 2025 - L3Harris Technologies (NYSE: LHX) has launched a new, next-generation software device called NETCASTER - the NETwork Contr...

18/11/2025

L3Harris and EDGE Group to Collaborate on Defense Technology Programs in UAE

Pictured are L3Harris ISR President Jason Lambert and Waleid Al Mesmari, EDGE President-Space & Cyber Technologies, signing with Carlo Igniades, L3Harris Region...

18/11/2025

The Gauge: Mexico October 2025

During October, streaming's share of TV viewing in Mexico settled at 23.7%, a marginal shift of -0.8 share points from the previous month. Disclaimer: YUMI...

18/11/2025

Nielsen's The Gauge: NFL Viewership Underscores How Sports Are Redefining Audience Behavior

Broadcast Builds On Lead Over Cable, Driven by Football and Drama Programming Ga...

18/11/2025

Scripps Sends Mixed Messages in Response to Possible Sinclair Deal

CINCINNATI E.W. Scripps has issued a statement responding to news that Sinclair has acquired approximately 8.2% of the outstanding class A (non-voting) shares o...

18/11/2025

Tegna Shareholders Approve Nexstar Merger

TYSONS, Va. Tegna has announced that its shareholders have voted overwhelmingly to approve the proposed $6.2 billion merger with Nexstar Media....

18/11/2025

Popularity of Online Short Form Content Moving Beyond Social Media

NEW YORK Short-form vertical video has exploded across platforms like TikTok, Instagram, and YouTube, but a new survey commissioned by Media.net, a provider of ...

18/11/2025

DJI Introduces Osmo Action 6 Camera With Variable Aperture

SHENZHEN, China DJI today unveiled the Osmo Action 6, an all-in-one action camera featuring a variable aperture with a range from f/2.0 to f/4.0, the company...

18/11/2025

Veteran Production Sound Mixer Tony Johnson Captures the...

In the high-stakes world of film sound, there's no room for second chances. When an actor whispers a line with the weight of an entire scene, or lets loose ...

18/11/2025

Sonnet Announces Black Friday Sale on Select Products

Sonnet Technologies is having a sale on a selection of popular products, including a Thunderbolt 4 dock, an eGPU chassis, SATA and M.2 SSD PCIe cards, and mor...

18/11/2025

4Fangs Launches Horror and Suspense FAST Channel on Amagi...

Amagi, a cloud-based SaaS technology solutions provider for broadcast and streaming TV, today announced the launch of 4Fangs, a new Free Ad-Supported Streaming ...

18/11/2025

WIM 8th Annual Holiday Toast to Honor Mandy Walker ASC AM...

Women In Media (WiM) announces its honorees for the 2025 Holiday Toast. The annual celebration recognizes legendary creatives whose work uplifts and inspires th...

18/11/2025

Unforgettable Cinematography of Adam Newport-Berra with Z...

Cinematographer Adam Newport-Berra ( Good Fortune , The Bear , Euphoria ) emerged from the 2025 Emmy season with a statuette celebrating his Outstanding Cinem...

18/11/2025

Thomas Secures - Friends and Lovers - Moving Lights Usin...

Atlanta-based gaffer and lighting programmer Quinton Thomas brings the same practical, problem-solving instinct of a board op to every set he walks onto. With a...

18/11/2025

Suitelife Systems Appoints Veterans Charles Sotto and Reb...

Suitelife Systems, a division of NFB Consulting Group, headquartered in California, has announced the appointments of Charles Sotto as Director of Media Technol...

18/11/2025

Scott Kramer Turns to NUGEN Audio to Bridge Creativity an...

Sound Director and Audio Product Manager Scott Kramer has built a career around shaping stories through audio, guiding projects across film, television and stre...

18/11/2025

iWedia Strengthens Long-Term Operator Partnerships and Cu...

iWedia, a global leader in software solutions for connected TV devices, announces its participation at the APAC TV Summit 2025, taking place November 18 20 in B...

18/11/2025

MRMC Introduces Flair Bridge - Simplifying Robot Control...

Mark Roberts Motion Control (MRMC) is proud to announce the launch of Flair Bridge, a groundbreaking solution that redefines how operators control MRMC's in...

18/11/2025

LiveU Delivers a Solid Connection to Groovy Gecko for Dra...

Over the summer, world famous musician and rapper Drake continued the rollout of his new album Iceman, by live streaming the second and third episodes of a seri...

18/11/2025

Oregon Municipal Channel Upgrades Visuals With Telycam PTZs

KEIZER, Ore. Municipal broadcaster Keizer City Television, K23 TV, has deployed four Telycam PTZ cameras to upgrade the visual quality of its live meeting cover...

18/11/2025

Peacock to Stream EA Sports Madden NFL Cast on Thanksgiving

STAMFORD, Conn. NBC Sports and Peacock have announced that they are working for the second consecutive year with the National Football League, EA Sports and Gen...

18/11/2025

Research: Netflix Boosts Viewing With Familiar Kids Franchises

LONDON A new Ampere Analysis study finds that familiar franchises are successfully driving kids' TV consumption on Netflix and that the streamers big bet on...

18/11/2025

Content Discovery Still a Challenge for Streamers: Hub Study

PORTSMOUTH, N.H. TV viewers continue to find it challenging to find relevant programs in the fragmented universe of streaming content, according to new findings...

18/11/2025

20 Bob Dylan Songs That Reflect a Legacy of A-Changin

20 Bob Dylan Songs That Reflect a Legacy of A-Changin On the heels of Bob Dylan receiving a Berklee honorary doctorate, we take stock of one of the most singu...

18/11/2025

Microsoft, NVIDIA and Anthropic Announce Strategic Partnerships

Today, Microsoft, NVIDIA and Anthropic announced new strategic partnerships. Anthropic is scaling its rapidly growing Claude AI model on Microsoft Azure, powere...

18/11/2025

Delivering AI-Ready Enterprise Data With GPU-Accelerated AI Storage

AI agents have the potential to become indispensable tools for automating complex tasks. But bringing agents to production remains challenging. According to Ga...

17/11/2025

EA SPORTS Madden NFL Cast to Return Thanksgiving Night With Immersive, Data-Driven Broadcast on Peacock

EA SPORTS Madden NFL Cast to Return Thanksgiving Night With Immersive, Data-Driv...

17/11/2025

Behind the Broadcast Booth: Impact Ventures' Greer Christian on Building Her Career Around Passion

Behind the Broadcast Booth: Impact Ventures' Greer Christian on Building Her...

17/11/2025

SVG Sit-Down: Sportradar's Brian Josephs Talks Peacock's Powerful Performance View - and Just Telling Better Sports Stories

SVG Sit-Down: Sportradar's Brian Josephs Talks Peacock's Powerful Perfor...

17/11/2025

2025 Sports Broadcasting Hall of Fame: Greg Gumbel, Iconic Voice and Comforting Presence

2025 Sports Broadcasting Hall of Fame: Greg Gumbel, Iconic Voice and Comforting ...

17/11/2025

ESPN, Pixar, the NFL, and Beyond Sports Team Up for ESPN's Dec. 8 Monsters, Inc.' MNF' Altcast Effort

ESPN, Pixar, the NFL, and Beyond Sports Team Up for ESPN's Dec. 8 Monsters,...

17/11/2025

SVG Sit-Down: EA SPORTS' Evan Dexter on How the EA SPORTS Madden NFL Cast' Blends Gaming and Live NFL Production

SVG Sit-Down: EA SPORTS' Evan Dexter on How the EA SPORTS Madden NFL Cast&#...

17/11/2025

Sinclair Acquires 8% Stake in E.W. Scripps

Dealmaking for broadcast stations continues to heat up, with Sinclair reporting it has built up a 8.2% stake in E.W. Scripps and has been talking with Scripps f...

17/11/2025

ITU: 6 Billion Now Online Globally

The worlds online population grew by more than 240 million people in 2025, according to Facts and Figures 2025 released today by the International Telecommunica...

17/11/2025

ASB GlassFloor Launches an Elite Training Facility in Par...

ASB GlassFloor, the leading provider in LED sports flooring, today announced the opening of Athletes Lab 2.0, a premier athletic training facility designed to e...

17/11/2025

SmallHD Unlocks Camera Control for Canon

SmallHD today announced the release of its popular Camera Control for Canon cinema cameras. This new integration enables filmmakers to adjust critical camera se...

17/11/2025

Matthews Helps Samuel and Sons Bring Luxury Trim to Life...

Luxury is in the details, and no one captures that better than the creative team at Samuel & Sons. As the visionaries behind iconic photography and advertising ...

17/11/2025

Supercharge Your Workforce with WideOrbit AI Agents

By Brian Thoman, Chief Technology Officer, WideOrbit Artificial intelligence (AI) is reshaping how media organizations operate, automating routine tasks, optimi...

17/11/2025

One Giant Leap for AI Physics: NVIDIA Apollo Unveiled as Open Model Family for Scientific Simulation

NVIDIA Apollo - a family of open models for accelerating industrial and computat...