Sony Pixel Power calrec Sony

NVIDIA Wins NeurIPS Awards for Research on Generative AI, Generalist AI Agents

28/11/2022

Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.

These are among more than 60+ talks, posters and workshops with NVIDIA authors being presented at the NeurIPs conference, taking place this week in New Orleans and next week online.

Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.

AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents, said Jan Kautz, vice president of learning and perception research at NVIDIA. In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations.

Reimagining the Design of Diffusion-Based Generative Models Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models, proposing improvements that can dramatically improve the efficiency and quality of these models.

The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.

Training Generalist AI Agents in a Minecraft-Based Simulation Suite While researchers have long trained autonomous AI agents in video-game environments such as Starcraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.

Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.

As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.

Creating Complex 3D Shapes to Populate Virtual Worlds Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.

Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.

The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.

Improving Inverse Rendering Pipelines With Control Over Materials, Lighting At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.

The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.

The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.

Enhancing Factual Accuracy of Language Models' Generated Text Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.

Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, which is necessary before such models can be deployed for real-world applications.

The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.

There are more th
LINK: https://blogs.nvidia.com/blog/2022/11/28/nvidia-neurips-research/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

20/04/2026

Live From NAB 2026: Sonys Hugo Gaggioni Highlights HDR Advances, Software-Defined Workflows

At the 2026 NAB Show, Sony is showcasing a broad slate of innovations across liv...

20/04/2026

Live From NAB 2026: Fujinons Stosh Durbacz on Expanding the 4K Broadcast Lens Lineup With New Portable Zooms, 94x Box Lens

Fujifilm is sharpening its focus on core broadcast production with a new wave of...

20/04/2026

Live From NAB 2026: Rock-It Sports' John Walberg on Powering Logistics, Shipping for the 2026 FIFA Men's World Cup

This upcoming summer in North America is going to be a busy one. The 2026 FIFA M...

20/04/2026

NAB 2026: Glookast outlines product updates including Media Producer UX, connectors and Premiere Pro panel

Glookast (Booth W1661) announced a series of product updates at NAB Show 2026, c...

20/04/2026

NAB 2026: Matrox Video and Amagi collaborate on cloud-based broadcast workflows using ORIGIN framework

Matrox Video and Amagi announced a collaboration to integrate the Matrox ORIGIN ...

20/04/2026

NAB 2026: Riedel SimplyLive supports expanded centralised VAR system for Argentina football league

Riedel Communications (Booth C4908) announced that the Asociaci n del F tbol Arg...

20/04/2026

NAB 2026: Ikegami introduces VFE-P07D OLED viewfinder with integrated LCD monitor

Ikegami (Booth C3819) announced the VFE-P07D monocular OLED viewfinder at NAB Sh...

20/04/2026

NAB 2026: IABM rebrands as IAMT and launches AI discovery platform and global alliance

International Association of MediaTech (IAMT), formerly known as IABM, announced...

20/04/2026

NAB 2026: Harmonic supports DIRECTV DTH platform upgrade with VOS Media Software

Harmonic (Booth W2831) announced that DIRECTV is updating its US direct-to-home (DTH) video platform using Harmonic's VOS Media Software. The deployment is...

20/04/2026

NAB 2026: Wasabi Technologies acquires Seagate Lyve Cloud business

Wasabi Technologies announced that it has acquired the Lyve Cloud business from Seagate Technology. As part of the agreement, Seagate received equity in Wasabi ...

20/04/2026

NAB 2026: EVS introduces Choreon robotics orchestration platform for unified production control

EVS (Booth N1841) has launched Choreon, a robotics controller for media producti...

20/04/2026

SportsTechBuzz at NAB 2026, Day 2: Live Reports From the Show Floor in Vegas

The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...

20/04/2026

NAB 2026: Skyline Communications launches DataMiner packages on Grass Valley AMPP App Store

Skyline Communications announced the availability of its DataMiner xOps platform...

20/04/2026

NAB 2026: SNS launches Outpost, Trio and AI Suite for connected post-production workflows

Studio Network Solutions (Booth N1129) introduced a set of new products at NAB S...

20/04/2026

NAB 2026: Dell Technologies and NVIDIA present AI data platform for media workflows

Dell Technologies is showcasing its Dell AI Data Platform with NVIDIA at NAB Sho...

20/04/2026

NAB 2026: Blackmagic Design Announces Fairlight Live Software Audio Mixer

Blackmagic Design has announced Fairlight Live, a software-based live audio mixer with SMPTE 2110 support and spatial audio mixing. A public beta is available n...

20/04/2026

Live From NAB 2026: Imagine Comms Jimbo Haneklau Talks Prismon, Hybrid IP/SDI Workflows, and Cloud Playout

At the 2026 NAB Show in Las Vegas, Imagine Communications VP of Sales, Sports an...

20/04/2026

Live From NAB 2026: LiveUs Phillip Broaddus on LU900Q Launch, Nexus Cloud Platform, and REMI Growth

At the 2026 NAB Show in Las Vegas, LiveU Senior Director of Sales, Sports Philli...

20/04/2026

3 New Ways to Dive Deeper Into the Music You Love

A song that perfectly captures a moment is magic. But when you uncover the story behind it, who made it, what inspired it, and the meaning woven into the lyrics...

20/04/2026

Deity Microphones announce the PR-4

Ultra-compact 32-bit recorder set for launch Deity Microphones will soon be launching a new 32-bit six-track recorder that's been designed with producti...

20/04/2026

Lectrosonics preview the S1

Uncoming lightweight shotgun mic announced Production-sound experts Lectrosonics have recently announced the upcoming launch of a new lightweight shotgun mi...

20/04/2026

The story of Focusrite ISA

New 20-minute documentary explores iconic preamp In 2025, Focusrite commissioned a new short-form documentary with filmmaker Chris Mayes-Wright - the direct...

20/04/2026

Sampleson release Boomcha

Turn quick sketches into real drum grooves Sampleson have been experimenting with assitive production tools recently, and their latest creation aims to make...

20/04/2026

Rohde & Schwarz rolls out its full ARDRONIS counter UAS suite in a demonstration van at Counter UAS Technology Europe 2026

Rohde & Schwarz rolls out its full ARDRONIS counter UAS suite in a demonstration...

20/04/2026

Protecting America's Shores: L3Harris Keeps the Coast Guard Mission-Ready

L3Harris delivers integrated communications, navigation and C4ISR capabilities that empower the U.S. Coast Guard to protect Americas maritime interests and resp...

20/04/2026

Google Cloud Embraces the Rise of Agentic Production

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Creators Go All in on AI, Niche Content

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

NBC Sports' Jon Miller: Broadcast Is Having a Moment'

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Beyond the Lift and Shift': Cloud Migration's New Mandate

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Virtual Production Finds Its Footing

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Corporate Creators: All Companies Are Media Companies Now

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

IABM Rebrands as the International Association of MediaTech

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

CBS Detroit Debuts New AR/VR Technology-Driven Studio

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Fox Sports Taps Appear X Platform for Remote Production

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

CueScript and Lighting Design Group Expand Customer Oppor...

CueScript and Lighting Design Group Expand Customer Opportunities Through New Partnership Find both companies at 2026 NAB Show in CueScript Booth # C 4720 ...

20/04/2026

Layercake Deepens Bitmovin Integration to Power End-to-En...

[Sydney, NSW, 20 April 2026] - Layercake, the company behind the intelligent media orchestration platform Streamcake, today announced the formalisation of its i...

20/04/2026

FOX Sports selects Appear X Platform for next-generation...

Deployment spans FOX Sports' REMI infrastructure, IP production for a major global soccer event, and its Jewel Events production systems Appear, a global l...

20/04/2026

Pro Sound Effects Launches the Industry's First and Only Native Sound Effects Integration for Avid Media Composer at NAB 2026

Pro Sound Effects Launches the Industry's First and Only Native Sound Effect...

20/04/2026

SBE Elevates Fred Willard to SBE Fellow

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Blackmagic Design Announces Blackmagic Camera for iOS 3.3 Update

Blackmagic Design Announces Blackmagic Camera for iOS 3.3 Update Brie Clayton April 20, 2026 0 Comments New update adds camera control and monitoring ...

20/04/2026

Maxon Announces Free Tools and Mobile Expansion of ZBrush and Cinema 4D

Maxon Announces Free Tools and Mobile Expansion of ZBrush and Cinema 4D Brie Clayton April 20, 2026 0 Comments Cinema 4D brings professional 3D workfl...

20/04/2026

Vizrt AI Keyer kills the green screen and creates virtual scenes in any environment

Vizrt AI Keyer kills the green screen and creates virtual scenes in any environm...

20/04/2026

Register now - Market & Audience Department Ask Me Anything (AMA) Session

Register now - Market & Audience Department Ask Me Anything (AMA) Session 11 February 2026 Screen Australia Head of Market & Audience Rakel Tansley Talking to...