Sony Pixel Power calrec Sony

NVIDIA Wins NeurIPS Awards for Research on Generative AI, Generalist AI Agents

28/11/2022

Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.

These are among more than 60+ talks, posters and workshops with NVIDIA authors being presented at the NeurIPs conference, taking place this week in New Orleans and next week online.

Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.

AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents, said Jan Kautz, vice president of learning and perception research at NVIDIA. In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations.

Reimagining the Design of Diffusion-Based Generative Models Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models, proposing improvements that can dramatically improve the efficiency and quality of these models.

The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.

Training Generalist AI Agents in a Minecraft-Based Simulation Suite While researchers have long trained autonomous AI agents in video-game environments such as Starcraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.

Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.

As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.

Creating Complex 3D Shapes to Populate Virtual Worlds Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.

Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.

The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.

Improving Inverse Rendering Pipelines With Control Over Materials, Lighting At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.

The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.

The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.

Enhancing Factual Accuracy of Language Models' Generated Text Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.

Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, which is necessary before such models can be deployed for real-world applications.

The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.

There are more th
LINK: https://blogs.nvidia.com/blog/2022/11/28/nvidia-neurips-research/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

21/04/2026

Nielsen data shows Australian outdoor and sport retailers are changing how they advertise to win over outdoor enthusiasts

Advertising strategies shift as competition grows for a large, active and qualit...

21/04/2026

ATSC Celebrates 3.0's Global Expansion

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Cinematic Feel Makes Survivor' Built to Last

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Live Event Technology Expands Fan Engagement

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

MS NOW Uses Community to Build Up Its Brand

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Why Broadcast Is Well-Positioned to Safeguard Freedom of Speech

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

AWS Demos AI Tools to Deliver Vertical Video

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Video Podcasting Leaps in Popularity

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Audio Systems Get Boost From Cloud and AI

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration with Streamcake

Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration w...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse...

21/04/2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026

On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026 Melanie Ciotti April 21, 2026 0 Comments Unlimited processing, no cloud depe...

21/04/2026

IBC appoints Tim Banham as Chief Commercial Officer to dr...

London, 21 April 2026 IBC today announced the appointment of Tim Banham as its first Chief Commercial Officer (CCO), a newly created role that reflects the or...

21/04/2026

Motion Design Tools - April 2026

Motion Design Tools - April 2026 Roland Kahlenberg April 21, 2026 0 Comments Within 2 days, Maxon and Canva announced pro-level motion design apps - A...

21/04/2026

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and XR

Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and...

21/04/2026

Diversified Appoints Tyler Affolter Chief Revenue Officer

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

TV Azteca to Bring Dolby Atmos to Free-To-Air TV in Mexico

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Maxon Announces Free Tools and Mobile Expansion of ZBrush...

Cinema 4D brings professional 3D workflows to iPad. The return of Autograph now free for individual users. ZBrush expands to Windows on Arm. See it all at NAB...

21/04/2026

Bitfocus improves availability, security and user managem...

Software version 1.6 extends enterprise functionality to place Buttons at the heart of media operations at any scale Bitfocus, the Norwegian software develope...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2...

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows Compact, multi-function stan...

21/04/2026

Applications open for 2026 AISF and Screen Australia Writer/Director Virtual Sessions

Applications open for 2026 AISF and Screen Australia Writer/Director Virtual Ses...

21/04/2026

Cultivating creativity: Super Garden is back for another season

Summer is nearly here and Super Garden is returning to our screens to spark some gardening inspiration. The new series kicks off on Thursday 23 April at 7pm on ...

20/04/2026

Live From NAB 2026: Sonys Hugo Gaggioni Highlights HDR Advances, Software-Defined Workflows

At the 2026 NAB Show, Sony is showcasing a broad slate of innovations across liv...

20/04/2026

Live From NAB 2026: Fujinons Stosh Durbacz on Expanding the 4K Broadcast Lens Lineup With New Portable Zooms, 94x Box Lens

Fujifilm is sharpening its focus on core broadcast production with a new wave of...

20/04/2026

Live From NAB 2026: Rock-It Sports' John Walberg on Powering Logistics, Shipping for the 2026 FIFA Men's World Cup

This upcoming summer in North America is going to be a busy one. The 2026 FIFA M...

20/04/2026

NAB 2026: Glookast outlines product updates including Media Producer UX, connectors and Premiere Pro panel

Glookast (Booth W1661) announced a series of product updates at NAB Show 2026, c...

20/04/2026

NAB 2026: Matrox Video and Amagi collaborate on cloud-based broadcast workflows using ORIGIN framework

Matrox Video and Amagi announced a collaboration to integrate the Matrox ORIGIN ...

20/04/2026

NAB 2026: Riedel SimplyLive supports expanded centralised VAR system for Argentina football league

Riedel Communications (Booth C4908) announced that the Asociaci n del F tbol Arg...

20/04/2026

NAB 2026: Ikegami introduces VFE-P07D OLED viewfinder with integrated LCD monitor

Ikegami (Booth C3819) announced the VFE-P07D monocular OLED viewfinder at NAB Sh...

20/04/2026

NAB 2026: IABM rebrands as IAMT and launches AI discovery platform and global alliance

International Association of MediaTech (IAMT), formerly known as IABM, announced...

20/04/2026

NAB 2026: Harmonic supports DIRECTV DTH platform upgrade with VOS Media Software

Harmonic (Booth W2831) announced that DIRECTV is updating its US direct-to-home (DTH) video platform using Harmonic's VOS Media Software. The deployment is...

20/04/2026

NAB 2026: Wasabi Technologies acquires Seagate Lyve Cloud business

Wasabi Technologies announced that it has acquired the Lyve Cloud business from Seagate Technology. As part of the agreement, Seagate received equity in Wasabi ...

20/04/2026

NAB 2026: EVS introduces Choreon robotics orchestration platform for unified production control

EVS (Booth N1841) has launched Choreon, a robotics controller for media producti...

20/04/2026

SportsTechBuzz at NAB 2026, Day 2: Live Reports From the Show Floor in Vegas

The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...

20/04/2026

NAB 2026: Skyline Communications launches DataMiner packages on Grass Valley AMPP App Store

Skyline Communications announced the availability of its DataMiner xOps platform...

20/04/2026

NAB 2026: SNS launches Outpost, Trio and AI Suite for connected post-production workflows

Studio Network Solutions (Booth N1129) introduced a set of new products at NAB S...

20/04/2026

NAB 2026: Dell Technologies and NVIDIA present AI data platform for media workflows

Dell Technologies is showcasing its Dell AI Data Platform with NVIDIA at NAB Sho...

20/04/2026

NAB 2026: Blackmagic Design Announces Fairlight Live Software Audio Mixer

Blackmagic Design has announced Fairlight Live, a software-based live audio mixer with SMPTE 2110 support and spatial audio mixing. A public beta is available n...

20/04/2026

Live From NAB 2026: Imagine Comms Jimbo Haneklau Talks Prismon, Hybrid IP/SDI Workflows, and Cloud Playout

At the 2026 NAB Show in Las Vegas, Imagine Communications VP of Sales, Sports an...

20/04/2026

Live From NAB 2026: LiveUs Phillip Broaddus on LU900Q Launch, Nexus Cloud Platform, and REMI Growth

At the 2026 NAB Show in Las Vegas, LiveU Senior Director of Sales, Sports Philli...

20/04/2026

3 New Ways to Dive Deeper Into the Music You Love

A song that perfectly captures a moment is magic. But when you uncover the story behind it, who made it, what inspired it, and the meaning woven into the lyrics...

20/04/2026

Deity Microphones announce the PR-4

Ultra-compact 32-bit recorder set for launch Deity Microphones will soon be launching a new 32-bit six-track recorder that's been designed with producti...

20/04/2026

Lectrosonics preview the S1

Uncoming lightweight shotgun mic announced Production-sound experts Lectrosonics have recently announced the upcoming launch of a new lightweight shotgun mi...