Sony Pixel Power calrec Sony

NVIDIA Wins NeurIPS Awards for Research on Generative AI, Generalist AI Agents

28/11/2022

Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.

These are among more than 60+ talks, posters and workshops with NVIDIA authors being presented at the NeurIPs conference, taking place this week in New Orleans and next week online.

Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.

AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents, said Jan Kautz, vice president of learning and perception research at NVIDIA. In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations.

Reimagining the Design of Diffusion-Based Generative Models Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models, proposing improvements that can dramatically improve the efficiency and quality of these models.

The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.

Training Generalist AI Agents in a Minecraft-Based Simulation Suite While researchers have long trained autonomous AI agents in video-game environments such as Starcraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.

Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.

As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.

Creating Complex 3D Shapes to Populate Virtual Worlds Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.

Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.

The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.

Improving Inverse Rendering Pipelines With Control Over Materials, Lighting At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.

The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.

The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.

Enhancing Factual Accuracy of Language Models' Generated Text Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.

Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, which is necessary before such models can be deployed for real-world applications.

The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.

There are more th
LINK: https://blogs.nvidia.com/blog/2022/11/28/nvidia-neurips-research/...
See more stories from nvidia

North America Stories

05/12/2025

Netflix to Acquire Warner Bros. in Deal Worth $82.7 Billon

LOS ANGELES Netflix announced it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....

05/12/2025

Gracenote Launches New CTV Ad Platform

NEW YORK Nielsens Gracenote has launched Gracenote Content Connect, a new ad platform that provides agencies, brands, supply-side platforms (SSPs) and demand-si...

05/12/2025

IAB Tech Lab Releases Deals API

NEW YORK In an most important update to the workings of deal-based programmatic advertising, IAB Tech Lab has released version 1.0 of its Deals API for public c...

05/12/2025

Nielsen: NFL Thanksgiving Games Score Big Audiences

NEW YORK Pass the turkey. Pass the stuffing. Pass the cranberry sauce. All are common requests of Americans celebrating Thanksgiving Day with family and f...

05/12/2025

Iris Cloud-Connected Camera Control Platform Now Available

NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...

05/12/2025

Netflix to Acquire Warner Bros. in Deal Worth $82.7B

HOLLYWOOD, Calif. Netflix announced today that it has entered into an agreement to acquire the assets of Warner Bros. for $82.7 billion....

05/12/2025

Iris Cloud-Connected Camera Control Platform Is Now Available

NEW YORK Iris, the new cloud-connected camera control platform, has officially launched with features that turn virtually any PTZ camera into a software-connect...

05/12/2025

FCC Approves AT&T's $1 Billion Acquisition of UScellular Spectrum

WASHINGTON The Federal Communications Commission has approved AT&T's $1.02 billion acquisition of spectrum from UScellular in a decision that was issued sho...

05/12/2025

The Best Coldplay Songs: 21 Tracks That Shoot for the Stars

The Best Coldplay Songs: 21 Tracks That Shoot for the Stars From Yellow to Viva La Vida, Fix You to Paradise, this playlist goes back to the start. December ...

05/12/2025

Zafris Lecture Series Brings Nabil Ayers to Berklee

Zafris Lecture Series Brings Nabil Ayers to Berklee The 32nd annual James G. Zafris Distinguished Lecture series was held on Thursday, November 13 with guest ...

04/12/2025

SVG Sit-Down: ProximaVision's Claudio Lisman on Why Tethered Drones Could Be a Game-Changer for Live Sports Production

SVG Sit-Down: ProximaVision's Claudio Lisman on Why Tethered Drones Could Be...

04/12/2025

SVG Campus Shot Callers: Imry Halevi, Senior Associate Director of Athletics, Content & Strategic Communications, Harvard University

SVG Campus Shot Callers: Imry Halevi, Senior Associate Director of Athletics, Co...

04/12/2025

Platinum White Paper: LiveU Lightweight Sports Production: A Step Change in Sports Storytelling

Platinum White Paper: LiveU Lightweight Sports Production: A Step Change in Spor...

04/12/2025

London to Riyadh: DAZN Brings the Boxing Glamour to New Production Levels for Benavidez v Yarde in Saudi Arabia

London to Riyadh: DAZN brings the boxing glamour to new production levels for Be...

04/12/2025

Analysis: Paramount Bets on the Battering Ram' with Champions League Play

Analysis: Paramount bets on the battering ram' with Champions League play By Callum McCarthy, Editor-at-Large Tuesday, December 2, 2025 - 10:12 Print ...

04/12/2025

Space City Home Network Launches SCHN+ DTC App for Astros and Rockets

Space City Home Network Launches SCHN DTC App for Astros and RocketsThe Rockets and Astros were previously the lone NBA and MLB teams without a DTC appBy Jason...

04/12/2025

SVG Summit 2025 Preview: Content Workflows Workshop Spotlights Evolution of Sports Media Supply Chain

SVG Summit 2025 Preview: Content Workflows Workshop Spotlights Evolution of Spor...

04/12/2025

New Sponsor Spotlight: Geotech's Patrick Wambold On the Unreal Engine Revolution Taking Place in Sports Broadcasting

New Sponsor Spotlight: Geotech's Patrick Wambold On the Unreal Engine Revolu...

04/12/2025

Curt Gowdy Jr. - Master Storyteller, Nationally and Regionally

Curt Gowdy Jr. - Master Storyteller, Nationally and RegionallyBy Jason Dachman, Editorial Director, U.S. Thursday, December 4, 2025 - 1:52 pm Print This Sto...

04/12/2025

Cutting Through Rocks ( ) Shows the Difference That One Person Can Make for Change

(L-R) Rebecca Lichtenfeld, Mohammadreza Eyni, Sara Khaki, and Judith Helfand att...

04/12/2025

L3Harris Supports NOAA's Million Mile Journey to Safeguard Earth from Solar Storms

Coronal mass ejections caused by eruptions on the surface of the sun can have fa...

04/12/2025

Gracenote launches new CTV ad platform making program-level targeting a reality

Gracenote Content Connect enables media ecosystem to precisely align ad campaigns and programming based on rich content signals NEW YORK - December 4, 2025 - N...

04/12/2025

Lightware in 2025 - Celebrating a successful year of inno...

Lightware, a global specialist in AV connectivity, is looking back on a year defined by new advancements, strong collaboration and continued growth. Across the ...

04/12/2025

Riedel and Haivision Join Forces to Advance Wireless Vide...

Riedel Communications today announced a new partnership with Haivision, a leading global provider of mission-critical, real-time video networking and visual col...

04/12/2025

Harmonic and Normann Engineering Achieve Major Milestone...

Harmonic (NASDAQ: HLIT) and Normann Engineering today announced a major milestone in their strategic collaboration, celebrating 20 successful broadband deployme...

04/12/2025

Foundry introduces Multi-Paint support for Mari 7-5 devel...

Creative software developer Foundry today announced Mari 7.5, the latest iteration of its artist-friendly paint toolset that can handle large, detailed assets w...

04/12/2025

Professional Wireless Systems PWS Manages Over 1000 Wirel...

Professional Wireless Systems (PWS), a leading provider of wireless audio solutions and RF management, was on site at Dreamforce 2025 in San Francisco providing...

04/12/2025

Lionsgate and Debmar-Mercury partner with LTN to power di...

LTN's purpose-built IP video network brings all-movie diginet to over 100 stations and streaming platforms in just three months while eliminating satellite ...

04/12/2025

Bitmovin and ThinkAnalytics Partner to Deliver Intelligen...

Bitmovin, the leading provider of video streaming solutions, today announced a strategic partnership with ThinkAnalytics, the global leader in AI-powered data a...

04/12/2025

The HELM and Keslow Camera join forces to launch Keslow L...

The HELM, a global expert in cinematic live broadcast and high-end production workflows, has signed a partnership agreement with Keslow Camera, one of North Ame...

04/12/2025

LiveU Pushes Creative Boundaries at ISE 2026 Powering Ric...

At ISE 2026, LiveU will showcase its expanded IP-video EcoSystem, enabling broadcasters, sports, production companies and pro-AV professionals to share their st...

04/12/2025

Broadcasters See More Potential in Programmatic Advertising

Since the beginning of commercial television, advertising has been a key part of broadcasting. Over the years, the technology for inserting ads into programs ha...

04/12/2025

HBO Max Plans Significant Expansion of European Footprint

MUNICH and MILAN Warner Bros. Discovery said HBO Max is expanding into Germany, Italy, Austria, Switzerland, Luxembourg and Liechtenstein on Jan. 13, 2026, and ...

04/12/2025

AudioShake Launches Features for Removing Copyrighted Music

SAN FRANCISCO AudioShake has launched its first streaming-capable software development kits (SDKs) designed specifically for real-time music detection and copyr...

04/12/2025

TNDV Wraps REMI Production of a Fishing Tournament in Mexico

NASHVILLE The mobile and REMI production company TNDV has announced that it headed south into Mexico to live-produce the three-day 2025 Zane Grey Championship P...

04/12/2025

HPA Executive Director Phil Kubel Steps Down

BURBANK, Calif. Hollywood Professionals Association Executive Director Phil Kubel has stepped down from the organization to pursue new opportunities, the group ...

04/12/2025

FCC Closes More Than 2,000 Inactive Proceedings

WASHINGTON The Federal Communications Commission said it has closed 2,048 inactive proceedings, the largest number of dormant dockets ever terminated in a singl...

04/12/2025

AV1 Open Video Codec Now Powers 30% of Netflix Streaming

A new tech blog from Netflix highlights the importance of the AV1 open video codec, which now powers about 30% of the platform's streaming and discusses a v...

04/12/2025

Step Inside the World of 'Troll 2': VFX Breakdown Featuring Director Roar Uthaug

Back to All News Step Inside the World of Troll 2: VFX Breakdown Featuring Dire...

04/12/2025

Robots' Holiday Wishes Come True: NVIDIA Jetson Platform Offers High-Performance Edge AI at Festive Prices

Developers, researchers, hobbyists and students can take a byte out of holiday s...

04/12/2025

Game the Halls: GeForce NOW Brings Holiday Cheer With 30 New Games in the Cloud

Editor's note: The Game Pass edition of Hogwarts Legacy' will also be supported on GeForce NOW when the Steam and Epic Games Store versions launch on t...

04/12/2025

December 03, 2025

Scientists find cancer weak spot in backup DNA repair system New findings from Scripps Research reveal how certain tumors survive DNA damage and point to a stra...

03/12/2025

MLS Cup 2025 Production To Feature Four iPhone 17 Pros as Game-Coverage Cameras

MLS Cup 2025 Production To Feature Four iPhone 17 Pros as Game-Coverage CamerasStay tuned to SVG on Friday for our in-depth story on this year's MLS Cup pro...

03/12/2025

SVG LIVE! 2025: All Sessions Now Available to Watch on SVG PLAY

SVG LIVE! 2025: All Sessions Now Available to Watch on SVG PLAYThe inaugural event placed a spotlight on the exciting world of live entertainmentBy SVG Staff ...

03/12/2025

Endless Cookie Is an Animated Documentary Unlike Any Other

(L-R) Peter Scriver and Seth Scriver introduce their documentary Endless Cookie for its premiere at the Egyptian Theatre in Park City. (Photo by Andrew H. Wa...

03/12/2025

Lionsgate, Debmar-Mercury Turn to LTN to Distribute MovieSphereGold

COLUMBIA, Md. Lionsgate and its TV syndicator subsidiary Debmar-Mercury have selected LTN to launch and deliver the new MovieSphereGold all-movie digital networ...

03/12/2025

Bitmovin, ThinkAnalytics Partner to Expand AI Capabilities

VIENNA, Austria Video streaming solutions provider Bitmovin and ThinkAnalytics, a provider of AI-powered data analytics for TV, have formed a strategic partners...

03/12/2025

Dolby and NFM to Debut First-Ever Dolby Home Experience

SAN FRANCISCO & THE COLONY, Texas Dolby Laboratories is making what it is calling a new chapter in its retail efforts as part of an agreement with NFM (Nebras...

03/12/2025

Chinese Broadcaster Takes Delivery of Native-IP Outside Broadcast Vehicle

MONTREAL Grass Valley has delivered a 4K Ultra-High-Definition (UHD) outside broadcast (OB) truck to Guangdong Radio and Television (GRT), in partnership with B...