NVIDIA Wins NeurIPS Awards for Research on Generative AI, Generalist AI Agents

28/11/2022

Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.

These are among more than 60 talks, posters and workshops with NVIDIA authors being presented at the NeurIPS conference, taking place this week in New Orleans and next week online.

Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.

"AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents," said Jan Kautz, vice president of learning and perception research at NVIDIA. "In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations."

Reimagining the Design of Diffusion-Based Generative Models

Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models and proposes changes that can dramatically improve the efficiency and quality of these models.

The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.

Training Generalist AI Agents in a Minecraft-Based Simulation Suite

While researchers have long trained autonomous AI agents in video-game environments such as StarCraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.

Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.

As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.

Creating Complex 3D Shapes to Populate Virtual Worlds

Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.

Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.

The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.

Improving Inverse Rendering Pipelines With Control Over Materials, Lighting

At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.

The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.

The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.

Enhancing Factual Accuracy of Language Models' Generated Text

Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.

Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, a step that is necessary before such models can be deployed for real-world applications.

The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.

There are more th
LINK: https://blogs.nvidia.com/blog/2022/11/28/nvidia-neurips-research/...