Sony Pixel Power calrec Sony

NVIDIA Wins NeurIPS Awards for Research on Generative AI, Generalist AI Agents

28/11/2022

Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.

These are among more than 60+ talks, posters and workshops with NVIDIA authors being presented at the NeurIPs conference, taking place this week in New Orleans and next week online.

Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.

AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents, said Jan Kautz, vice president of learning and perception research at NVIDIA. In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations.

Reimagining the Design of Diffusion-Based Generative Models Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models, proposing improvements that can dramatically improve the efficiency and quality of these models.

The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.

Training Generalist AI Agents in a Minecraft-Based Simulation Suite While researchers have long trained autonomous AI agents in video-game environments such as Starcraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.

Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.

As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.

Creating Complex 3D Shapes to Populate Virtual Worlds Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.

Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.

The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.

Improving Inverse Rendering Pipelines With Control Over Materials, Lighting At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.

The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.

The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.

Enhancing Factual Accuracy of Language Models' Generated Text Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.

Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, which is necessary before such models can be deployed for real-world applications.

The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.

There are more th
LINK: https://blogs.nvidia.com/blog/2022/11/28/nvidia-neurips-research/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

13/04/2026

TikToK, Major Ad Groups Back Influencer Certification Program

Share Copy link Facebook X Linkedin Bluesky Email...

13/04/2026

DHD Marks 30th Anniversary with Brand Relaunch

DHD audio, developer and manufacturer of digital audio systems for professional broadcast, has launched a comprehensive brand update to mark its 30th anniversar...

13/04/2026

Stegawave Debuts Real-Time Forensic Watermarking to Tackl...

Stegawave, an Irish technology company specialising in forensic watermarking for video content, today announced the launch of its anti-piracy platform for live ...

13/04/2026

Synamedia PowerVu cuts broadcast distribution costs by up...

New version of Quortex PowerVu delivers a standards-based approach to satellite-to-IP transitions, eliminating the need for baseband workflows and complex infra...

13/04/2026

Studio Berlin Invests in Cinematic Live Production with G...

Grass Valley LDX camera systems enable leading German production company to support broadcast and cinematic live production within a single environment. Grass ...

13/04/2026

Techex and MediaKind partner to bring resilient IP transp...

London, UK, 13 April 2026 Techex and MediaKind today announced a partnership to integrate Techexs IP transport and orchestration technology, tx edge, directly...

13/04/2026

Transforming modern education environments with Lightware...

In today's hybrid education environments, there is no one-size-fits-all' AV solution. Lightware's extensive AV portfolio addresses this challenge, ...

13/04/2026

Mediaproxy adds AI toolset to LogServer for brand and adv...

Mediaproxy, the global standard for software-based IP compliance monitoring and multiviewing solutions, has developed a new suite of AI-powered tools designed t...

13/04/2026

Freelance Video Cameraman - Los Angeles

Freelance Video Cameraman - Los Angeles Brie Clayton April 13, 2026 0 Comments Freelance Video Cameraman April 8, 2026COW Jobs: Director Needed for ...

13/04/2026

Atomos to Acquire Flanders Scientific

Atomos to Acquire Flanders Scientific Brie Clayton April 13, 2026 0 Comments Strengthening commitment to precision monitoring, from camera to delivery...

13/04/2026

Digital Anarchy Announces ShotNotes, A Notepad and Task Tracking Panel for Premiere Pro

Digital Anarchy Announces ShotNotes, A Notepad and Task Tracking Panel for Premi...

13/04/2026

NAB 2026 Live Demo at HP Booth Highlights JALI Powered Interactive AI Character Experience

NAB 2026 Live Demo at HP Booth Highlights JALI Powered Interactive AI Character ...

13/04/2026

Manifold Introduces AT300 Multiviewer Support at NAB 2026

Manifold Introduces AT300 Multiviewer Support at NAB 2026 Brie Clayton April 13, 2026 0 Comments and adds HDR-SDR conversion to the recently announced...

13/04/2026

RT Radio 1 unveils new audio identity

RT Radio 1 has today launched a significant step in its ongoing strategic evolution. Following the launch of its brand-new schedule late last year, RT Radio 1...

13/04/2026

The Late Late Show offers a once-in-a-lifetime prize to the winner of next week's Opening Act compet

The Late Late Show Opening Act, the search for Ireland's newest country musi...

13/04/2026

RT appoints new Chief Financial Officer

RT has today announced the appointment of Annemarie Britz to the position of Chief Financial Officer, RT following a public competition. Annemarie Britz is c...

12/04/2026

Areal unveil the SR1

Headphone system designed for immersive monitoring With the demand for immersive audio showing no signs of slowing down, lots of companies are turning their...

12/04/2026

Beeble expands AI production workflow ahead of NAB 2026 with background remover

Beeble expands AI production workflow ahead of NAB 2026 with background remover Brie Clayton April 11, 2026 0 Comments Ahead of its upcoming participa...

12/04/2026

Like and Transcribe

Like and Transcribe Mei Semones BM '22 blends languages and techniques to create her singular style. April 10, 2026 By Bryan Parys Mei Semones BM '...

12/04/2026

Cue the Change

Cue the Change Nicknamed the Converse Conductor, Jonathon Heyward BM '14 is making classical music more relatable. April 10, 2026 By Sarah Godcher Murp...

12/04/2026

Heat Wave

Heat Wave Inside Miamis sizzling, boundary-blurring Latin music scene. April 13, 2026 By Ricardo Herrera Bandrich Image by Stella Levi Down there: Thats ...

11/04/2026

Infrasonic launch Infrasonic Berlin

Engineer collective welcome Freddy Knop Infrasonic, an award-winning collective of audio engineers operating out of Nashville and Los Angeles with credits r...

11/04/2026

L3Harris' Red Wolf and SKY RAIDER II INTERNATIONAL Showcase Adaptability for Evolving Missions

Combining launched effects with a proven mission aircraft, Red Wolf and SKY RAI...

11/04/2026

Accelerating Production of National Security Space Assets with Additive Manufacturing

3D printed RL10 rocket engine combustion chambers shown in the manufacturing are...

11/04/2026

Sachtler Highlights Comprehensive Camera Support Solutions at NAB 2026

Sachtler Highlights Comprehensive Camera Support Solutions at NAB 2026 Brie Clayton April 11, 2026 0 Comments Sachtler showcases advanced camera suppo...

11/04/2026

Sohonet Launches Media Fabric: A Unified Managed Infrastructure Suite for Film, Television and Post-Production

Sohonet Launches Media Fabric: A Unified Managed Infrastructure Suite for Film, ...

11/04/2026

AJA Unveils BRIDGE LIVE IP with SMPTE ST 2110 I/O

AJA Unveils BRIDGE LIVE IP with SMPTE ST 2110 I/O Brie Clayton April 11, 2026 0 Comments New IP video solution streamlines modern productions, providi...

11/04/2026

InSync Unveils Advanced Video Processing and Frame Rate Conversion Solutions at NAB 2026

InSync Unveils Advanced Video Processing and Frame Rate Conversion Solutions at ...

11/04/2026

Amagi Launches Newspulse: An Agentic AI Platform That Autonomously Turns Live Newscasts into Multi-Format Digital Content

Amagi Launches Newspulse: An Agentic AI Platform That Autonomously Turns Live Ne...

11/04/2026

Federal Judge Extends Nexstar/Tegna TRO, Softens Some Provisions

Share Copy link Facebook X Linkedin Bluesky Email...

11/04/2026

Sling TV Launches $19.99 a Month Sling Essentials with ESPN

Share Copy link Facebook X Linkedin Bluesky Email...

11/04/2026

NAB Show Launches Content Creator VIP Program

Share Copy link Facebook X Linkedin Bluesky Email...

11/04/2026

FCC Announces Tentative Agenda for April Open Meeting

Share Copy link Facebook X Linkedin Bluesky Email...

11/04/2026

GARR and Cubbit launch the first geo-distributed storage...

Pilot phase begins for a new national infrastructure designed to safeguard academic and research data with full local data control, sovereignty, resilience, and...

11/04/2026

How to Stream Coachella 2026 at Home

How to Stream Coachella 2026 at Home Check this years stacked schedule for the annual music festivals full lineup, including when Berklee artists from Laufey ...

11/04/2026

Time Travel

Time Travel As Berklee on the Road programs in Puerto Rico and Italy mark decades-long anniversaries, we journey into the past and step into the future. Apri...

11/04/2026

April 10, 2026

Improving vaccine design for Ebola, HIV and more Scripps Research scientists and colleagues develop a nanodisc platform that offers a clearer view of how key vi...

10/04/2026

The Invisible OPEX Killer: Is Your Server Room Dragging You Down?

The Invisible OPEX Killer: Is Your Server Room Dragging You Down? In the broadcast world, we talk a lot about uptime. We talk about talent retention, latency...

10/04/2026

NAB 2026: Imagine Communications to Showcase Expanded Multiviewer Portfolio

Imagine Communications will showcase its multiviewer portfolio at NAB Show 2026 (April 19-22, Booth N1328, Las Vegas Convention Center), including Prismon and t...

10/04/2026

NAB 2026: Chyron Releases PRIME VSAR 2.3 with Updated Unreal Engine Integration

Chyron has released PRIME VSAR 2.3, an update to its virtual set and augmented reality solution for broadcast. The release adds compatibility with Unreal Engine...

10/04/2026

NAB 2026: Techex to Showcase New tx darwin Capabilities

Techex will exhibit at NAB Show 2026 (Booth W2267, April 19-23, Las Vegas Convention Center), demonstrating new tx darwin features including consumer multiview,...

10/04/2026

NAB 2026: NDI to Showcase Ecosystem and NDI 6.3

NDI will exhibit at NAB Show 2026, demonstrating its IP video ecosystem through live partner integrations, NDI 6.3 features, AI metadata workflows, and creator ...

10/04/2026

FOR-A Acquires Tamura Corporations Information Equipment Business

FOR-A has announced the acquisition of all shares of Tamu Radiance Corporation, a new company spun off from the Information Equipment Business of Tamura Corpora...

10/04/2026

NAB 2026: InSync Technology to Unveil New Video Processing and Frame Rate Conversion Products

InSync Technology will showcase new and updated video conversion products at NAB...