Sony Pixel Power calrec Sony

NVIDIA Wins NeurIPS Awards for Research on Generative AI, Generalist AI Agents

28/11/2022

Two NVIDIA Research papers - one exploring diffusion-based generative AI models and another on training generalist AI agents - have been honored with NeurIPS 2022 Awards for their contributions to the field of AI and machine learning.

These are among more than 60+ talks, posters and workshops with NVIDIA authors being presented at the NeurIPs conference, taking place this week in New Orleans and next week online.

Synthetic data generation - for images, text or video - is a key theme across several of the NVIDIA-authored papers. Other topics include reinforcement learning, data collection and augmentation, weather models and federated learning.

AI is an incredibly important technology, and NVIDIA is making fast progress across the gamut - from generative AI to autonomous AI agents, said Jan Kautz, vice president of learning and perception research at NVIDIA. In generative AI, we are not only advancing our theoretical understanding of the underlying models, but are also making practical contributions that will reduce the effort of creating realistic virtual worlds and simulations.

Reimagining the Design of Diffusion-Based Generative Models Diffusion-based models have emerged as a groundbreaking technique for generative AI. NVIDIA researchers won an Outstanding Main Track Paper award for work that analyzes the design of diffusion models, proposing improvements that can dramatically improve the efficiency and quality of these models.

The paper breaks down the components of a diffusion model into a modular design, helping developers identify processes that can be adjusted to improve the performance of the entire model. The researchers show that their modifications enable record scores on a metric that assesses the quality of AI-generated images.

Training Generalist AI Agents in a Minecraft-Based Simulation Suite While researchers have long trained autonomous AI agents in video-game environments such as Starcraft, Dota and Go, these agents are usually specialists in only a few tasks. So NVIDIA researchers turned to Minecraft, the world's most popular game, to develop a scalable training framework for a generalist agent - one that can successfully execute a wide variety of open-ended tasks.

Dubbed MineDojo, the framework enables an AI agent to learn Minecraft's flexible gameplay using a massive online database of more than 7,000 wiki pages, millions of Reddit threads and 300,000 hours of recorded gameplay (shown in image at top). The project won an Outstanding Datasets and Benchmarks Paper Award from the NeurIPS committee.

As a proof of concept, the researchers behind MineDojo created a large-scale foundation model, called MineCLIP, that learned to associate YouTube footage of Minecraft gameplay with the video's transcript, in which the player typically narrates the onscreen action. Using MineCLIP, the team was able to train a reinforcement learning agent capable of performing several tasks in Minecraft without human intervention.

Creating Complex 3D Shapes to Populate Virtual Worlds Also at NeurIPS is GET3D, a generative AI model that instantly synthesizes 3D shapes based on the category of 2D images it's trained on, such as buildings, cars or animals. The AI-generated objects have high-fidelity textures and complex geometric details - and are created in a triangle mesh format used in popular graphics software applications. This makes it easy for users to import the shapes into 3D renderers and game engines for further editing.

Named for its ability to Generate Explicit Textured 3D meshes, GET3D was trained on NVIDIA A100 Tensor Core GPUs using around 1 million 2D images of 3D shapes captured from different camera angles. The model can generate around 20 objects a second when running inference on a single NVIDIA GPU.

The AI-generated objects could be used to populate 3D representations of buildings, outdoor spaces or entire cities - digital spaces designed for industries such as gaming, robotics, architecture and social media.

Improving Inverse Rendering Pipelines With Control Over Materials, Lighting At the most recent CVPR conference, held in New Orleans in June, NVIDIA Research introduced 3D MoMa, an inverse rendering method that enables developers to create 3D objects composed of three distinct parts: a 3D mesh model, materials overlaid on the model, and lighting.

The team has since achieved significant advancements in untangling materials and lighting from the 3D objects - which in turn improves creators' abilities to edit the AI-generated shapes by swapping materials or adjusting lighting as the object moves around a scene.

The work, which relies on a more realistic shading model that leverages NVIDIA RTX GPU-accelerated ray tracing, is being presented as a poster at NeurIPS.

Enhancing Factual Accuracy of Language Models' Generated Text Another accepted paper at NeurIPS examines a key challenge with pretrained language models: the factual accuracy of AI-generated text.

Language models trained for open-ended text generation often come up with text that includes nonfactual information, since the AI is simply making correlations between words to predict what comes next in a sentence. In the paper, NVIDIA researchers propose techniques to address this limitation, which is necessary before such models can be deployed for real-world applications.

The researchers built the first automatic benchmark to measure the factual accuracy of language models for open-ended text generation, and found that bigger language models with billions of parameters were more factual than smaller ones. The team proposed a new technique, factuality-enhanced training, along with a novel sampling algorithm that together help train language models to generate accurate text - and demonstrated a reduction in the rate of factual errors from 33% to around 15%.

There are more th
LINK: https://blogs.nvidia.com/blog/2022/11/28/nvidia-neurips-research/...
See more stories from nvidia

Most recent headlines

09/11/2025

Dalet Unveils Agentic AI Media Workflows at IBC2025

Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...

06/10/2025

France Tlvisions Wins Prestigious 2025 EBU Technology & Innovation Award in Groundbreaking Collaboration with Dalet

France T l visions, France's leading broadcaster, has received the 2025 EBU ...

17/09/2025

Tech Focus: Audio Training, Part 2 - Manufacturers Offer Extensive Online Learning

Tech Focus: Audio Training, Part 2 - Manufacturers Offer Extensive Online Learni...

17/09/2025

Tech Focus: Audio Training, Part 1 - A1 Shortage Remains a Major-League Challenge for Sports Broadcasting

Tech Focus: Audio Training, Part 1 - A1 Shortage Remains a Major-League Challeng...

17/09/2025

Dua Lipa's Service95 Book Club' Goes Live at the New York Public Library

It was the ultimate convergence of pop culture and literary prestige: Last night, Dua Lipa brought her Service95 Book Club podcast to the stage for a special li...

17/09/2025

The Gauge: Mexico August 2025

During August, streaming's share of TV viewing in Mexico showed an increase of 0.4% compared to the previous month, accounting for 25% of TV viewing. Discl...

17/09/2025

Jo Aun Joins FOR-A America as Senior Manager, Product Engineering

CYPRESS, Calif. FOR-A America has named Jo Aun as senior manager of product engineering, a new role responsible for guiding the planning, development and rollou...

17/09/2025

PlayBox Neo and CIS Group Power CazeTV with a seamless Pl...

PlayBox Neo, in partnership with CIS Group, a leading provider of media and broadcast technology solutions, has successfully deployed PlayBox Neo's Dual Cha...

17/09/2025

Energy Regulatory Agency Underscores Commitment with Ene...

In a relationship that mirrors societal advances in sustainability, Brightline Lighting and the Federal Energy Regulatory Commission (FERC) Headquarters have en...

17/09/2025

Clear-Com Powers Star-Studded Communications at Houston A...

Clear-Com is proud to support the world-class productions of Alley Theatre, one of the oldest and largest nonprofit resident theatres in the United States. With...

17/09/2025

Arch Platform Technologies Announces Strategic Collaborat...

Arch Platform Technologies (www.archpt.io), a pioneer in automated, scalable cloud infrastructure for high-performance workflows, today announced a Strategic Co...

17/09/2025

With over 39bn EUR in assets under management and record-...

Over 300 selected decision-makers from start-ups, corporates, and VC funds worldwide will gather for the third edition of the event, united by a single goal: to...

17/09/2025

Telestream Celebrates Award Win at IBC2025

Telestream, a global leader in media workflow technologies, is excited to announce that its flagship Vantage platform and its next-generation AI capabilities re...

17/09/2025

Mediagenix Celebrates Triple Best of Show Wins at IBC2025...

Mediagenix, a global leader in smart content solutions that profitably connect the right content to the right audience, proudly announces its three Best of Show...

17/09/2025

PlayBox Neo Appoints Transtel Universal as Top Reseller P...

In a move to further establish a firm foothold across South East Asia, PlayBox Neo, the well-respected name in broadcast playout and channel branding, has appoi...

17/09/2025

Wisycom Unveils Two New Solutions at IBC 2025

Wisycom, a global leader in advanced wireless audio solutions, announced two major wireless solutions at IBC 2025 (Stand 8.D30). This includes the Portable RF-o...

17/09/2025

Six Berklee Alumni Win Emmy Awards

Six Berklee Alumni Win Emmy Awards The recipients were recognized for their contributions to acclaimed programs Severance, The Studio, The Penguin, SNL50: The...

17/09/2025

Applications Open for Berklee in Santo Domingo

Applications Open for Berklee in Santo Domingo The weeklong contemporary music program will run January 5-10, 2026. By Colette Greenstein September 17, 2025 ...

17/09/2025

Ukrainian Students Find Creative Consonance' at Berklee Valencia

Ukrainian Students Find Creative Consonance' at Berklee Valencia Through ELIA's UAx Platform, six students from Kyiv joined Berklee Valencia for a week...

17/09/2025

Meet Kenna Hilburn, Avids New Incoming Chief Product Officer

Earlier this year Avid announced Kenna Hilburn as its new senior vice president of product. Recently Hilburn was promoted to Avids new Chief Product Officer, su...

17/09/2025

SES and K2 Space to Accelerate Development of Next-Generation MEO Network

Transatlantic collaboration combines experience and agility to drive innovation in network design and delivery Luxembourg, September 16, 2025 - SES, a leading ...

17/09/2025

Fox TV Stations Join Madhive's Local Live Sports Marketplace

NEW YORK Madhive has announced that the Fox Television Stations have joined its Live Sports Marketplace....

17/09/2025

Sony Electronics Partners with Newhouse School at Syracuse University

SYRACUSE, N.Y. Sony Electronics has announced that it is partnering with the Newhouse School at Syracuse University to provide state-of-the-art equipment, hands...

17/09/2025

Roku's First TV Smart Projector Now Available in the U.S.

SAN JOSE, Calif. Roku has announced that the first smart projector using its Roku TV operating system, the Aurzen Roku TV Smart Projector D1R Cube, is now avail...

17/09/2025

Portrait Artist of the Year returns to Sky Arts with a dazzling line-up of celebrity sitters on 1 October

Wednesday 17 September 2025 UK artists capture icons of stage and screen, inclu...

17/09/2025

FOR-A America Appoints Jo Aun to Lead U.S. Product Development

Jo Returns to FOR-A as Senior Manager of Product Management and Engineering...

17/09/2025

AIR's Big Comeback with DPA Microphones

For the Moon Safari anniversary tour, AIR opened the doors to their backstage. Just a few hours before the Paris concert, DPA met with two key figures of the te...

17/09/2025

The Late Late Toy Show hits the road in search of Ireland's brightest young stars

Auditions will be held in Dublin, Cork and Galway The County Parade returns f...

16/09/2025

SVG All-Stars: Leigh Michaud, Manager, Remote Operations, ESPN

SVG All-Stars: Leigh Michaud, Manager, Remote Operations, ESPNThe UConn grad rose from ESPN's mailroom to become one of its most valuable ops leadersBy Bran...

16/09/2025

Live From IBC 2025: Friday's Latest From Halls 1-4, Outdoor Exhibits in Amsterdam

Live From IBC 2025: Friday's Latest From Halls 1-4, Outdoor Exhibits in Amst...

16/09/2025

Live From IBC 2025: Saturday's Latest From Halls 5-7 in Amsterdam

Live From IBC 2025: Saturday's Latest From Halls 5-7 in Amsterdam By SVG Staff Friday, September 12, 2025 - 17:00 Print This Story The SVG Europe and ...

16/09/2025

Live From IBC 2025: Sunday's Latest From Halls 8-10 in Amsterdam

Live From IBC 2025: Sunday's Latest From Halls 8-10 in Amsterdam By SVG Staff Saturday, September 13, 2025 - 17:00 Print This Story The SVG Europe and...

16/09/2025

Live From IBC 2025: Monday's Latest From Halls 11-14 in Amsterdam

Live From IBC 2025: Monday's Latest From Halls 11-14 in Amsterdam By SVG Staff Sunday, September 14, 2025 - 17:00 Print This Story The SVG Europe and ...

16/09/2025

Amazon Prime Video Picks Up Four Hours of Early-Round Masters Coverage in 2026

Amazon Prime Video Picks Up Four Hours of Early-Round Masters Coverage in 2026 By Jason Dachman, Editorial Director, U.S. Tuesday, September 16, 2025 - 10:15...

16/09/2025

VERSANT Inks Deal for League One Volleyball as Women's Sports Rights Slate Grows

VERSANT Inks Deal for League One Volleyball as Women's Sports Rights Slate G...

16/09/2025

ESPN VP, Corporate Communications, Katina Arnold Named SVP, Disney Advertising Communications

ESPN VP, Corporate Communications, Katina Arnold Named SVP, Disney Advertising C...

16/09/2025

IBC 2025 in Review: SVG Europe's Full Collection of Video Interviews From the Show Floor

IBC 2025 in Review: SVG Europe's Full Collection of Video Interviews From th...

16/09/2025

Celebramos 10 aos de Viva Latino en Spotify y el xito global de la msica latina

Hace una d cada, la m sica latina representaba apenas el 8% de las reproducciones globales en Spotify. Hoy, constituye m s de una cuarta parte (27%) de toda la ...

16/09/2025

Celebrating 10 Years of Spotify's Viva Latino Playlist and the Global Rise of Latin Music

A decade ago, Latin music made up just 8% of global Spotify streams. Today, it a...

16/09/2025

Spotify Welcomes Graham Norton and Select VICE Studios Content

Spotify is expanding our video lineup with a new partnership with Zoo 55, part of ITV Studios. For the first time, acclaimed content from ITV Studios is landing...

16/09/2025

One Enterprise, One Mission: Aligning the Supply Chain to the Warfighter

At DSEI 2025, James Dunne of L3Harris Maritime UK chaired a panel on aligning the supply chain to the warfighter, where leaders discussed modernising support fo...

16/09/2025

RTW chooses Calrec as technology partner

Calrec has strengthened its collaboration with audio metering expert RTW by integrating RTW's new TMxCore metering platform across its full range of Argo IP...

16/09/2025

Football and Back-to-School Dynamics Spark First Gains Since April for Traditional TV

College Football Scores Top Telecast in August with 16M+ Viewers on FOX, Followe...

16/09/2025

Index Exchange and Gracenote Team to Enhance Contextual Intelligence in Programmatic Streaming TV

Collaboration marks the first SSP integration of Gracenote IDs, enabling show-le...

16/09/2025

IBC2025 Attracts 43,858 Visitors

AMSTERDAM The organizers of IBC2025 are reporting that 43,858 visitors from more than 170 countries attended the event, which had more than 1,300 exhibitors and...

16/09/2025

Wooden Camera Releases Accessory Collection for FUJIFILMs...

Wooden Camera announces the release of its new Accessory Collection for the FUJIFILM GFX ETERNA 55. The highlights of this collection include vital power soluti...

16/09/2025

AntonBauer Launches Free Cloud Platform for Smarter Batte...

Anton/Bauer, a leading manufacturer of mobile power solutions for broadcast and cinematic equipment, has announced the launch of Anton/Bauer Fleet Management, a...

16/09/2025

Teradek Launches Prism Jetpack - A New Era of 5G Video Co...

Teradek, a leading provider of video transmission and live production solutions, today announced the launch of Prism Jetpack, a groundbreaking 5G video contribu...

16/09/2025

Astera Reinvents Practical Lighting with SolaBulb

Astera, the leader in wireless LED lighting solutions, announces the ultra-versatile SolaBulb. Building on the success of the Astera bulb family, SolaBulb intro...

16/09/2025

TED2025 Relies on Clear-Com and NETGEAR to Power Producti...

As the world gathered at TED2025 to explore the provocative theme "Humanity Reimagined", Clear-Com , supported by NETGEAR networking infrastructure, delivered f...