Sony Pixel Power calrec Sony

NVIDIA Releases Open Synthetic Data Generation Pipeline for Training Large Language Models

14/06/2024

NVIDIA today announced Nemotron-4 340B, a family of open models that developers can use to generate synthetic data for training large language models (LLMs) for commercial applications across healthcare, finance, manufacturing, retail and every other industry.

High-quality training data plays a critical role in the performance, accuracy and quality of responses from a custom LLM - but robust datasets can be prohibitively expensive and difficult to access.

Through a uniquely permissive open model license, Nemotron-4 340B gives developers a free, scalable way to generate synthetic data that can help build powerful LLMs.

The Nemotron-4 340B family includes base, instruct and reward models that form a pipeline to generate synthetic data used for training and refining LLMs. The models are optimized to work with NVIDIA NeMo, an open-source framework for end-to-end model training, including data curation, customization and evaluation. They're also optimized for inference with the open-source NVIDIA TensorRT-LLM library.

Nemotron-4 340B can be downloaded now from the NVIDIA NGC catalog and from Hugging Face, where developers can also use the Train on DGX Cloud service to easily fine-tune open AI models. Developers will soon be able to access the models at ai.nvidia.com, where they'll be packaged as an NVIDIA NIM microservice with a standard application programming interface that can be deployed anywhere.

Navigating Nemotron to Generate Synthetic Data LLMs can help developers generate synthetic training data in scenarios where access to large, diverse labeled datasets is limited.

The Nemotron-4 340B Instruct model creates diverse synthetic data that mimics the characteristics of real-world data, helping improve data quality to increase the performance and robustness of custom LLMs across various domains.

Then, to boost the quality of the AI-generated data, developers can use the Nemotron-4 340B Reward model to filter for high-quality responses. Nemotron-4 340B Reward grades responses on five attributes: helpfulness, correctness, coherence, complexity and verbosity. It's currently first place on the Hugging Face RewardBench leaderboard, created by AI2, for evaluating the capabilities, safety and pitfalls of reward models.

In this synthetic data generation pipeline, (1) the Nemotron-4 340B Instruct model is first used to produce synthetic text-based output. An evaluator model, (2) Nemotron-4 340B Reward, then assesses this generated text - providing feedback that guides iterative improvements and ensures the synthetic data is accurate, relevant and aligned with specific requirements. Researchers can also create their own instruct or reward models by customizing the Nemotron-4 340B Base model using their proprietary data, combined with the included HelpSteer2 dataset.

Fine-Tuning With NeMo, Optimizing for Inference With TensorRT-LLM Using open-source NVIDIA NeMo and NVIDIA TensorRT-LLM, developers can optimize the efficiency of their instruct and reward models to generate synthetic data and to score responses.

All Nemotron-4 340B models are optimized with TensorRT-LLM to take advantage of tensor parallelism, a type of model parallelism in which individual weight matrices are split across multiple GPUs and servers, enabling efficient inference at scale.

Nemotron-4 340B Base, trained on 9 trillion tokens, can be customized using the NeMo framework to adapt to specific use cases or domains. This fine-tuning process benefits from extensive pretraining data and yields more accurate outputs for specific downstream tasks.

A variety of customization methods are available through the NeMo framework, including supervised fine-tuning and parameter-efficient fine-tuning methods such as low-rank adaptation, or LoRA.

To boost model quality, developers can align their models with NeMo Aligner and datasets annotated by Nemotron-4 340B Reward. Alignment is a key step in training LLMs, where a model's behavior is fine-tuned using algorithms like reinforcement learning from human feedback (RLHF) to ensure its outputs are safe, accurate, contextually appropriate and consistent with its intended goals.

Businesses seeking enterprise-grade support and security for production environments can also access NeMo and TensorRT-LLM through the cloud-native NVIDIA AI Enterprise software platform, which provides accelerated and efficient runtimes for generative AI foundation models.

Evaluating Model Security and Getting Started The Nemotron-4 340B Instruct model underwent extensive safety evaluation, including adversarial tests, and performed well across a wide range of risk indicators. Users should still perform careful evaluation of the model's outputs to ensure the synthetically generated data is suitable, safe and accurate for their use case.

For more information on model security and safety evaluation, read the model card.

Download Nemotron-4 340B models via NVIDIA NGC and Hugging Face. For more details, read the research papers on the model and dataset.

See notice regarding software product information.
LINK: https://blogs.nvidia.com/blog/nemotron-4-synthetic-data-generation-llm...
See more stories from nvidia

North America Stories

17/07/2024

Glen Mazzara's Six Rules for Writing a Compelling TV Episode

by Sundance Collab team Showrunner Glen Mazzara (The Walking Dead, The Shield) thinks that writing for TV has one rule: Cool people doing cool stuff every epis...

17/07/2024

Give Me the Backstory: Get to Know Jeff Zimbalist, the Filmmaker Behind Skywalkers: A Love Story

By Bailey Pennick One of the most exciting things about the Sundance Film Festi...

17/07/2024

The Greatest Night in Pop, Girls State Lead 2024 Sundance Institute-Supported Emmy Nominations

With limited-edition summer merch out now and our Early Bird Ticket Sale happeni...

17/07/2024

Artemis II SLS Core Stage Heading to NASA's Kennedy Space Center

Space Launch System Core Stage for Artemis II Rocket Loaded onto Pegasus Barge. Image Credit: NASA...

17/07/2024

L3Harris Boeing 787-9 Full Flight Simulator Enters Into Service At ANA's Tokyo Training Centre

L3Harris, a global leader in the manufacture of flight training devices, has had...

17/07/2024

NAB Asks Court to Toss Ownership Rules

WASHINGTON, D.C. The National Association of Broadcasters (NAB) has filed its initial brief in its challenge to the Federal Communications Commissions (FCC) loc...

17/07/2024

Interra Systems To Show BATON 9 AI/ML-Based Media QC Solution at IBC Show

CUPERTINO, Calif. Interra Systems will show the latest developments in its content-aware quality control (QC), monitoring, captioning and analysts at IBC 2024, ...

17/07/2024

Canon Introduce EOS R1, EOS R5 Mark II Mirrorless Cameras

MELVILLE, N.Y. Canon has launched the EOS R1 and EOS R5 Mark II, two full-frame mirrorless cameras for professional still photography and video production....

17/07/2024

Design, Immersive and VFX Studio Boma Launches in Los Angeles

Design, Immersive and VFX Studio Boma Launches in Los Angeles Brie Clayton July 17, 2024 0 Comments Boma, a new breed of a fully-remote studio special...

17/07/2024

ITV Content Services Restores Archived Classics with Cintel Scanner G3 HDR+

ITV Content Services Restores Archived Classics with Cintel Scanner G3 HDR Brie Clayton July 17, 2024 0 Comments Blackmagic Design today announced that ...

17/07/2024

Study: Consumers Hitting Spending Limits for Video

PORTSMOUTH, N.H. As streaming companies continue to ramp up prices, a new study from Hub Entertainment Research addresses an increasingly central issue: How muc...

17/07/2024

Streaming Jumps to a Record 40% of TV Viewing in June

The shift to streaming passed a new milestone in June, when time spent streaming hit 40.3% of all TV viewing, a record according to Nielsen's The Gauge. In ...

17/07/2024

Eight News-Press & Gazette Stations Join NewsOn

SEATTLE Eight News-Press & Gazette (NPG) stations have joined NewsON, Sinclair Broadcast Group's free, multiplatform streaming service offering local news c...

17/07/2024

Team USA TV FAST Channel Goes Live with 24/7 Olympic Team Coverage

COLORADO SPRINGS, Colo. The U.S. Olympic & Paralympic Committee (USOPC), NBCUniversal and FAST studios have launched Team USA TV, a free ad-supported (FAST) TV ...

17/07/2024

StreamGM and The Yard Mcr Support Grassroots Music Using Blackmagic Design

StreamGM and The Yard Mcr Support Grassroots Music Using Blackmagic Design Brie Clayton July 16, 2024 0 Comments Manchester music and event space The ...

17/07/2024

Content Creator Roger Seng Sings Ci's Praises for Video Asset Management

Content Creator Roger Seng Sings Ci's Praises for Video Asset Management Brie Clayton July 16, 2024 0 Comments Roger Seng wears many hats in his p...

17/07/2024

ASG Expands its Software-Defined Workflow Solutions with Virtual Truck Offering

ASG Expands its Software-Defined Workflow Solutions with Virtual Truck Offering Brie Clayton July 16, 2024 0 Comments New Cloud-Based Infrastructure E...

17/07/2024

MRMC Unveils the Super Milo: Precision and Speed Redefined in Motion Control

MRMC Unveils the Super Milo: Precision and Speed Redefined in Motion Control Brie Clayton July 16, 2024 0 Comments MRMC today announced the release of...

17/07/2024

Get Organized: A Filmmaker's Guide to Planning a Successful Video Shoot

Get Organized: A Filmmaker's Guide to Planning a Successful Video Shoot Sean Alami July 16, 2024 0 Comments I've learned that planning for a v...

17/07/2024

WRAL's Live After 5' summer concert series pushes start time back due to heat

The free Live After 5 summer concert series is adjusting its start time for it...

17/07/2024

Paris 2024: Inside Look at OBS Plans for 11,000 Hours of Games Coverage

Paris 2024: Inside Look at OBS Plans for 11,000 Hours of Games Coverage The effort ranges from the largest live-TV effort ever to personal Athlete Moments By K...

17/07/2024

Paris 2024: Topping 40, Telemundo's Olympic Commentary Team Is Its Largest Ever

Paris 2024: Topping 40, Telemundo's Olympic Commentary Team Is Its Largest E...

17/07/2024

Interview: Appear CTO Andy Rayner on R&D, Security and Future Growth

Interview: Appear CTO Andy Rayner on R&D, security and future growth By Jo Ruddock Wednesday, July 10, 2024 - 13:59 Print This Story As Appear celebrates ...

17/07/2024

Globecast Employs IP and Cloud Distribution for Malaga Premier Padel P1

Globecast employs IP and cloud distribution for Malaga Premier Padel P1 By David Davies Monday, July 15, 2024 - 15:59 Print This Story In a year that is e...

17/07/2024

Paris 2024: NBC Sports Teams Up With Chyron for Live Broadcast Graphics at Olympics

Paris 2024: NBC Sports Teams Up With Chyron for Live Broadcast Graphics at Olymp...

17/07/2024

Japanese Female Pro-Wrestling Drama Series The Queen of Villains' Premieres September 19

Back to All News Japanese Female Pro-Wrestling Drama Series The Queen of Villa...

17/07/2024

Decoding How AI-Powered Upscaling on NVIDIA RTX Improves Video Quality

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...

17/07/2024

July 16, 2024

New sleep study aims to understand cognitive decline in women Scripps Research launches digital trial to identify sleep-related risk factors for Alzheimer's...

16/07/2024

SNS Trophy Case: Industry Awards And Accolades

SNS has won multiple awards for its innovative technology solutions. At SNS, we are passionate about empowering creative teams with the tools they need to creat...

16/07/2024

ASG Unveils Virtual Truck Cloud Production For Live, Remote Events

EMERYVILLE, Calif. Advanced Systems Group (ASG) has launched Virtual Truck, a cloud-based remote production solution for live sports, music, corporate events an...

16/07/2024

Sling TV Launches 4K Streams with MLB All-Star Game

ENGLEWOOD, Colo. Sling TV has announced that it will begin offering its first 4K streams of sporting events and that the first 4K stream will be the July 16, 20...

16/07/2024

Dr. Judith Christie McAllister Joins Gospel Performance Summer Program as Artist-in-Residence

Dr. Judith Christie McAllister Joins Gospel Performance Summer Program as Artist...

16/07/2024

Good Morning Football' Extension Series To Stream on Roku Channel

Good Morning Football, the NFL's football-focused panel talk show, is further expanding with a new two-hour edition titled GMFB: Overtime that will stream o...

16/07/2024

Charter Makes Spectrum News Plus Available to Non-Cable Subscribers

Charter Communications said its Spectrum News Plus national news network is now available to stream for free on Xumo Play....

16/07/2024

Season 3 of Dateline' Podcast Missing in America' Out July 16

Season three of NBC podcast Dateline: Missing in America premieres Tuesday, July 16. Josh Mankiewicz, Dateline NBC correspondent, investigates missing persons c...

16/07/2024

Big Brother' Cast Revealed for Season 26

CBS has announced the 16 new Houseguests for season 26 of Big Brother. The season begins Wednesday, July 17....

16/07/2024

Fox and Foxtel Join To Develop Scripted Series

Fox Entertainment and Foxtel Group said they made a deal to co-develop scripted series for the U.S. and Australian markets....

16/07/2024

PBS Shares Fall Premieres

PBS shared its fall schedule at the Television Critics Association Summer Press Tour in Pasadena, California, including a John Leguizamo special about Hispanic ...

16/07/2024

TCLtv Plus Streaming Service Launches on Roku Devices, TVs

Smart TV maker TCL said it launched its streaming service, TCLtv Plus, on Roku devices and TVs made by TCL that use the Roku TV operating system....

16/07/2024

Ikegami Announces European Market Introduction of Ultra-C...

Ikegami announces a space-saving addition to its range of broadcast quality television production equipment. Previewed at the April 2024 NAB Show in Las Vegas a...

16/07/2024

BBC Studios picks Blue Lucy BLAM for agile media operatio...

Blue Lucy has signed a multi-year contract with BBC Studios, who will continue using its BLAM media management platform. The new contract is an extension of a p...

16/07/2024

OOONA to Demonstrate Latest Advances in Media Localizatio...

OOONA will demonstrate the latest additions and refinements to its award-winning media localization management and production platform on stand 3.C69 at IBC 202...

16/07/2024

Cubbit the first geo-distributed cloud enabler raises twe...

Cubbit, the first geo-distributed cloud storage enabler, today announced the closing of its $12.5M funding round. With this new funding the company will enable ...

16/07/2024

Avid Upgrades NEXIS Software

BURLINGTON, Mass. Avid has announced improvements to its software-defined storage solution, Avid NEXIS that are designed to help users meet the most demanding a...

16/07/2024

Spectrum News+ Launches as FAST Channel on Xumo Play

STAMFORD, Conn. Spectrum News has launched a streaming service Spectrum News+ on Xumo Play, the free ad-supported streaming television (FAST) service....

16/07/2024

PurpleTV Adds Video-Taped Radio Talk Shows from Civic Media

MILWAUKEE The recently launched PurpleTV, political broadcast TV channel has ramped up its content with a deal with Civic Media....

16/07/2024

John Stamos Joins Zeam As Chief Innovation Officer

Zeam Media has announced that John Stamos, the popular actor, musician, producer and best-selling author, is adding a new role to his new illustrious resume: ch...

16/07/2024

NBC Sports Highlights Importance of Sony Solutions in its Paris Olympics Coverage

STAMFORD, Conn. NBC Sports has announced that it is using Sony Electronics to pr...

16/07/2024

ASG Expands its Software-Defined Workflow Solutions with...

Advanced Systems Group LLC (ASG), a technology and services provider for media creatives and content owners, has launched a new solution in cloud-based remote p...

16/07/2024

The UKs Most Popular Radio Stations Moves To PMC Equipped...

After 18 years at Wogan House, BBC Radio 2 has moved back into the nearby BBC Broadcasting House, London, and is housed in new studios designed by specialist br...