
NVIDIA today announced Nemotron-4 340B, a family of open models that developers can use to generate synthetic data for training large language models (LLMs) for commercial applications across healthcare, finance, manufacturing, retail and every other industry.
High-quality training data plays a critical role in the performance, accuracy and quality of responses from a custom LLM, but robust datasets can be prohibitively expensive and difficult to access.
Through a uniquely permissive open model license, Nemotron-4 340B gives developers a free, scalable way to generate synthetic data that can help build powerful LLMs.
The Nemotron-4 340B family includes base, instruct and reward models that form a pipeline to generate synthetic data used for training and refining LLMs. The models are optimized to work with NVIDIA NeMo, an open-source framework for end-to-end model training, including data curation, customization and evaluation. They're also optimized for inference with the open-source NVIDIA TensorRT-LLM library.
Nemotron-4 340B can be downloaded now from the NVIDIA NGC catalog and from Hugging Face, where developers can also use the Train on DGX Cloud service to easily fine-tune open AI models. Developers will soon be able to access the models at ai.nvidia.com, where they'll be packaged as an NVIDIA NIM microservice with a standard application programming interface that can be deployed anywhere.
Navigating Nemotron to Generate Synthetic Data
LLMs can help developers generate synthetic training data in scenarios where access to large, diverse labeled datasets is limited.
The Nemotron-4 340B Instruct model creates diverse synthetic data that mimics the characteristics of real-world data, helping improve data quality to increase the performance and robustness of custom LLMs across various domains.
Then, to boost the quality of the AI-generated data, developers can use the Nemotron-4 340B Reward model to filter for high-quality responses. Nemotron-4 340B Reward grades responses on five attributes: helpfulness, correctness, coherence, complexity and verbosity. It currently holds first place on the Hugging Face RewardBench leaderboard, created by AI2, which evaluates the capabilities, safety and pitfalls of reward models.
In this synthetic data generation pipeline, (1) the Nemotron-4 340B Instruct model is first used to produce synthetic text-based output. An evaluator model, (2) Nemotron-4 340B Reward, then assesses this generated text, providing feedback that guides iterative improvements and helps ensure the synthetic data is accurate, relevant and aligned with specific requirements. Researchers can also create their own instruct or reward models by customizing the Nemotron-4 340B Base model using their proprietary data, combined with the included HelpSteer2 dataset.
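The generate-then-score loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `generate` and `score` are hypothetical callables standing in for calls to the Instruct and Reward models, the per-attribute thresholds are invented, and the stand-in functions return placeholder values rather than real model output.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# The five attributes the Reward model grades, as listed in the text above.
ATTRIBUTES = ("helpfulness", "correctness", "coherence", "complexity", "verbosity")


@dataclass
class ScoredSample:
    prompt: str
    response: str
    scores: Dict[str, float]


def build_synthetic_dataset(
    prompts: List[str],
    generate: Callable[[str], str],                 # hypothetical wrapper around the Instruct model
    score: Callable[[str, str], Dict[str, float]],  # hypothetical wrapper around the Reward model
    thresholds: Dict[str, float],                   # assumed per-attribute minimum scores
) -> List[ScoredSample]:
    """Generate one response per prompt and keep only those that clear every threshold."""
    kept = []
    for prompt in prompts:
        response = generate(prompt)
        scores = score(prompt, response)
        if all(scores[name] >= minimum for name, minimum in thresholds.items()):
            kept.append(ScoredSample(prompt, response, scores))
    return kept


# Stand-in callables so the sketch runs without any model access.
def fake_generate(prompt: str) -> str:
    return f"Answer to: {prompt}"


def fake_score(prompt: str, response: str) -> Dict[str, float]:
    # Placeholder scoring (longer prompts score higher); not real reward output.
    value = min(4.0, len(prompt) / 10)
    return {name: value for name in ATTRIBUTES}


dataset = build_synthetic_dataset(
    ["Explain tensor parallelism.", "Hi"],
    fake_generate,
    fake_score,
    thresholds={"helpfulness": 2.0, "correctness": 2.0},
)
print([sample.prompt for sample in dataset])  # ['Explain tensor parallelism.']
```

In a real pipeline the two stand-ins would be replaced by whatever client a developer uses to reach the deployed models, and the thresholds would be tuned to the target use case.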
Fine-Tuning With NeMo, Optimizing for Inference With TensorRT-LLM
Using open-source NVIDIA NeMo and NVIDIA TensorRT-LLM, developers can optimize the efficiency of their instruct and reward models to generate synthetic data and to score responses.
All Nemotron-4 340B models are optimized with TensorRT-LLM to take advantage of tensor parallelism, a type of model parallelism in which individual weight matrices are split across multiple GPUs and servers, enabling efficient inference at scale.
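To make the idea of splitting a weight matrix concrete, here is a toy sketch in plain Python. It is not TensorRT-LLM code: the "devices" are simulated as list slices, the matrix is split by columns (one common scheme), and all function names are made up for illustration. The point is only that each shard can be multiplied independently and the partial outputs stitched back together to match the unsplit result.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain dense matrix multiply: (m x k) @ (k x n) -> (m x n)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def split_columns(w: Matrix, parts: int) -> List[Matrix]:
    """Split a weight matrix into `parts` column blocks, one per simulated device."""
    n = len(w[0])
    assert n % parts == 0, "columns must divide evenly in this sketch"
    width = n // parts
    return [[row[i * width:(i + 1) * width] for row in w] for i in range(parts)]


def tensor_parallel_matmul(x: Matrix, w: Matrix, parts: int) -> Matrix:
    """Multiply x by each column shard separately, then concatenate the results."""
    shards = split_columns(w, parts)
    # On real hardware each of these multiplies would run on its own GPU.
    partials = [matmul(x, shard) for shard in shards]
    # Gather step: join each row's column blocks back into one full output row.
    return [sum((p[r] for p in partials), []) for r in range(len(x))]


x = [[1.0, 2.0], [3.0, 4.0]]
w = [[1.0, 0.0, 2.0, 1.0], [0.0, 1.0, 1.0, 3.0]]

result = tensor_parallel_matmul(x, w, parts=2)
assert result == matmul(x, w)  # sharded and unsharded results agree
print(result)  # [[1.0, 2.0, 4.0, 7.0], [3.0, 4.0, 10.0, 15.0]]
```

Because each shard holds only a fraction of the columns, no single device needs to store the whole matrix, which is what makes this approach attractive for very large models.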
Nemotron-4 340B Base, trained on 9 trillion tokens, can be customized using the NeMo framework to adapt to specific use cases or domains. This fine-tuning process benefits from extensive pretraining data and yields more accurate outputs for specific downstream tasks.
A variety of customization methods are available through the NeMo framework, including supervised fine-tuning and parameter-efficient fine-tuning methods such as low-rank adaptation, or LoRA.
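As a rough illustration of why LoRA counts as parameter-efficient, the sketch below applies a low-rank update to a frozen weight matrix and compares trainable parameter counts. It follows the standard LoRA formulation (effective weight = W + (alpha / r) * B * A) and is not taken from the NeMo codebase; the function names and the 4096 x 4096 layer size are illustrative assumptions, not Nemotron-4 340B dimensions.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain dense matrix multiply: (m x k) @ (k x n) -> (m x n)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w: Matrix, b: Matrix, a: Matrix, alpha: float, rank: int) -> Matrix:
    """Return W + (alpha / rank) * (B @ A); W stays frozen, only B and A are trained."""
    scale = alpha / rank
    delta = matmul(b, a)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def lora_trainable_params(d_out: int, d_in: int, rank: int) -> int:
    """B is (d_out x rank) and A is (rank x d_in), so rank * (d_out + d_in) values."""
    return rank * (d_out + d_in)


# Tiny worked example: a frozen 2 x 3 weight with a rank-1 update.
w = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
b = [[1.0], [2.0]]       # d_out x rank
a = [[0.5, 0.0, 1.0]]    # rank x d_in
print(lora_effective_weight(w, b, a, alpha=2.0, rank=1))
# [[2.0, 0.0, 2.0], [2.0, 1.0, 4.0]]

# Parameter count for one hypothetical 4096 x 4096 layer at rank 8.
full = 4096 * 4096
lora = lora_trainable_params(4096, 4096, rank=8)
print(full, lora)  # 16777216 65536
```

For that hypothetical layer, the adapter trains 65,536 values instead of about 16.8 million, which is the reason such methods are far cheaper than updating every weight.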
To boost model quality, developers can align their models with NeMo Aligner and datasets annotated by Nemotron-4 340B Reward. Alignment is a key step in training LLMs, where a model's behavior is fine-tuned using algorithms like reinforcement learning from human feedback (RLHF) to ensure its outputs are safe, accurate, contextually appropriate and consistent with its intended goals.
Businesses seeking enterprise-grade support and security for production environments can also access NeMo and TensorRT-LLM through the cloud-native NVIDIA AI Enterprise software platform, which provides accelerated and efficient runtimes for generative AI foundation models.
Evaluating Model Security and Getting Started
The Nemotron-4 340B Instruct model underwent extensive safety evaluation, including adversarial tests, and performed well across a wide range of risk indicators. Users should still perform careful evaluation of the model's outputs to ensure the synthetically generated data is suitable, safe and accurate for their use case.
For more information on model security and safety evaluation, read the model card.
Download Nemotron-4 340B models via NVIDIA NGC and Hugging Face. For more details, read the research papers on the model and dataset.
See notice regarding software product information.