
NVIDIA today announced Nemotron-4 340B, a family of open models that developers can use to generate synthetic data for training large language models (LLMs) for commercial applications across healthcare, finance, manufacturing, retail and every other industry.
High-quality training data plays a critical role in the performance, accuracy and quality of responses from a custom LLM - but robust datasets can be prohibitively expensive and difficult to access.
Through a uniquely permissive open model license, Nemotron-4 340B gives developers a free, scalable way to generate synthetic data that can help build powerful LLMs.
The Nemotron-4 340B family includes base, instruct and reward models that form a pipeline to generate synthetic data used for training and refining LLMs. The models are optimized to work with NVIDIA NeMo, an open-source framework for end-to-end model training, including data curation, customization and evaluation. They're also optimized for inference with the open-source NVIDIA TensorRT-LLM library.
Nemotron-4 340B can be downloaded now from the NVIDIA NGC catalog and from Hugging Face, where developers can also use the Train on DGX Cloud service to easily fine-tune open AI models. Developers will soon be able to access the models at ai.nvidia.com, where they'll be packaged as an NVIDIA NIM microservice with a standard application programming interface that can be deployed anywhere.
Navigating Nemotron to Generate Synthetic Data LLMs can help developers generate synthetic training data in scenarios where access to large, diverse labeled datasets is limited.
The Nemotron-4 340B Instruct model creates diverse synthetic data that mimics the characteristics of real-world data, helping improve data quality to increase the performance and robustness of custom LLMs across various domains.
Then, to boost the quality of the AI-generated data, developers can use the Nemotron-4 340B Reward model to filter for high-quality responses. Nemotron-4 340B Reward grades responses on five attributes: helpfulness, correctness, coherence, complexity and verbosity. It's currently first place on the Hugging Face RewardBench leaderboard, created by AI2, for evaluating the capabilities, safety and pitfalls of reward models.
In this synthetic data generation pipeline, (1) the Nemotron-4 340B Instruct model is first used to produce synthetic text-based output. An evaluator model, (2) Nemotron-4 340B Reward, then assesses this generated text - providing feedback that guides iterative improvements and ensures the synthetic data is accurate, relevant and aligned with specific requirements. Researchers can also create their own instruct or reward models by customizing the Nemotron-4 340B Base model using their proprietary data, combined with the included HelpSteer2 dataset.
Fine-Tuning With NeMo, Optimizing for Inference With TensorRT-LLM Using open-source NVIDIA NeMo and NVIDIA TensorRT-LLM, developers can optimize the efficiency of their instruct and reward models to generate synthetic data and to score responses.
All Nemotron-4 340B models are optimized with TensorRT-LLM to take advantage of tensor parallelism, a type of model parallelism in which individual weight matrices are split across multiple GPUs and servers, enabling efficient inference at scale.
Nemotron-4 340B Base, trained on 9 trillion tokens, can be customized using the NeMo framework to adapt to specific use cases or domains. This fine-tuning process benefits from extensive pretraining data and yields more accurate outputs for specific downstream tasks.
A variety of customization methods are available through the NeMo framework, including supervised fine-tuning and parameter-efficient fine-tuning methods such as low-rank adaptation, or LoRA.
To boost model quality, developers can align their models with NeMo Aligner and datasets annotated by Nemotron-4 340B Reward. Alignment is a key step in training LLMs, where a model's behavior is fine-tuned using algorithms like reinforcement learning from human feedback (RLHF) to ensure its outputs are safe, accurate, contextually appropriate and consistent with its intended goals.
Businesses seeking enterprise-grade support and security for production environments can also access NeMo and TensorRT-LLM through the cloud-native NVIDIA AI Enterprise software platform, which provides accelerated and efficient runtimes for generative AI foundation models.
Evaluating Model Security and Getting Started The Nemotron-4 340B Instruct model underwent extensive safety evaluation, including adversarial tests, and performed well across a wide range of risk indicators. Users should still perform careful evaluation of the model's outputs to ensure the synthetically generated data is suitable, safe and accurate for their use case.
For more information on model security and safety evaluation, read the model card.
Download Nemotron-4 340B models via NVIDIA NGC and Hugging Face. For more details, read the research papers on the model and dataset.
See notice regarding software product information.
North America Stories
10/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
10/04/2026
Frequency, the engine behind the worlds leading streaming television channels, today launched its AI platform for Frequency Studio, powering the entire channel ...
09/04/2026
Zixi will demonstrate IP-based live video workflow solutions at NAB Show 2026 (Booth W2057).
The industry is moving quickly toward IP-based distribution as br...
09/04/2026
Global women's elite sports revenues are expected to reach at least $3 billi...
09/04/2026
Monitor engineer Gavin Tempany mixed Kylie Minogue s Tension Tour on a Solid Sta...
09/04/2026
KOKUSAI DENKI Electric America will exhibit at NAB Show 2026 (Booth C5507), debu...
09/04/2026
With the 2025-26 NBA regular season concluded and the playoffs beginning next we...
09/04/2026
Telestream and Mimir have announced an integration connecting Telestream's V...
09/04/2026
Bitmovin has expanded its Live Encoding and Observability solutions to provide r...
09/04/2026
The Nashville Predators and Scripps Sports have announced a multi-year media rights agreement covering local preseason, regular season, and first-round playoff ...
09/04/2026
Advanced Systems Group, LLC has announced a partnership with Beam Dynamics to offer the Beam Asset and License Intelligence Platform to its clients. The platfor...
09/04/2026
Lawo has unveiled Edge One, a combined video and audio stagebox for broadcast and Pro AV workflows. The device will be on display at NAB Show (Booth C2108, Apri...
09/04/2026
The Society of Motion Picture and Television Engineers (SMPTE) will host the SMPTE ST 2110 IP Media Roadshow on Tuesday, April 21, 2026, at the Las Vegas Conven...
09/04/2026
The Atlanta Braves have completed upgrades to video displays in and around Truist Park ahead of the 2026 MLB season. The upgrades include the Delta Out-of-Town ...
09/04/2026
The University of Southern California has contracted Daktronics (NASDAQ: DAKT) of Brookings, South Dakota, to manufacture and install 22 LED displays across fou...
09/04/2026
Backlight, the media technology company behind Iconik and Wildmoka, will showcase its Creative Operations Platform at NAB Show 2026 (Booth N2829, April 19-22). ...
09/04/2026
MotoAmerica and V10 Entertainment have announced a partnership to broadcast MotoAmerica Superbike racing on VICE TV for the 2026 season. Coverage begins live on...
09/04/2026
Proton Camera Innovations has announced the appointment of Tod Musgrave as US Sa...
09/04/2026
Designed specifically for live sports broadcasting, new platform features IP-nat...
09/04/2026
Blending 1990s DNA, modern motion theory, and a distinctly colorful brand identi...
09/04/2026
Technical capability is essential, but long-term success often depends on how we...
09/04/2026
15 feature films, including fiction and documentaries, along with six short film...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Purpose Built Monitoring From Live Production to Master Control to OTT, Across On Prem and Cloud Environments
At the 2026 NAB Show (April 19-22, Las Vegas Con...
09/04/2026
Purpose Built Monitoring From Live Production to Master Control to OTT, Across On Prem and Cloud Environments
At the 2026 NAB Show (April 19-22, Las Vegas Con...
09/04/2026
New advances meet surge in demand for broadcast-grade IP migration as C-band spectrum auctions approach
LTN announces major enhancements to its purpose-built g...
09/04/2026
Hitomi Broadcast has expanded its sales team with the addition of Nicola Milburn as Technical Sales Manager. In this role, Nicola will work with customers and p...
09/04/2026
Revolutionary combined solution brings sub-second latency, resilient delivery, and workflow orchestration to global broadcasters and digital platforms
Layercak...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Yospace exceeded 10 billion dynamically stitched ads in a single month, reaching 11.6 billion as ad-supported streaming surged. Driven by a packed global sports...
09/04/2026
Bitmovin has expanded its Live Encoding and Observability solutions to provide true end to end, real time insights across live streaming workflows, from encodin...
09/04/2026
Leyra has announced the launch of Icelandic public broadcaster R V's streaming service on Samsung and LG Smart TVs. R V is the first public broadcaster to d...
09/04/2026
3Play Media, a global leader in video accessibility and localization, today announced an AI Dubbing solution purpose-built for YouTube creators. The company, wh...
09/04/2026
Big Blue Marble, a provider of broadcast-grade, cloud-native video solutions, has been recognized as an Amazon Web Services (AWS) Managed Services Provider (MSP...
09/04/2026
The Professional Darts Corporation (PDC) has officially launched its revamped global streaming service, PDC TV, in collaboration with Cleeng and sports technolo...
09/04/2026
Cleeng, the Subscriber Retention Management (SRM ) pioneer, today announced a raft of new AI agents for its AI Assistant to accelerate decision-making and autom...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
09/04/2026
New Charging, Connectivity, and Mounting Solutions Now Available
LAS VEGAS, APRIL 8, 2026 Pliant Technologies will highlight a range of new accessories at th...
09/04/2026
As live production continues its shift to IP, the challenge is no longer adoption it's reliability. At NAB Show 2026 (Booth W2033), Media Links will demon...
09/04/2026
QuickLink, a leading provider of award-winning video production and remote guest contribution solutions, launches its new AI-powered add-on for its StudioPro p...