
In the age of AI reasoning, training smarter, more capable models is critical to scaling intelligence. Delivering the massive performance to meet this new age requires breakthroughs across GPUs, CPUs, NICs, scale-up and scale-out networking, system architectures, and mountains of software and algorithms.
In MLPerf Training v5.1 - the latest round in a long-running series of industry-standard tests of AI training performance - NVIDIA swept all seven tests, delivering the fastest time to train across large language models (LLMs), image generation, recommender systems, computer vision and graph neural networks.
NVIDIA was also the only platform to submit results on every test, underscoring the rich programmability of NVIDIA GPUs, and the maturity and versatility of its CUDA software stack.
NVIDIA Blackwell Ultra Doubles Down The GB300 NVL72 rack-scale system, powered by the NVIDIA Blackwell Ultra GPU architecture, made its debut in MLPerf Training this round, following a record-setting showing in the most recent MLPerf Inference round.
Compared with the prior-generation Hopper architecture, the Blackwell Ultra-based GB300 NVL72 delivered more than 4x the Llama 3.1 405B pretraining and nearly 5x the Llama 2 70B LoRA fine-tuning performance using the same number of GPUs.
These gains were fueled by Blackwell Ultra's architectural improvements - including new Tensor Cores that offer 15 petaflops of NVFP4 AI compute, twice the attention-layer compute and 279GB of HBM3e memory - as well as new training methods that tapped into the architecture's enormous NVFP4 compute performance.
Connecting multiple GB300 NVL72 systems, the NVIDIA Quantum-X800 InfiniBand platform - the industry's first end-to-end 800 Gb/s networking platform - also made its MLPerf debut, doubling scale-out networking bandwidth compared with the prior generation.
Performance Unlocked: NVFP4 Accelerates LLM Training Key to the outstanding results this round was performing calculations using NVFP4 precision - a first in the history of MLPerf Training.
One way to increase compute performance is to build an architecture capable of performing computations on data represented with fewer bits, and then to perform those calculations at a faster rate. However, lower precision means less information is available in each calculation. This means using low-precision calculations in the training process calls for careful design decisions to keep results accurate.
NVIDIA teams innovated at every layer of the stack to adopt FP4 precision for LLM training. The NVIDIA Blackwell GPU can perform FP4 calculations - including the NVIDIA-designed NVFP4 format as well as other FP4 variants - at double the rate of FP8. Blackwell Ultra boosts that to 3x, enabling the GPUs to deliver substantially greater AI compute performance.
NVIDIA is the only platform to date that has submitted MLPerf Training results with calculations performed using FP4 precision while meeting the benchmark's strict accuracy requirements.
NVIDIA Blackwell Scales to New Heights NVIDIA set a new Llama 3.1 405B time-to-train record of just 10 minutes, powered by more than 5,000 Blackwell GPUs working together efficiently. This entry was 2.7x faster than the best Blackwell-based result submitted in the prior round, resulting from efficient scaling to more than twice the number of GPUs, as well as the use of NVFP4 precision to dramatically increase the effective performance of each Blackwell GPU.
To illustrate the performance increase per GPU, NVIDIA submitted results this round using 2,560 Blackwell GPUs, achieving a time to train of 18.79 minutes - 45% faster than the submission last round using 2,496 GPUs.
New Benchmarks, New Records NVIDIA also set performance records on the two new benchmarks added this round: Llama 3.1 8B and FLUX.1.
Llama 3.1 8B - a compact yet highly capable LLM - replaced the long-running BERT-large model, adding a modern, smaller LLM to the benchmark suite. NVIDIA submitted results with up to 512 Blackwell Ultra GPUs, setting the bar at 5.2 minutes to train.
In addition, FLUX.1 - a state-of-the-art image generation model - replaced Stable Diffusion v2, with only the NVIDIA platform submitting results on the benchmark. NVIDIA submitted results using 1,152 Blackwell GPUs, setting a record time to train of 12.5 minutes.
NVIDIA continued to hold records on the existing graph neural network, object detection and recommender system tests.
A Broad and Deep Partner Ecosystem The NVIDIA ecosystem participated extensively this round, with compelling submissions from 15 organizations including ASUS, Dell Technologies, Giga Computing, Hewlett Packard Enterprise, Krai, Lambda, Lenovo, Nebius, Quanta Cloud Technology, Supermicro, University of Florida, Verda (formerly DataCrunch) and Wiwynn.
NVIDIA is innovating at a one-year rhythm, driving significant and rapid performance increases across pretraining, post-training and inference - paving the way to new levels of intelligence and accelerating AI adoption.
See more NVIDIA performance data on the Data Center Deep Learning Product Performance Hub and Performance Explorer pages.
Most recent headlines
12/12/2025
HUDSON, Mass. BitFire and Appear have struck a strategic partnership aimed at offering broadcasters, sports leagues and streaming platforms a faster, more flexi...
12/12/2025
The broadcast industry is evolving faster than ever. #IPWorkflows #remoteproduction, and next-gen audio systems are reshaping how teams design, deliver, and sca...
12/12/2025
LOS ANGELES The payroll and production accounting platform Wrapbook has announced the acquisition of Cinapse, a modern scheduling platform for film and televisi...
12/12/2025
DEHLI Ross Video has announced that it is expanding and restructuring its commercial and technical teams in the South Asian Association for Regional Cooperation...
12/12/2025
LONDON Following the success of its UK launch in January 2025, Rise AV, the global not-for-profit initiative dedicated to supporting and advancing women in the ...
12/12/2025
SAN FRANCISCO Ad-supported streaming service Tubi next week will launch Matter Casting, a new casting standard that will enable seamless mobile-to-TV viewing di...
12/12/2025
LOS ANGELES The Hollywood Professional Association (HPA) today said futurist Robert Tercek, creative technologist Jessie Hughes from Leonardo.AI and Emmy-winnin...
12/12/2025
Scripps Research team pioneers an efficient way to stereoselectively add fluorine to drug-like molecules A new method uses a novel catalyst and inexpensive fluo...
11/12/2025
Thomson and the Center for News, Technology and Innovation (CNTI) convened a two-day workshop in Sarajevo bringing together more than 35 journalists, editors, p...
11/12/2025
ESPN's Aims for Spectacular With Heisman Trophy ShowEvent firsts include 1080p HDR production airing on both national broadcast and cableBy Dan Daley, Audio...
11/12/2025
SVG Students To Watch: Frankie Patton, University of ColoradoThe 2025 grad is hitting the ground running as a PA on national broadcastsBy Brandon Costa, Directo...
11/12/2025
SVG Summit 2025 Technology Exhibits Preview, Part 3By SVG Staff
Thursday, December 11, 2025 - 7:24 am
Print This Story | Subscribe
Story Highlights
The 2...
11/12/2025
SVG Sit-Down: What Makes Gen Z, X, and Y Fans Tick? Dave Gavant of WSC Sports Go...
11/12/2025
SVG Summit 2025 Preview: 5G, MXL, Spectrum Loss, and Outerspace on Tap for Tues...
11/12/2025
2025 Sports Broadcasting Hall of Fame: David Levy, Turner Titan and Master of Al...
11/12/2025
SVG Launches Follow the Money' Podcast: Go Inside the Sports Media Biz with...
11/12/2025
A Deep Dive Inside Game Creek Video's Bird and Magic Mobile Units, Home to A...
11/12/2025
How Sound Effects for Monsters Funday Football' Emulated the Sonic Soul of ...
11/12/2025
SVG New Sponsor Spotlight: CSP Mobile Productions' Len Chase on Upgrading Tr...
11/12/2025
Having the right song soundtrack your moves can make all the difference when gam...
11/12/2025
It's been a big year for Taylor Swift. Her highly anticipated album The Life...
11/12/2025
New satellites for the SDA Tranche 1 Tracking program in production at L3Harris&...
11/12/2025
The Meadowlands system, a compact and mobile version of the CCS, uses ground-based radio frequency units to disrupt satellite communications....
11/12/2025
The L3Harris demonstration united tactical communications devices, counter-UAS c...
11/12/2025
Throughout 2025, L3Harris delivered innovative solutions to U.S. and allied warfighters across every domain.
With an unrelenting commitment to excellence, our...
11/12/2025
A Majority of the World's Population (51%) Identify As Soccer Fans
The 2025 MLB postseason notched 58.2 billion viewing minutes, up +24% from the prior y...
11/12/2025
WALTHAM, Mass. Video-over-IP software provider Zixi said Roi Sasson has joined the company as vice president, engineering....
11/12/2025
MOUNTAIN VIEW, Calif. In a move that highlights the growing competition between broadcasters and CTV platforms for local advertising, LG Ad Solutions has announ...
11/12/2025
Boston Conservatory Earns Several Best of Accolades in 2025 Highlights include a faculty Grammy win, a seventh consecutive year on Playbill's list of co...
11/12/2025
RASTATT, Germany Lawo and the Society of Motion Picture and Television Engineers (SMPTE) have partnered to launch the SMPTE ST 2110 Practical Lab, an immersive ...
11/12/2025
PHILADELPHIA Comcasts Xfinity operating brand has announced the launch of new national video plans with all-in pricing that the operator said will provide custo...
11/12/2025
After eight years of declines, MoffettNathansons new Cord Cutting Monitor for Q3 2025 shows that pay TV subscribers to linear TV packages rose by 303,000, the f...
11/12/2025
Happy Holidays from Berklee Enjoy this years holiday student-performance video.
December 10, 2025
By
Office of the President
Dear Berklee community,
As w...
11/12/2025
Unveiling what it describes as the most capable model series yet for professional knowledge work, OpenAI launched GPT-5.2 today. The model was trained and deplo...
11/12/2025
11 Dec 2025
VEON's Banglalink Receives Regulatory Approval to Launch Digita...
11/12/2025
Thursday 11 December 2025
Sky Arts paints a picture of Britain's beauty as ...
11/12/2025
Back to All News
Meet the Most Relatable Hero This Holiday: Main Trailer and Po...
11/12/2025
Back to All News
High Tides: Netflix Shares Release Date Final Season
Credit: Netflix / Thomas Nolf
Entertainment
11 December 2025
GlobalNetherlandsBelgium...
11/12/2025
Back to All News
Made in Texas: How Netflix House Dallas Is Leaving a Lasting Footprint
From left to right: America's Sweethearts: Dallas Cowboys Cheerl...
11/12/2025
Back to All News
The Second Part of One Hundred Years of Solitude Will Arrive i...
11/12/2025
It's not every day we're asked to create a website for a classical musician!
So, we were delighted to help pianist Georgina Duncan as she embarks on he...
11/12/2025
The Hollywood Professional Association (HPA) today announced additional programm...
11/12/2025
It's beginning to look a lot like Christmas!
Irish golfing hero Shane Lowry...
11/12/2025
RT Sport Awards 2025 live on RT One and RT Player at 8:05pm on Saturday 20 December
On Saturday 20 December live on RT One and RT Player at the earlier ti...
11/12/2025
Hunters, saddle up - adventure awaits in the cloud.
Journey into the world of M...
11/12/2025
Sash is ready to samba as Fair City star Stephanie Kelly is revealed as the late...
11/12/2025
Dalet, a leading provider of cloud-native, end-to-end media workflow solutions, ...
10/12/2025
Sound-Alike Commercials Are Part of Sports' Soundtrack Johnny Cash for Coca-Cola is the latest in a long litany of sonic approximationsBy Dan Daley, Audio ...
10/12/2025
Immersive Sound Is Logical Next Step for Sports VenuesSound-systems suppliers are sanguine, but the market has its challengesBy Dan Daley, Audio Editor
Wednes...
10/12/2025
The Romans Built Arenas for Immersive Sound 2,000 Years AgoThe historic Arena of Nimes in France is still in use todayBy Dan Daley, Audio Editor
Wednesday, De...