
The full-stack NVIDIA accelerated computing platform has once again demonstrated exceptional performance in the latest MLPerf Training v4.0 benchmarks.
NVIDIA more than tripled the performance on the large language model (LLM) benchmark, based on GPT-3 175B, compared to the record-setting NVIDIA submission made last year. Using an AI supercomputer featuring 11,616 NVIDIA H100 Tensor Core GPUs connected with NVIDIA Quantum-2 InfiniBand networking, NVIDIA achieved this remarkable feat through larger scale - more than triple that of the 3,584 H100 GPU submission a year ago - and extensive full-stack engineering.
Thanks to the scalability of the NVIDIA AI platform, Eos can now train massive AI models like GPT-3 175B even faster, and this great AI performance translates into significant business opportunities. For example, in NVIDIA's recent earnings call, we described how LLM service providers can turn a single dollar invested into seven dollars in just four years running the Llama 3 70B model on NVIDIA HGX H200 servers. This return assumes an LLM service provider serving Llama 3 70B at $0.60/M tokens, with an HGX H200 server throughput of 24,000 tokens/second.
NVIDIA H200 GPU Supercharges Generative AI and HPC The NVIDIA H200 Tensor GPU builds upon the strength of the Hopper architecture, with 141GB of HBM3 memory and over 40% more memory bandwidth compared to the H100 GPU. Pushing the boundaries of what's possible in AI training, the NVIDIA H200 Tensor Core GPU extended the H100's performance by up to 47% in its MLPerf Training debut.
NVIDIA Software Drives Unmatched Performance Gains Additionally, our submissions using a 512 H100 GPU configuration are now up to 27% faster compared to just one year ago due to numerous optimizations to the NVIDIA software stack. This improvement highlights how continuous software enhancements can significantly boost performance, even with the same hardware.
This work also delivered nearly perfect scaling. As the number of GPUs increased by 3.2x - going from 3,584 H100 GPUs last year to 11,616 H100 GPUs with this submission - so did the delivered performance.
Learn more about these optimizations on the NVIDIA Technical Blog.
Excelling at LLM Fine-Tuning As enterprises seek to customize pretrained large language models, LLM fine-tuning is becoming a key industry workload. MLPerf introduced a new LLM fine-tuning benchmark this round, based on the popular low-rank adaptation (LoRA) technique applied to Meta Llama 2 70B.
The NVIDIA platform excelled at this task, scaling from eight to 1,024 GPUs, with the largest-scale NVIDIA submission completing the benchmark in a record 1.5 minutes.
Accelerating Stable Diffusion and GNN Training NVIDIA also accelerated Stable Diffusion v2 training performance by up to 80% at the same system scales submitted last round. These advances reflect numerous enhancements to the NVIDIA software stack, showcasing how software and hardware improvements go hand-in-hand to deliver top-tier performance.
On the new graph neural network (GNN) test based on R-GAT, the NVIDIA platform with H100 GPUs excelled at both small and large scales. The H200 delivered a 47% boost on single-node GNN training compared to the H100. This showcases the powerful performance and high efficiency of NVIDIA GPUs, which make them ideal for a wide range of AI applications.
Broad Ecosystem Support Reflecting the breadth of the NVIDIA AI ecosystem, 10 NVIDIA partners submitted results, including ASUS, Dell Technologies, Fujitsu, GIGABYTE, Hewlett Packard Enterprise, Lenovo, Oracle, Quanta Cloud Technology, Supermicro and Sustainable Metal Cloud. This broad participation, and their own impressive benchmark results, underscores the widespread adoption and trust in NVIDIA's AI platform across the industry.
MLCommons' ongoing work to bring benchmarking best practices to AI computing is vital. By enabling peer-reviewed comparisons of AI and HPC platforms, and keeping pace with the rapid changes that characterize AI computing, MLCommons provides companies everywhere with crucial data that can help guide important purchasing decisions.
And with the NVIDIA Blackwell platform, next-level AI performance on trillion-parameter generative AI models for both training and inference is coming soon.
North America Stories
23/01/2026
ALT Systems, Inc., a leading system integrator and technology solutions provider for the media and entertainment industry, today announced the launch of PixSpan...
23/01/2026
The Alliance for IP Media Solutions (AIMS) will mark a major milestone for Pro AV-over-IP at ISE 2026 with the official launch of Internet Protocol Media Experi...
23/01/2026
KRK, a leader in professional studio monitoring for nearly four decades, will unveil the all new V Series Five at the 2026 NAMM Show, offering attendees an excl...
23/01/2026
SMPTE , the home of media professionals, technologists, and engineers, today announced Steve LLamb, Vice President of Technology Standards and Solutions for Cin...
23/01/2026
IBC today announces that the call for Technical Papers is now open for the IBC2026 Conference, inviting innovators from across the global media, entertainment, ...
23/01/2026
Grass Valley has announced that Asharq News, the leading multi-platform Arabic news service owned by the Saudi Research & Media Group (SRMG), has expanded its c...
23/01/2026
At the SET Expo 2025, a consortium including Qualcomm Technologies, Inc., Motorola, and Rohde & Schwarz successfully demonstrated a real-world proof-of-concept ...
23/01/2026
Dalet, a leading technology and service provider for media-rich organizations, today announced the appointment of Gwen Braygreen as Executive Vice President and...
23/01/2026
Alfalite, Brainstorm, Dejero, Domo Broadcast Systems, FOR-A, KitPlus, Ontario Soluciones and RGB Spectrum partner to demonstrate revolutionary integrated soluti...
23/01/2026
Vizrt, the leader in live production technology revolutionizing viewer experience and engagement, expands its team to ignite a new era of professional-grade pro...
23/01/2026
LOGIC media solutions, an Amazon Web Services (AWS) Advanced Partner specialising in AWS-based media workflows, is one of the official launch partners of the ne...
23/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
23/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
23/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
23/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
23/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
23/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
23/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
22/01/2026
SVG Students To Watch: Chuck Luarasi, Curry CollegeThe Massachusetts native is cutting his teeth with Harvard Athletics, Cape Cod Baseball LeagueBy Brandon Cost...
22/01/2026
Follow the Money, Episode 4: Talking Tech, Sports, and Private Capital With Sam ...
22/01/2026
Fever pitch: WRC is back for the start of the 2026 season with Rallye Monte-Carl...
22/01/2026
FloSports Prepares To Broadcast Outdoor Hockey Game Amidst Brutally Cold Tempera...
22/01/2026
As Paramount Enters the Octagon, UFC's Craig Borsari Previews Production Pl...
22/01/2026
By Jordan Crucchiola
It's a desire you hear so often among those in filmmaking circles. I just want to make cool stuff with my friends. With the NEXT selec...
22/01/2026
Brittany Shyne attends the 2025 Sundance Film Festival premiere of Seeds at The Ray Theatre on January 25, 2025, in Park City, UT. (Photo by Robin Marshall/Sh...
22/01/2026
Joel Edgerton and Felicity Jones appear in Train Dreams by Clint Bentley, an off...
22/01/2026
MELBOURNE, Fla., Jan 22, 2026 - L3Harris Technologies (NYSE: LHX) has received a...
22/01/2026
Strategic hire marks latest milestone in Gracenote's continued expansion into CTV advertising & monetization
New York - January 21, 2026 - Nielsen's Gr...
22/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
22/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
22/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
22/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
22/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
22/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
22/01/2026
AI-powered driver assistance technologies are becoming standard equipment, funda...
22/01/2026
A Four-Time Emmy Award Winner on Defining His SoundCharles David Denler is a Composer and Pianist for film, television, and the Concert Stage. He is a 4 Time E...
22/01/2026
The wait is over, pilots. Flight control support - one of the most community-requested features for GeForce NOW - is live starting today, following its announce...
22/01/2026
AI has taken center stage in financial services, automating the research and exe...
22/01/2026
AI-powered content generation is now embedded in everyday tools like Adobe and Canva, with a slew of agencies and studios incorporating the technology into thei...
21/01/2026
The Living Room Remains Central: Nielsen Highlights Growing TV Screen Dominance ...
21/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
21/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
21/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
21/01/2026
Telestream , the industry's leading provider of content lifecycle management and media workflow orchestration, and Quantum Corporation (NASDAQ: QMCO) today ...
21/01/2026
Lightware s TPN ecosystem brings a new level of predictability and structure to 10G AV-over-IP deployments, offering professional AV integrators a deterministic...
21/01/2026
Wisycom, a global leader in advanced wireless RF solutions, launches its new wideband antenna matrix, MATF, which supports RF and fiber for demanding multi-zone...
21/01/2026
Grass Valley will demonstrate how it is powering scalable, future-ready live production at FOMEX 2026, taking place February 2 4 in Riyadh, Saudi Arabia. Exhibi...
21/01/2026
BCNEXXT, the developers of the advanced playout platform Vipe, today announced that OKAST, the monetization-first OTT platform provider, is using BCNEXXT's ...
21/01/2026
Revamped design enables advanced capabilities, leading with powerful IP to HDMI conversion
Magewell, developer of innovative, high-performance video I/O and I...
21/01/2026
Jan 20th 2026, Changsha Kiloview today announced the launch of two major additions to its AV-over-IP ecosystem: the AVX24-4 Media HUB and KiloLink Station, ma...