Sony Pixel Power calrec Sony

Scaling to New Heights: NVIDIA MLPerf Training Results Showcase Unprecedented Performance and Elasticity

12/06/2024

The full-stack NVIDIA accelerated computing platform has once again demonstrated exceptional performance in the latest MLPerf Training v4.0 benchmarks.

NVIDIA more than tripled the performance on the large language model (LLM) benchmark, based on GPT-3 175B, compared to the record-setting NVIDIA submission made last year. Using an AI supercomputer featuring 11,616 NVIDIA H100 Tensor Core GPUs connected with NVIDIA Quantum-2 InfiniBand networking, NVIDIA achieved this remarkable feat through larger scale - more than triple that of the 3,584 H100 GPU submission a year ago - and extensive full-stack engineering.

Thanks to the scalability of the NVIDIA AI platform, Eos can now train massive AI models like GPT-3 175B even faster, and this great AI performance translates into significant business opportunities. For example, in NVIDIA's recent earnings call, we described how LLM service providers can turn a single dollar invested into seven dollars in just four years running the Llama 3 70B model on NVIDIA HGX H200 servers. This return assumes an LLM service provider serving Llama 3 70B at $0.60/M tokens, with an HGX H200 server throughput of 24,000 tokens/second.

NVIDIA H200 GPU Supercharges Generative AI and HPC The NVIDIA H200 Tensor GPU builds upon the strength of the Hopper architecture, with 141GB of HBM3 memory and over 40% more memory bandwidth compared to the H100 GPU. Pushing the boundaries of what's possible in AI training, the NVIDIA H200 Tensor Core GPU extended the H100's performance by up to 47% in its MLPerf Training debut.

NVIDIA Software Drives Unmatched Performance Gains Additionally, our submissions using a 512 H100 GPU configuration are now up to 27% faster compared to just one year ago due to numerous optimizations to the NVIDIA software stack. This improvement highlights how continuous software enhancements can significantly boost performance, even with the same hardware.

This work also delivered nearly perfect scaling. As the number of GPUs increased by 3.2x - going from 3,584 H100 GPUs last year to 11,616 H100 GPUs with this submission - so did the delivered performance.

Learn more about these optimizations on the NVIDIA Technical Blog.

Excelling at LLM Fine-Tuning As enterprises seek to customize pretrained large language models, LLM fine-tuning is becoming a key industry workload. MLPerf introduced a new LLM fine-tuning benchmark this round, based on the popular low-rank adaptation (LoRA) technique applied to Meta Llama 2 70B.

The NVIDIA platform excelled at this task, scaling from eight to 1,024 GPUs, with the largest-scale NVIDIA submission completing the benchmark in a record 1.5 minutes.

Accelerating Stable Diffusion and GNN Training NVIDIA also accelerated Stable Diffusion v2 training performance by up to 80% at the same system scales submitted last round. These advances reflect numerous enhancements to the NVIDIA software stack, showcasing how software and hardware improvements go hand-in-hand to deliver top-tier performance.

On the new graph neural network (GNN) test based on R-GAT, the NVIDIA platform with H100 GPUs excelled at both small and large scales. The H200 delivered a 47% boost on single-node GNN training compared to the H100. This showcases the powerful performance and high efficiency of NVIDIA GPUs, which make them ideal for a wide range of AI applications.

Broad Ecosystem Support Reflecting the breadth of the NVIDIA AI ecosystem, 10 NVIDIA partners submitted results, including ASUS, Dell Technologies, Fujitsu, GIGABYTE, Hewlett Packard Enterprise, Lenovo, Oracle, Quanta Cloud Technology, Supermicro and Sustainable Metal Cloud. This broad participation, and their own impressive benchmark results, underscores the widespread adoption and trust in NVIDIA's AI platform across the industry.

MLCommons' ongoing work to bring benchmarking best practices to AI computing is vital. By enabling peer-reviewed comparisons of AI and HPC platforms, and keeping pace with the rapid changes that characterize AI computing, MLCommons provides companies everywhere with crucial data that can help guide important purchasing decisions.

And with the NVIDIA Blackwell platform, next-level AI performance on trillion-parameter generative AI models for both training and inference is coming soon.
LINK: https://blogs.nvidia.com/blog/mlperf-training-benchmarks/...
See more stories from nvidia

North America Stories

23/06/2026

PBS Selects LTN to Power Nationwide IP Video Network

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

PBS selects LTN for nationwide IP video network

LTN, a global leader in IP-based video transport and network services, today announced that PBS has selected LTN as its IP video partner to modernize and future...

23/06/2026

The LiveU Q Era Arrives in ANZ with the LU900Q at ABE2026

LiveU will introduce its Q Era to Australia and New Zealand for the first time at ABE2026 on Stand No. 25, (July 30 31). Leading the showcase is the LU900Q, a n...

23/06/2026

Miri Technologies Ships V410 Live 4K Video Encoder-Decode...

Miri Technologies Inc. has begun shipping its highly anticipated V410 live 4K video encoder/decoder for streaming, IP-based production workflows and AV-over-IP ...

23/06/2026

DHD SX2 and TX2 Consoles Go On-Air at Radio Tzafon

DHD audio reports the completion of an upgrade to the audio production facilities at the Galilee headquarters of Radio Tzafon. The station broadcasts two progra...

23/06/2026

Nagravision Launches Nagra Venturi Security Offering

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

ITN Expands Programmatic Local TV Platform

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Warner Bros. Discovery Taps AWS for New AI-Powered Ad Tech

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Study: Younger Viewers More Distracted But More Receptive to Ads

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Chilevisin, ClaroVTR Tap Pixop for 4K FIFA World Cup Feed

Share Copy link Facebook X Linkedin Bluesky Email...

23/06/2026

Imagine Communications Names Greg Garmon as Senior Vice P...

Multifaceted Growth Executive Brings 20+ Years of Experience Leading Organizations Across Tech and M&E Imagine Communications today announced the appointment ...

23/06/2026

Visual Productions Unveils RdmRelay2 Four-channel Relay Control at InfoComm 2026

Visual Productions Unveils RdmRelay2 Four-channel Relay Control at InfoComm 2026 Brie Clayton June 22, 2026 0 Comments New Relay Solution Combines DMX, ...

23/06/2026

SMPTE Makes Its Standards Freely Accessible, Opening Standards Library to the Global Media Technology Community

SMPTE Makes Its Standards Freely Accessible, Opening Standards Library to the Gl...

23/06/2026

NVIDIA Powers Over 400 of the World's 500 Fastest Supercomputers

News Highlights: NVIDIA technology runs 81% of the TOP500 and 90% of the systems new to the list. 26 systems on the TOP500 adopted the NVIDIA Grace CPU, up ei...

23/06/2026

How Businesses Are Building Specialized AI They Can Trust

Companies are asking how to build specialized AI that fits with the way their workflows actually run. The first wave of enterprise AI was about access. Compan...

23/06/2026

June 22, 2026

Newly identified molecule strengthens the eye's response to damage in retinal disease Scripps Research discovery finds that restoring the naturally occurrin...

22/06/2026

Behind the Mic: SportsCenters Lisa Cohn to Retire This June From ESPN as Longest-Tenured Anchor

Behind The Mic provides a roundup of recent news regarding on-air talent, includ...

22/06/2026

Cosm Appoints David Ho as Chief Legal Officer

Cosm has announced the appointment of David Ho as Chief Legal Officer, a newly created executive role reporting to President and CEO Jeb Terry. Ho will oversee ...

22/06/2026

Warner Bros. Discovery and AWS Announce AI-Powered Advertising Technology Platform

Warner Bros. Discovery and Amazon Web Services (AWS) have announced the developm...

22/06/2026

Daktronics Completes Audio Control System Upgrade at Petco Park for San Diego Padres

Daktronics has completed an audio control system upgrade at Petco Park in San Di...

22/06/2026

Accelerate Media Names John Willi President, Launches Accelerate Sports Network

Accelerate Media has named John Willi as President and announced the launch of the Accelerate Sports Network (ASN), a prep sports media and streaming platform c...

22/06/2026

AWSN to Air 3XBA Womens Basketball Tournament Live June 26-27

All Women's Sports Network (AWSN) and 3XBA (3 3 Basketball Association) have announced live television coverage of the annual 3XBA tournament on Friday, Jun...

22/06/2026

OWL AI Appoints Jay Prasad as Chief Executive Officer

OWL AI has announced the appointment of Jay Prasad as Chief Executive Officer and member of the Board of Directors. Prasad succeeds Josh Gwyther, who has served...

22/06/2026

CP Communications Provides RF Support for Inside the NBA at 2026 NBA Finals

CP Communications delivered RF video and audio support for TNT's Inside the NBA at the 2026 NBA Finals, providing main show coverage in San Antonio and ea...

22/06/2026

Polymarket and GRID Partner to Integrate Esports Data and Streaming into Trading Platform

Polymarket has announced a partnership with GRID, an official esports data platf...

22/06/2026

SVG New Sponsor Spotlight: Metinteractive's Rachel Mele, Ken Cyr on Building Technology Backbones for Sports Venues

As sports venues continue to evolve into more video-centric, fan-engagement-driv...

22/06/2026

SVG All-Stars: Corbin Perkins, Chief Engineer, Victory+

As the regional sports production scene shifts toward streaming, this Texan helps lead the engineering behind Victory+'s growing live platform...

22/06/2026

Meet the 2026 Sundance Institute Documentary Edit Intensive Fellows

By Kristin Feeley, Director, Documentary Film & Artist Programs the memories of your elders [are] a scaffolding for you to build your identity on - and t...

22/06/2026

Xumo Expands Contextual Targeting Capabilities Through Gracenote and IRIS.TV Integrations

Expanded integrations give advertisers access to distinct contextual signals acr...

22/06/2026

Greg Garmon Joins Imagine as Senior VP, Americas Video Sales

Share Copy link Facebook X Linkedin Bluesky Email...

22/06/2026

Kaleidescape Breaks the 8K and 4:4:4 Barriers

Share Copy link Facebook X Linkedin Bluesky Email...

22/06/2026

Xilica introduces Dynamic Voice Lift in new Designer

Xilica today announced the release of Dynamic Voice Lift, a new feature in Xilica Designer v4.12 that brings adaptive speech reinforcement to large meeting spac...

22/06/2026

NVIDIA Brings Trusted, 24/7 AI Agents to Telecom Operations

Telecom operators have seen remarkable returns from using generative AI to automate network management, customer care and back-office operations. Most of that i...

22/06/2026

Eco Wave Power Turns Waves Into Watts With NVIDIA AI Infrastructure and Digital Twins

The next era of AI will not be defined by compute alone. Its growth will be dete...

22/06/2026

NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory

Mission, Vision and Veritas - new Los Alamos National Laboratory (LANL) supercom...

22/06/2026

From Materials Simulation to Experimental Astronomy, New NVIDIA AI Software Unlocks Scientific Discoveries

At the ISC conference running in Hamburg this week, NVIDIA is introducing new so...

22/06/2026

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure

For the past two years, the U.S. National Science Foundation's National Arti...

22/06/2026

At ISC, JUPITER Shows What Exascale Science Looks Like

JUPITER, Europe's first exascale supercomputer at Germany's Forschungszentrum J lich, runs on NVIDIA Grace Hopper Superchips and NVIDIA Quantum-X800 Inf...

21/06/2026

FIFAs Oscar Sanchez on World Cup Effort: Were Feeling Good and Where We Want to Be

To call the 2026 FIFA World Cup a big undertaking would be a big understatement....

21/06/2026

Hotter Than a Hot Tub: The 45C Breakthrough to Cool AI's Biggest Machines

Hot tubs sit at about 38 to 40 degrees Celsius, warm enough that most people can only soak for about 15 minutes. NVIDIA's newest AI servers can run their co...

20/06/2026

What's Next for Apogee? Start Here.

What exactly is Apogee Control V3? Control V3 is a new mixer application that controls Apogee interfaces. The new hit feature is that V3 finally allows for...

19/06/2026

NBC Sports U.S. Open Coverage Fires Up 92 Cameras, Bunker cams

Split compound eases operational challenges at Shinnecock Hills Golf Club...

19/06/2026

ESPN's Men's College World Series Production Adds Onsite Studio, POVORA CapCams, Expanded Drone Coverage for Finale in Omaha

North Carolina, Oklahoma meet in the best-of-three Finals as ESPN leans into spe...

19/06/2026

Eurovision secures top four position as content distributor rankings hold steady in Poland

Data from May shows seasonal outdoor trends triggers lower viewing Warsaw, Pola...

19/06/2026

Bitfocus Buttons wins another top industry award

Buttons is best control system in the rAVe Best of Infocomm Awards 2026...

19/06/2026

Mavis Studio Makes iPad Production More Powerful

Mavis Studio Makes iPad Production More Powerful Brie Clayton June 19, 2026 0 Comments InfoComm update brings new NDI Preview, PTZ control, USB audio ...

19/06/2026

Immersive Studio Metaverse Stage Tackles Post with Blackmagic Design

Immersive Studio Metaverse Stage Tackles Post with Blackmagic Design Brie Clayton June 19, 2026 0 Comments New narrative projects rely on DaVinci Reso...

19/06/2026

How to Run the Original 1993 After Effects

How to Run the Original 1993 After Effects Graham Quince June 19, 2026 0 Comments How to the original After Effects v1 in an emulator, and you don'...

19/06/2026

IBC Show to Increase Focus on Networking, Startups

Share Copy link Facebook X Linkedin Bluesky Email...

19/06/2026

Irdeto Taps Axel Gallant as CEO

Share Copy link Facebook X Linkedin Bluesky Email...