Sony Pixel Power calrec Sony

Scaling to New Heights: NVIDIA MLPerf Training Results Showcase Unprecedented Performance and Elasticity

12/06/2024

The full-stack NVIDIA accelerated computing platform has once again demonstrated exceptional performance in the latest MLPerf Training v4.0 benchmarks.

NVIDIA more than tripled the performance on the large language model (LLM) benchmark, based on GPT-3 175B, compared to the record-setting NVIDIA submission made last year. Using an AI supercomputer featuring 11,616 NVIDIA H100 Tensor Core GPUs connected with NVIDIA Quantum-2 InfiniBand networking, NVIDIA achieved this remarkable feat through larger scale - more than triple that of the 3,584 H100 GPU submission a year ago - and extensive full-stack engineering.

Thanks to the scalability of the NVIDIA AI platform, Eos can now train massive AI models like GPT-3 175B even faster, and this great AI performance translates into significant business opportunities. For example, in NVIDIA's recent earnings call, we described how LLM service providers can turn a single dollar invested into seven dollars in just four years running the Llama 3 70B model on NVIDIA HGX H200 servers. This return assumes an LLM service provider serving Llama 3 70B at $0.60/M tokens, with an HGX H200 server throughput of 24,000 tokens/second.

NVIDIA H200 GPU Supercharges Generative AI and HPC The NVIDIA H200 Tensor GPU builds upon the strength of the Hopper architecture, with 141GB of HBM3 memory and over 40% more memory bandwidth compared to the H100 GPU. Pushing the boundaries of what's possible in AI training, the NVIDIA H200 Tensor Core GPU extended the H100's performance by up to 47% in its MLPerf Training debut.

NVIDIA Software Drives Unmatched Performance Gains Additionally, our submissions using a 512 H100 GPU configuration are now up to 27% faster compared to just one year ago due to numerous optimizations to the NVIDIA software stack. This improvement highlights how continuous software enhancements can significantly boost performance, even with the same hardware.

This work also delivered nearly perfect scaling. As the number of GPUs increased by 3.2x - going from 3,584 H100 GPUs last year to 11,616 H100 GPUs with this submission - so did the delivered performance.

Learn more about these optimizations on the NVIDIA Technical Blog.

Excelling at LLM Fine-Tuning As enterprises seek to customize pretrained large language models, LLM fine-tuning is becoming a key industry workload. MLPerf introduced a new LLM fine-tuning benchmark this round, based on the popular low-rank adaptation (LoRA) technique applied to Meta Llama 2 70B.

The NVIDIA platform excelled at this task, scaling from eight to 1,024 GPUs, with the largest-scale NVIDIA submission completing the benchmark in a record 1.5 minutes.

Accelerating Stable Diffusion and GNN Training NVIDIA also accelerated Stable Diffusion v2 training performance by up to 80% at the same system scales submitted last round. These advances reflect numerous enhancements to the NVIDIA software stack, showcasing how software and hardware improvements go hand-in-hand to deliver top-tier performance.

On the new graph neural network (GNN) test based on R-GAT, the NVIDIA platform with H100 GPUs excelled at both small and large scales. The H200 delivered a 47% boost on single-node GNN training compared to the H100. This showcases the powerful performance and high efficiency of NVIDIA GPUs, which make them ideal for a wide range of AI applications.

Broad Ecosystem Support Reflecting the breadth of the NVIDIA AI ecosystem, 10 NVIDIA partners submitted results, including ASUS, Dell Technologies, Fujitsu, GIGABYTE, Hewlett Packard Enterprise, Lenovo, Oracle, Quanta Cloud Technology, Supermicro and Sustainable Metal Cloud. This broad participation, and their own impressive benchmark results, underscores the widespread adoption and trust in NVIDIA's AI platform across the industry.

MLCommons' ongoing work to bring benchmarking best practices to AI computing is vital. By enabling peer-reviewed comparisons of AI and HPC platforms, and keeping pace with the rapid changes that characterize AI computing, MLCommons provides companies everywhere with crucial data that can help guide important purchasing decisions.

And with the NVIDIA Blackwell platform, next-level AI performance on trillion-parameter generative AI models for both training and inference is coming soon.
LINK: https://blogs.nvidia.com/blog/mlperf-training-benchmarks/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

07/04/2026

Haivision Unveils Makito ONE Live Video Contribution Platform

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Neutrik To Unveil TRUE1 Data Connector Series At 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Chris Welcker Deployed Full DPA Arsenal to Record Live Mu...

Catgut Sound Owner and Production Sound Mixer Chris Welcker, CAS, has built a career at the intersection of music and film. A former musician and composer, Welc...

07/04/2026

SDVI Launches Next Generation Rally Platform to Give Medi...

SDVI Corporation today announced the next generation of its Rally media supply chain management platform, introducing a redesigned orchestration engine that rep...

07/04/2026

Avid to Debut Avid Content Core on AWS at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Frequency Launches Smarter Ways to Operate Streaming Chan...

Frequency, the engine behind many of the world's leading streaming television channels, at NAB 2026 will be launching new Studio services to help content ow...

07/04/2026

Ikegami to Introduce Expanded Range of Broadcast Producti...

Ikegami USA has chosen NAB 2026 in Las Vegas as the launch platform for new additions to its range of broadcast-quality television production equipment. These w...

07/04/2026

Kiloview Advancing Broadcast IP Workflows with a Smarter...

April 19, 2026, Las Vegas Kiloview, an innovative provider of AV-over-IP technologies, will showcase its latest broadcast IP solutions at NAB 2026, presenting...

07/04/2026

Bitmovin Adds Support for SGAI in its Playback Products t...

Bitmovin has announced support for Server-Guided Ad Insertion (SGAI) across its playback products using HLS interstitials, enabling more advanced ad-supported s...

07/04/2026

Synamedia unveils AI by Quortex - a just-in-time AI-plugi...

Synamedia is unveiling AI by Quortex at The NAB show, a just-in-time AI plugin framework that applies intelligence only when needed across video processing, dis...

07/04/2026

Cuez Brings Four New Innovations to NAB 2026 From Story-C...

Cuez will showcase four additions to its cloud-based newsroom, rundown and automation platform at NAB Show 2026 (April 18 22, Las Vegas, Booth N1867): Cuez ...

07/04/2026

Barix Extends Transport Options for Multi-Engine IP Encoder

Barix Extends Transport Options for Multi-Engine IP Encoder Brie Clayton April 7, 2026 0 Comments New for NAB, Barix adds SRT and RIST support to Mult...

07/04/2026

Elite Media Technologies Selects Interra Systems' BATON File-Based QC Solution

Elite Media Technologies Selects Interra Systems' BATON File-Based QC Soluti...

07/04/2026

Tightrope Media Systems to Debut Cablecast LiveBridge for Simultaneous Streaming at NAB 2026

Tightrope Media Systems to Debut Cablecast LiveBridge for Simultaneous Streaming...

07/04/2026

Cuez Brings Four New Innovations to NAB 2026: From Story-Centric Newsroom to Open AI Agent Framework

Cuez Brings Four New Innovations to NAB 2026: From Story-Centric Newsroom to Ope...

07/04/2026

ASG Names Andrea Cummis VP of Systems Engineering

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

KTVJ Completes Major Signal Upgrade

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Hearst's WDSU to Air Million Dollar Rodeo Competition

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Grass Valley Launches Future Playmakers Program

Share Copy link Facebook X Linkedin Bluesky Email...

07/04/2026

Saranyu Technologies Launches MATCH - Multi-View Sports S...

Designed for synchronized multi-stream playback, low-latency delivery, and real-time analytics, MATCH introduces a unified viewing experience for sports broadca...

07/04/2026

Berklee Students to Honor George Martin with Performance of Original Scores

Berklee Students to Honor George Martin with Performance of Original Scores The orchestra, led by associate professor Xander Rovang, will perform several work...

07/04/2026

The GAA Championships on RT

THE GAA CHAMPIONSHIPS ARE BACK WITH A BANG LIVE AND FREE TO AIR ACROSS ALL RT PLATFORMS The Sunday Game and The Sunday Game Live return on Sunday 12 April L...

06/04/2026

Fab Five' Reunion Drives TNT and CBS's Experimental Final Four Altcast Built on REMI Workflow

Michigan legends bring a new voice to the broadcast as TNT Sports and CBS Sports...

06/04/2026

SVG New Sponsor Spotlight: Optikka CEO Daniel Evans on Scaling Sports Content with Programmatic Graphics

From high school sports all the way up to the major leagues, building high-quali...

06/04/2026

Quickplay and TwelveLabs Join AWS Business Outcomes Xcelerator Program

Quickplay, an AI company for the media and entertainment industry, has been accepted into the Advanced tier of the TwelveLabs Ecosystem Partner Program. Quickpl...

06/04/2026

Grass Valley Launches Future Playmakers Program for Students in Sports Production and Media Technology

Grass Valley has announced the Future Playmakers Program, a global initiative to...

06/04/2026

SVG All-Stars: Raasean Robinson, Gerente de Posproduccin y Operaciones de Estudio, FOX Deportes

El l der de operaciones impulsa la producci n en estudio mientras encuentra insp...

06/04/2026

SVG All-Stars: Raasean Robinson, Manager, Post Production and Studio Operations, FOX Deportes

The ops leader helps lead the charge in studio for the Spanish-language broadcas...

06/04/2026

Behind The Mic: SiriusXM Shares 2026 Masters Broadcast Team; ESPN to Produce Over 140+ Hours of Masters Live Coverage

Behind The Mic provides a roundup of recent news regarding on-air talent, includ...

06/04/2026

NHL Opens Innovation Lab in Partnership with Verizon, New Jersey Devils

The National Hockey League (NHL), in partnership with Verizon and the New Jersey Devils, today announced the opening of the NHL Innovation Lab powered by Verizo...

06/04/2026

ESPN+ To Stream Inaugural Rock League Curling Season

Rock League, a new professional curling league, has announced that ESPN+ will stream its inaugural 2026 season for fans in the United States. The first Rock Lea...

06/04/2026

ASG Appoints Andrea Cummis as VP of Systems Design and Engineering

Advanced Systems Group has announced the appointment of Andrea (Andy) Cummis as Vice President of Systems Design and Engineering. In this role, she will lead de...

06/04/2026

Source Media Group Launches Source Golf, a Creator-Driven YouTube Network Targeting Next-Gen Fans

Backed by Bolt Ventures, the venture brings Bryson DeChambeau, Grant Horvat, and...

06/04/2026

How the NHL's Innovation Lab Will Take Broadcast, Fan, and Team Tech to New Heights

With this environment we can start that collaboration even earlier because we ca...

06/04/2026

K-Pop Artist ENHYPEN Host The Blood Diary,' a New Video Podcast Series From HYBE

Like the immortal lives of vampires, some stories never really end. That's t...

06/04/2026

From Audio to IRL: How Let's Get Haunted' Is Building Community With Spotify RADAR

As podcasting continues to evolve, growth increasingly means building beyond aud...

06/04/2026

FSK Audio update Bark24 Dyn

Multiband dynamics plug-in enhanced California-based developer FSK Audio have released a significant update for their innovative multiband dynamics processo...

06/04/2026

IK Multimedia introduce ToneNET Preset Sharing

Share official & user-created full-rig presets IK Multimedia's latest TONEX update makes it possible for users of the popular amp and effects modelling ...

06/04/2026

Baseball 2026: More AI, Better Viewing Choices

Share Copy link Facebook X Linkedin Bluesky Email...

06/04/2026

JB&A Announces Details for its Pre-NAB 2026 Event

Share Copy link Facebook X Linkedin Bluesky Email...

06/04/2026

Dalet Showcases Dalia Agentic AI and End-to-End Media Workflows at NAB Show 2026

Dalet Showcases Dalia Agentic AI and End-to-End Media Workflows at NAB Show 2026 Brie Clayton April 6, 2026 0 Comments Dalet, a leading technology and...

06/04/2026

OpenDrives Shows Off Sports Expertise in Sports Business Hub located in NAB Show's West Hall

OpenDrives Shows Off Sports Expertise in Sports Business Hub located in NAB Show...

06/04/2026

Proton to Demonstrate 3D Application at NAB 2026

Proton to Demonstrate 3D Application at NAB 2026 Brie Clayton April 6, 2026 0 Comments Yet further creative potential unleashed through innovation in ...

06/04/2026

Autoscript Highlights Voice-Driven Prompting and PTZ Solutions at NAB 2026

Autoscript Highlights Voice-Driven Prompting and PTZ Solutions at NAB 2026 Brie Clayton April 6, 2026 0 Comments Experience Autoscript Voice, PTZ prom...

06/04/2026

Mediaproxy Highlights Significant Enhancements to its LogServer suite at NAB Show 2026

Mediaproxy Highlights Significant Enhancements to its LogServer suite at NAB Sho...