Sony Pixel Power calrec Sony

Speed Demon: NVIDIA Blackwell Takes Pole Position in Latest MLPerf Inference Results

02/04/2025

In the latest MLPerf Inference V5.0 benchmarks, which reflect some of the most challenging inference scenarios, the NVIDIA Blackwell platform set records - and marked NVIDIA's first MLPerf submission using the NVIDIA GB200 NVL72 system, a rack-scale solution designed for AI reasoning.

Delivering on the promise of cutting-edge AI takes a new kind of compute infrastructure, called AI factories. Unlike traditional data centers, AI factories do more than store and process data - they manufacture intelligence at scale by transforming raw data into real-time insights. The goal for AI factories is simple: deliver accurate answers to queries quickly, at the lowest cost and to as many users as possible.

The complexity of pulling this off is significant and takes place behind the scenes. As AI models grow to billions and trillions of parameters to deliver smarter replies, the compute required to generate each token increases. This requirement reduces the number of tokens that an AI factory can generate and increases cost per token. Keeping inference throughput high and cost per token low requires rapid innovation across every layer of the technology stack, spanning silicon, network systems and software.

The latest updates to MLPerf Inference, a peer-reviewed industry benchmark of inference performance, include the addition of Llama 3.1 405B, one of the largest and most challenging-to-run open-weight models. The new Llama 2 70B Interactive benchmark features much stricter latency requirements compared with the original Llama 2 70B benchmark, better reflecting the constraints of production deployments in delivering the best possible user experiences.

In addition to the Blackwell platform, the NVIDIA Hopper platform demonstrated exceptional performance across the board, with performance increasing significantly over the last year on Llama 2 70B thanks to full-stack optimizations.

NVIDIA Blackwell Sets New Records The GB200 NVL72 system - connecting 72 NVIDIA Blackwell GPUs to act as a single, massive GPU - delivered up to 30x higher throughput on the Llama 3.1 405B benchmark over the NVIDIA H200 NVL8 submission this round. This feat was achieved through more than triple the performance per GPU and a 9x larger NVIDIA NVLink interconnect domain.

While many companies run MLPerf benchmarks on their hardware to gauge performance, only NVIDIA and its partners submitted and published results on the Llama 3.1 405B benchmark.

Production inference deployments often have latency constraints on two key metrics. The first is time to first token (TTFT), or how long it takes for a user to begin seeing a response to a query given to a large language model. The second is time per output token (TPOT), or how quickly tokens are delivered to the user.

The new Llama 2 70B Interactive benchmark has a 5x shorter TPOT and 4.4x lower TTFT - modeling a more responsive user experience. On this test, NVIDIA's submission using an NVIDIA DGX B200 system with eight Blackwell GPUs tripled performance over using eight NVIDIA H200 GPUs, setting a high bar for this more challenging version of the Llama 2 70B benchmark.

Combining the Blackwell architecture and its optimized software stack delivers new levels of inference performance, paving the way for AI factories to deliver higher intelligence, increased throughput and faster token rates.

NVIDIA Hopper AI Factory Value Continues Increasing The NVIDIA Hopper architecture, introduced in 2022, powers many of today's AI inference factories, and continues to power model training. Through ongoing software optimization, NVIDIA increases the throughput of Hopper-based AI factories, leading to greater value.

On the Llama 2 70B benchmark, first introduced a year ago in MLPerf Inference v4.0, H100 GPU throughput has increased by 1.5x. The H200 GPU, based on the same Hopper GPU architecture with larger and faster GPU memory, extends that increase to 1.6x.

Hopper also ran every benchmark, including the newly added Llama 3.1 405B, Llama 2 70B Interactive and graph neural network tests. This versatility means Hopper can run a wide range of workloads and keep pace as models and usage scenarios grow more challenging.

It Takes an Ecosystem This MLPerf round, 15 partners submitted stellar results on the NVIDIA platform, including ASUS, Cisco, CoreWeave, Dell Technologies, Fujitsu, Giga Computing, Google Cloud, Hewlett Packard Enterprise, Lambda, Lenovo, Oracle Cloud Infrastructure, Quanta Cloud Technology, Supermicro, Sustainable Metal Cloud and VMware.

The breadth of submissions reflects the reach of the NVIDIA platform, which is available across all cloud service providers and server makers worldwide.

MLCommons' work to continuously evolve the MLPerf Inference benchmark suite to keep pace with the latest AI developments and provide the ecosystem with rigorous, peer-reviewed performance data is vital to helping IT decision makers select optimal AI infrastructure.

Learn more about MLPerf.

Images and video taken at an Equinix data center in the Silicon Valley.
LINK: https://blogs.nvidia.com/blog/blackwell-mlperf-inference/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

15/06/2026

University of South Carolina's Valerie Gerfin on Gamecock Productions' Growth, Upgrades at Williams-Brice Stadium

One of the more exciting internal video production divisions within a college at...

15/06/2026

Fox Corp. To Acquire Roku, Pairs Live Sports Powerhouse With Major CTV Platform

The deal valued at $22 Billion is expected to close in the first half of 2027...

15/06/2026

Golf Channel Mobile to Live Stream 2026 Arnold Palmer Cup Beginning July 13th

Golf Channel and the Arnold Palmer Cup have announced a partnership to livestream the 2026 Arnold Palmer Cup on Golf Channel Mobile and GolfChannel.com. The tou...

15/06/2026

TikTok and Panini Launch Digital Collectible Card Experience for FIFA World Cup 2026

TikTok and Panini have announced a partnership to bring a digital collectible ca...

15/06/2026

Cosm and Monster Energy Launch First Full-Dome Immersive Advertisement in Shared Reality Venues

Cosm and Monster Energy have announced the debut of the first full-dome immersiv...

15/06/2026

Fox Nation and Real American Freestyle Sign International Media Rights Deal

Real American Freestyle (RAF) and Fox Nation have announced an exclusive streaming agreement for three RAF international events, beginning with RAF Georgia on J...

15/06/2026

FanConnect and Extreme Networks Announce IPTV Integration for Large Venue Deployments

FanConnect has announced a partnership with Extreme Networks integrating FanConn...

15/06/2026

2026 Sundance Institute Ignite x Adobe Fellows Named

Ten Emerging Filmmakers Ages 18 to 25 Will Start Fellowship Year at Ignite Lab from June 14-19 LOS ANGELES, CA, June 15, 2026 - The nonprofit Sundance Institut...

15/06/2026

Rumble from UVI

Innovative three-band soft synth introduced UVI's latest synth takes an interesting approach to synthesis, offering a trio of synth engines that each op...

15/06/2026

Oram Awards 2026: Open call announcement

Applications now open for 2026 The Oram Awards have returned for 2026 to celebrate the unusual, unique and unfiltered creative worlds of women and gender-di...

15/06/2026

PSPaudioware release PSP Levelizer

New intelligent auto-fader plug-in revealed PSPaudioware's latest release offers automatic level adjustment and provides more detailed control than many...

15/06/2026

4.78M AUSSIES TUNE IN FOR SOCCEROOS WIN OVER TRKYE ON SBS

4.78M AUSSIES TUNE IN FOR SOCCEROOS WIN OVER T RK YE ON SBS 15 June, 2026 Media releases Match had a Total TV average audience of 3.035 million, with over ...

15/06/2026

SBS Head of Commissioning John Godfrey to depart after 18 years

SBS Head of Commissioning John Godfrey to depart after 18 years 15 June, 2026 Media releases SBS Head of Commissioning John Godfrey will depart the broadca...

15/06/2026

Greater Manchester Police installs Rohde & Schwarz security scanner for custody searches

Greater Manchester Police installs Rohde & Schwarz security scanner for custody ...

15/06/2026

The New Discovery Stack: AI, Metadata and Audience Intelligence

Insights from NAGRAVISION's latest industry webinar featuring One Hungary, Liberty Global and Media Press Group In this blog, Laura Rognoni explores the k...

15/06/2026

Clear-Com Introduces Avalon IP Intercom Platform

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

DoJ Approves Paramount Skydance, Warner Bros. Discovery Merger

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Clear-Com Introduces Avalon IP Station for Modern Communi...

Clear-Com has introduced Avalon , a purpose built 1RU IP intercom communication platform for modern networked production, designed to simplify and scale workfl...

15/06/2026

Fox Makes CTV Play with Roku Acquisition

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Gray Announces Plans to Expand Lansing, Mich. Broadcast HQ

Share Copy link Facebook X Linkedin Bluesky Email...

15/06/2026

Richmond Flying Squirrels Raise the Bar for Live Baseball...

MiLB Club Deploys LDX 110 Cameras at CarMax Park to Deliver A New Standard in Engaging Fan Experience Grass Valley today announced that the Richmond Flying Sq...

15/06/2026

Detach from Direct-Attached: How Remote Editing with EVO Keeps Creative Teams Moving

Detach from Direct-Attached: How Remote Editing with EVO Keeps Creative Teams Mo...

15/06/2026

Techtel Completes Media Production Setup for a major AFL sporting organisation

Techtel Completes Media Production Setup for a major AFL sporting organisation Sports 15 June Written By Suzanne Costello (Sydney, Australia 15 June 2026)...

15/06/2026

Sky News takes viewers inside Minab in new film investigating primary school strike in Iran

Monday 15 June 2026 Sky News takes viewers inside Minab in new film investigati...

15/06/2026

Fox Corporation to Acquire Roku, Inc.

Fox Corporation to Acquire Roku, Inc. Combination Creates a Scaled Media and Technology Platform with Superior Reach, Engagement and Monetization Capability ...

14/06/2026

Detroit Drums from Iconic Instruments

Library captures 1960s R&B/pop drum sound Following on from their recent wave of plug-in effects, Iconic Instruments have just launched an all-new virtual d...

14/06/2026

HBO Comedy Rooster Shot with URSA Cine 17K 65

HBO Comedy Rooster Shot with URSA Cine 17K 65 Brie Clayton June 14, 2026 0 Comments Large format brings viewers intimately close to characters. Black...

13/06/2026

Rhythmic Filters for Devious Machines' Infiltrator

Latest expansion pack includes 252 presets Devious Machines have recently introduced another expansion for their powerful multi-effects plug-in, Infiltrator...

13/06/2026

MetaGrid Pro gains AI Builder

Create custom DAW/plug-in controllers using prompts MetaGrid have recently introduced an all-new AI Builder function to their touchscreen-based control surf...

13/06/2026

Spectrum Reach Taps Anoki AI for Contextual Intelligence

Share Copy link Facebook X Linkedin Bluesky Email...

13/06/2026

Google TV Launches Soccer Hub, New Voice Command Features

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

YES Network and Gotham Sports App to Air Seven Athletes Unlimited Softball League Games

YES Network and The Gotham Sports App will air seven Athletes Unlimited Softball...

12/06/2026

UFL to Feature FAST Innovation Suite at 2026 United Bowl

The United Football League will host its FAST Innovation Suite at the 2026 United Bowl presented by Credit One Bank on Saturday, June 13 at 3:00 p.m. ET at Audi...

12/06/2026

InfoComm 2026: PTZOptics and LayerJot to Demo AI-Driven Camera Control

PTZOptics and LayerJot will present live demonstrations at InfoComm 2026 showing how natural-language AI prompting, robotic camera control, and on-device comput...

12/06/2026

InfoComm 2026: MultiDyne to Debut VF-9100 Fiber Transport Platform and Crescendo Audio Monitor

MultiDyne Video and Fiber Optic Systems will exhibit at InfoComm 2026, featuring...

12/06/2026

Eurovision Services Deploys Ateme Software-Based Frame-Rate Conversion

Ateme has announced that Eurovision Services is using Ateme's software-based frame-rate conversion technology for international live event workflows. The de...

12/06/2026

Bitmovin, Simplestream, and Xperi Partner to Support OTT Services on TiVo OS

Bitmovin and Simplestream have announced a partnership with Xperi to simplify the launch of OTT streaming services on TiVo OS smart TVs and devices. The collabo...

12/06/2026

Net Insight Deploys Nimbra 520 and Nimbra Edge for Multinational Corporate Live Production Workflow

Net Insight has announced that a multinational technology company is deploying a...

12/06/2026

MLB Players Inc., Athletes First Announce Content Partnership

MLB Players Inc., the business arm of the MLB Players Association, has announced a partnership with Athletes First to develop and sell brand partnerships across...

12/06/2026

G&D and VuWall Announce CommandKeyboard-Advanced for Network-Independent Control Room Operations

Guntermann and Drunck (G&D) and VuWall have announced the CommandKeyboard-Advanc...

12/06/2026

Philadelphia Union and Comcast Deploy Smart Technology at Subaru Park and WSFS Bank Sportsplex

Comcast Smart Solutions announces a new smart technology deployment with Major L...

12/06/2026

Elevation Worship Completes First Leg of 2026 Tour Using SSL Live Consoles and New UMD192 Interface

Elevation Worship completed the initial leg of its Elevation Nights 2026 tour ...

12/06/2026

AJA Announces KONA IP25 Integration with Colorfront Transkoder and On-Set Dailies

AJA Video Systems has announced KONA IP25 support for Colorfront Transkoder and ...

12/06/2026

InfoComm 2026: Audinate To Exhibit With New AVIO Install Adapters and Iris Camera Control Platform

Audinate Group Limited (ASX: AD8) will exhibit at InfoComm 2026 (Booth C7321, Ce...

12/06/2026

Pac-12 Appoints Scott Adametz as Chief Technology Officer

Pac-12 Commissioner Teresa Gould has announced the appointment of Scott Adametz as Chief Technology Officer. The Pac-12 describes the hire as the first CTO appo...

12/06/2026

InfoComm 2026: Grass Valley Introduces AMPP Edge Live for Enterprise Production

Grass Valley has announced AMPP Edge Live, a production system combining Grass Valley hardware, NVIDIA Blackwell GPU acceleration, and AMPP OS in a single platf...