Sony Pixel Power calrec Sony

NVIDIA Grace Hopper Superchip Sweeps MLPerf Inference Benchmarks

11/09/2023

In its debut on the MLPerf industry benchmarks, the NVIDIA GH200 Grace Hopper Superchip ran all data center inference tests, extending the leading performance of NVIDIA H100 Tensor Core GPUs.

The overall results showed the exceptional performance and versatility of the NVIDIA AI platform from the cloud to the network's edge.

Separately, NVIDIA announced inference software that will give users leaps in performance, energy efficiency and total cost of ownership.

GH200 Superchips Shine in MLPerf The GH200 links a Hopper GPU with a Grace CPU in one superchip. The combination provides more memory, bandwidth and the ability to automatically shift power between the CPU and GPU to optimize performance.

Separately, NVIDIA HGX H100 systems that pack eight H100 GPUs delivered the highest throughput on every MLPerf Inference test in this round.

Grace Hopper Superchips and H100 GPUs led across all MLPerf's data center tests, including inference for computer vision, speech recognition and medical imaging, in addition to the more demanding use cases of recommendation systems and the large language models (LLMs) used in generative AI.

Overall, the results continue NVIDIA's record of demonstrating performance leadership in AI training and inference in every round since the launch of the MLPerf benchmarks in 2018.

The latest MLPerf round included an updated test of recommendation systems, as well as the first inference benchmark on GPT-J, an LLM with six billion parameters, a rough measure of an AI model's size.

TensorRT-LLM Supercharges Inference To cut through complex workloads of every size, NVIDIA developed TensorRT-LLM, generative AI software that optimizes inference. The open-source library - which was not ready in time for August submission to MLPerf - enables customers to more than double the inference performance of their already purchased H100 GPUs at no added cost.

NVIDIA's internal tests show that using TensorRT-LLM on H100 GPUs provides up to an 8x performance speedup compared to prior generation GPUs running GPT-J 6B without the software.

The software got its start in NVIDIA's work accelerating and optimizing LLM inference with leading companies including Meta, AnyScale, Cohere, Deci, Grammarly, Mistral AI, MosaicML (now part of Databricks), OctoML, Tabnine and Together AI.

MosaicML added features that it needs on top of TensorRT-LLM and integrated them into its existing serving stack. It's been an absolute breeze, said Naveen Rao, vice president of engineering at Databricks.

TensorRT-LLM is easy-to-use, feature-packed and efficient, Rao said. It delivers state-of-the-art performance for LLM serving using NVIDIA GPUs and allows us to pass on the cost savings to our customers.

TensorRT-LLM is the latest example of continuous innovation on NVIDIA's full-stack AI platform. These ongoing software advances give users performance that grows over time at no extra cost and is versatile across diverse AI workloads.

L4 Boosts Inference on Mainstream Servers In the latest MLPerf benchmarks, NVIDIA L4 GPUs ran the full range of workloads and delivered great performance across the board.

For example, L4 GPUs running in compact, 72W PCIe accelerators delivered up to 6x more performance than CPUs rated for nearly 5x higher power consumption.

In addition, L4 GPUs feature dedicated media engines that, in combination with CUDA software, provide up to 120x speedups for computer vision in NVIDIA's tests.

L4 GPUs are available from Google Cloud and many system builders, serving customers in industries from consumer internet services to drug discovery.

Performance Boosts at the Edge Separately, NVIDIA applied a new model compression technology to demonstrate up to a 4.7x performance boost running the BERT LLM on an L4 GPU. The result was in MLPerf's so-called open division, a category for showcasing new capabilities.

The technique is expected to find use across all AI workloads. It can be especially valuable when running models on edge devices constrained by size and power consumption.

In another example of leadership in edge computing, the NVIDIA Jetson Orin system-on-module showed performance increases of up to 84% compared to the prior round in object detection, a computer vision use case common in edge AI and robotics scenarios.

The Jetson Orin advance came from software taking advantage of the latest version of the chip's cores, such as a programmable vision accelerator, an NVIDIA Ampere architecture GPU and a dedicated deep learning accelerator.

Versatile Performance, Broad Ecosystem The MLPerf benchmarks are transparent and objective, so users can rely on their results to make informed buying decisions. They also cover a wide range of use cases and scenarios, so users know they can get performance that's both dependable and flexible to deploy.

Partners submitting in this round included cloud service providers Microsoft Azure and Oracle Cloud Infrastructure and system manufacturers ASUS, Connect Tech, Dell Technologies, Fujitsu, GIGABYTE, Hewlett Packard Enterprise, Lenovo, QCT and Supermicro.

Overall, MLPerf is backed by more than 70 organizations, including Alibaba, Arm, Cisco, Google, Harvard University, Intel, Meta, Microsoft and the University of Toronto.

Read a technical blog for more details on how NVIDIA achieved the latest results.

All the software used in NVIDIA's benchmarks is available from the MLPerf repository, so everyone can get the same world-class results. The optimizations are continuously folded into containers available on the NVIDIA NGC software hub for GPU applications.
LINK: https://blogs.nvidia.com/blog/2023/09/11/grace-hopper-inference-mlperf...
See more stories from nvidia

Most recent headlines

09/11/2025

Dalet Unveils Agentic AI Media Workflows at IBC2025

Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...

30/10/2025

SVG Students To Watch: Sam Newitt, Kansas State University

SVG Students To Watch: Sam Newitt, Kansas State UniversityThe South Dakota native thrives in many roles behind the scenes at K-StateHD.TVBy Brandon Costa, Direc...

30/10/2025

SVG Sit-Down: Swerve Sports' Christy Tanner Explores the Young FAST Channel's Early Success

SVG Sit-Down: Swerve Sports' Christy Tanner Explores the Young FAST Channel&...

30/10/2025

SVG Campus Shot Callers: Andy Liebsch, Senior Director, Video Services, Kansas State University

SVG Campus Shot Callers: Andy Liebsch, Senior Director, Video Services, Kansas S...

30/10/2025

Diversified Names Paul Lidsky CEO, Expanding Leadership Role After Serving as Board Chairman

Diversified Names Paul Lidsky CEO, Expanding Leadership Role After Serving as Bo...

30/10/2025

NBA, Cosm Enter Long-Term Partnership for Shared Reality Production, Distribution

NBA, Cosm Enter Long-Term Partnership for Shared Reality Production, Distributio...

30/10/2025

FanDuel Sports Network to Deliver Select Live NBA, NHL Games to Major Streaming Services for In-Market Viewers

FanDuel Sports Network to Deliver Select Live NBA, NHL Games to Major Streaming ...

30/10/2025

If I Had Legs, I'd Kick You, East of Wall, and More Sundance Institute-Supported Films Nominated for 35th Gotham Awards

As the year comes to a close, we can feel the invigorating wind sweeping in for ...

30/10/2025

Give Me the Backstory: Get to Know Max Walker-Silverman, the Writer-Director of Rebuilding

By Bailey Pennick One of the most exciting things about the Sundance Film Festi...

30/10/2025

Excellent training at SGL Carbon's Bonn site

The SGL Carbon site in Bonn has a long tradition of training. For many years, young talent has been successfully trained here, regularly achieving excellent exa...

30/10/2025

SBS, NITV and Screen Australia announce 2025 Digital Originals Shortlist

SBS, NITV and Screen Australia announce 2025 Digital Originals Shortlist 29 October, 2025 Media releases SBS, NITV and Screen Australia are excited to unve...

30/10/2025

Remarks for the 2025 APEC CEO Roundtable

Jon Rambeau, President of Integrated Mission Systems at L3Harris Technologies, speaks about industrial collaboration at the Asia-Pacific Economic Cooperation (A...

30/10/2025

L3Harris Technologies Reports Strong Third Quarter 2025 Results, Increases 2025 Guidance

MELBOURNE, Fla., October 30, 2025 - L3Harris Technologies (NYSE: LHX) reports th...

30/10/2025

FCC's Brendan Carr Issues Draft Proposal for More C-Band Spectrum Sales

WASHINGTON Federal Communications Commission Chair Brendan Carr said he has circulated a proposal for the agency to auction additional midband spectrum in the U...

30/10/2025

Diversified Names Paul Lidsky as CEO

PLANO, Texas Technology solutions provider Diversified has named Paul Lidsky as CEO, tasked with guiding the company's next stage of growth, driving market ...

30/10/2025

Interra Adds Stream Recording, BATON Integration to ORION

CUPERTINO, Calif. Interra Systems today unveiled ORION stream recording support and seamless integration with BATON Media Player, a combination that lets broadc...

30/10/2025

InterDigital Buys AI-Driven Video Codec Startup Deep Render

WILMINGTON, Del. InterDigital today announced the acquisition of Deep Render, an artificial intelligence startup with a team of AI experts focused on video code...

30/10/2025

TAG Video Systems Earns Two ESG Recognitions

NEW YORK TAG Video Systems has earned a higher-rated Digital Product Passport (DPP) Committed to Sustainability badge and the Aclymate Climate Wise Silver Tier ...

30/10/2025

Nexstar Extends Employment Agreement with Perry Sook Through 2029

IRVING, Texas As station groups move into an era that promises rapid tech, regulatory and economic changes, the Nexstar Media Group, Inc. has announced that its...

30/10/2025

Samba TV: 60% Of TV Time Spent Viewing Streaming Content

Television viewers are spending more time watching streaming content than linear TV, but sports continues to be a bright spot for broadcasters, according to the...

30/10/2025

Operative Media Names Mike Napadano as CEO

NEW YORK Advertising technology company Operative Media has named Mike Napadano as its new CEO....

30/10/2025

Walmart Selects Marshall Cameras to Power New Campus Broa...

Walmart Inc. has chosen Marshall Electronics cameras for use across its brand-new corporate campus studios and event center. The installation includes Marshall ...

30/10/2025

NETGEAR Academy Expands Into Industry-Wide IP Training Pl...

NETGEAR, Inc. (NASDAQ: NTGR), a global leader in intelligent networking solutions designed to power extraordinary experiences, today announced the launch of its...

30/10/2025

Clear-Com Gen-IC Virtual Intercom Connects Students World...

Clear-Com recently contributed its award-winning Gen-IC virtual intercom solution to power real-time communications for On-Air Student TV, a 24-hour global st...

30/10/2025

Maxon Strengthens Growth Strategy with Appointment of Kse...

Maxon, maker of powerful, approachable software solutions for creators working in 2D and 3D design, motion graphics, visual effects, and more, today announced t...

30/10/2025

Studio Technologies Dante Enabled Model 394 GPI Interface...

Studio Technologies, a leading manufacturer of high-quality audio, video, and fiber-optic solutions, announces that its new Model 394 GPI Interface and Model 39...

30/10/2025

Astro selects Broadpeak for high performance streaming an...

Broadpeak , a leader in streaming and monetization at scale, has been selected by leading Malaysian content and entertainment company Astro to enable two major ...

30/10/2025

Riedel Communications Appoints Ulrich Voigt as Director L...

Riedel Communications is pleased to announce that Ulrich Voigt has joined the company as Director Live Production Solutions, taking over the SimplyLive business...

30/10/2025

LiveU and Kinetiq Launch Cloud Native Watermarking Integr...

LiveU, the global leader in live IP-video contribution, production, and distribution, today announced a new partnership with Kinetiq, the AI-powered platform un...

30/10/2025

FCC Plans Nov. 20 Open Meeting, Provides Shutdown Update

WASHINGTON Federal Communications Commission Chair Brendan Carr has called for an end to the government shutdown while providing some updates on the agency'...

30/10/2025

Carr Issues Draft Proposal for More C-Band Spectrum Sales

WASHINGTON Federal Communications Commission Chair Brendan Carr has announced that he has circulated a proposal for the FCC to auction additional mid-band spect...

30/10/2025

U&GOLD LAUNCHES ITS ANNUAL SEARCH FOR THE UK'S BEST CHRISTMAS CRACKER' JOKES

TIS THE SEASON TO GET PUNNY! U&GOLD LAUNCHES ITS ANNUAL SEARCH FOR THE UK'S...

30/10/2025

Rohde & Schwarz enables MediaTeks 6G waveform verification with CMP180 radio communication tester

Rohde & Schwarz enables MediaTeks 6G waveform verification with CMP180 radio com...

30/10/2025

Fox Corporation Reports First Quarter Fiscal 2026 Financial Results

Fox Corporation Reports First Quarter Fiscal 2026 Financial Results NEW YORK, NY, October 30, 2025 - Fox Corporation (Nasdaq: FOXA, FOX; FOX or the Company...

30/10/2025

Tubi Media Group and Audiochuck Announce Exclusive Partnership

Tubi Media Group and Audiochuck Announce Exclusive Partnership Tubi Media Group Enters into a Multi-Year Ad Partnership Deal with Ashley Flowers' Award-Wi...

30/10/2025

Comscore Appoints a new Country Manager for APAC

Comscore Appoints a new Country Manager for APACNEW DELHI, INDIA, October 30, 2025 Comscore (Nasdaq: SCOR), a global leader in measuring and analysing consum...

30/10/2025

October 29, 2025

Scripps Research professor awarded $3.2 million to advance type 1 diabetes research Support from the National Institute of Diabetes and Digestive and Kidney Dis...

30/10/2025

RT Concert Orchestra November/December 2025

RT Concert Orchestra in November and December Collaborations with RT Radio 1's Sunday Miscellany (2 December) and RT 2FM's Late Breakfast (9 Decembe...

30/10/2025

MUIREANN O' CONNELL, NEIL DELAMERE AND MICHAEL CONLON JOIN THE LADS FOR THE 2 JOHNNIES LATE NIGHT LOCK IN

Johnny B and Johnny Smacks are back tonight for the second episode of the new se...

30/10/2025

Jon Bon Jovi, Miriam O'Callaghan, Davy Fitzgerald and Donncha O'Callaghan join the Late Late Show Halloween Line-Up

First look of Jon Bon Jovi's interview with Patrick Kielty Jon Bon Jovi is ...

30/10/2025

AI-Powered Mobile Clinics Deliver Breast Cancer Screening to India's Rural Communities

An unassuming van driving around rural India uses powerful AI technology that...

30/10/2025

Join the Resistance: ARC Raiders' Launches in the Cloud

Get ready, raiders - the wait is over. ARC Raiders is dropping onto GeForce NOW and bringing the fight from orbit to the screen. To celebrate the launch, gamer...

29/10/2025

MLS, EDGE Sound Research To Debut Immersive Embodied Sound' at LAFC vs. Austin FC Playoff Match

MLS, EDGE Sound Research To Debut Immersive Embodied Sound' at LAFC vs. Aus...

29/10/2025

SVG Remote Production Forum 2025: All Sessions Now Available to Watch on SVG PLAY

SVG Remote Production Forum 2025: All Sessions Now Available to Watch on SVG PLA...

29/10/2025

World Series 2025: How Audio Is Transported Around the Sites and Beyond

World Series 2025: How Audio Is Transported Around the Sites and BeyondThe signals also move not just between two countries but around the globeBy Dan Daley, Au...

29/10/2025

Inside the Archives: Celebrating Archives Month Through Sundance Film Festival Films

A still from 306 Hollywood, a film by sibling filmmakers Jonathan Bogar n and El...

29/10/2025

New research shows sense of belonging is growing stronger among multilingual Australians

New research shows sense of belonging is growing stronger among multilingual Aus...

29/10/2025

SBS, NITV and Screen Australia announce2025 Digital Originals Shortlist

SBS, NITV and Screen Australia announce2025 Digital Originals Shortlist 29 October, 2025 Media releases SBS, NITV and Screen Australia are excited to unvei...