NVIDIA Takes Inference to New Heights Across MLPerf Tests

05/04/2023

MLPerf remains the definitive measurement for AI performance as an independent, third-party benchmark. NVIDIA's AI platform has consistently shown leadership across both training and inference since the inception of MLPerf, including the MLPerf Inference 3.0 benchmarks released today.

"Three years ago when we introduced A100, the AI world was dominated by computer vision. Generative AI has arrived," said NVIDIA founder and CEO Jensen Huang.

"This is exactly why we built Hopper, specifically optimized for GPT with the Transformer Engine. Today's MLPerf 3.0 highlights Hopper delivering 4x more performance than A100.

"The next level of Generative AI requires new AI infrastructure to train large language models with great energy efficiency. Customers are ramping Hopper at scale, building AI infrastructure with tens of thousands of Hopper GPUs connected by NVIDIA NVLink and InfiniBand.

"The industry is working hard on new advances in safe and trustworthy Generative AI. Hopper is enabling this essential work," he said.

The latest MLPerf results show NVIDIA taking AI inference to new levels of performance and efficiency from the cloud to the edge.

Specifically, NVIDIA H100 Tensor Core GPUs running in DGX H100 systems delivered the highest performance in every test of AI inference, the job of running neural networks in production. Thanks to software optimizations, the GPUs delivered up to 54% performance gains from their debut in September.

In healthcare, H100 GPUs delivered a 31% performance increase since September on 3D-UNet, the MLPerf benchmark for medical imaging.

Powered by its Transformer Engine, the H100 GPU, based on the Hopper architecture, excelled on BERT, a transformer-based large language model that paved the way for today's broad use of generative AI.

Generative AI lets users quickly create text, images, 3D models and more. It's a capability companies from startups to cloud service providers are rapidly adopting to enable new business models and accelerate existing ones.

Hundreds of millions of people now use generative AI tools like ChatGPT, itself a transformer model, and expect instant responses.

At this iPhone moment of AI, performance on inference is vital. Deep learning is now being deployed nearly everywhere, driving an insatiable need for inference performance from factory floors to online recommendation systems.

L4 GPUs Speed Out of the Gate

NVIDIA L4 Tensor Core GPUs made their debut in the MLPerf tests at over 3x the speed of prior-generation T4 GPUs. Packaged in a low-profile form factor, these accelerators are designed to deliver high throughput and low latency in almost any server.

L4 GPUs ran all MLPerf workloads. Thanks to their support for the key FP8 format, their results were particularly stunning on the performance-hungry BERT model.

In addition to stellar AI performance, L4 GPUs deliver up to 10x faster image decode, up to 3.2x faster video processing and over 4x faster graphics and real-time rendering performance.

Announced two weeks ago at GTC, these accelerators are already available from major systems makers and cloud service providers. L4 GPUs are the latest addition to NVIDIA's portfolio of AI inference platforms launched at GTC.

Software, Networks Shine in System Test

NVIDIA's full-stack AI platform showed its leadership in a new MLPerf test.

The so-called network-division benchmark streams data to a remote inference server. It reflects the popular scenario of enterprise users running AI jobs in the cloud with data stored behind corporate firewalls.

On BERT, remote NVIDIA DGX A100 systems delivered up to 96% of their maximum local performance, slowed in part because they needed to wait for CPUs to complete some tasks. On the ResNet-50 test for computer vision, handled solely by GPUs, they hit the full 100%.

Both results are thanks, in large part, to NVIDIA Quantum InfiniBand networking, NVIDIA ConnectX SmartNICs and software such as NVIDIA GPUDirect.
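The percentages in this section compare throughput measured through a remote inference server with throughput measured locally on the same system. A minimal sketch of that ratio follows. The function name and the throughput numbers are hypothetical placeholders chosen for illustration, not MLPerf results.

```python
def remote_efficiency(remote_throughput: float, local_throughput: float) -> float:
    """Return remote throughput as a percentage of local throughput."""
    if local_throughput <= 0:
        raise ValueError("local_throughput must be positive")
    return 100.0 * remote_throughput / local_throughput


# Hypothetical throughputs in samples per second (illustrative only,
# not taken from any MLPerf submission).
local = 1000.0
remote = 960.0

print(f"{remote_efficiency(remote, local):.0f}% of local performance")
```

A result of 100% would mean the network path adds no measurable throughput penalty, which is what the article reports for ResNet-50.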

Orin Shows 3.2x Gains at the Edge

Separately, the NVIDIA Jetson AGX Orin system-on-module delivered gains of up to 63% in energy efficiency and 81% in performance compared with its results a year ago. Jetson AGX Orin supplies inference when AI is needed in confined spaces at low power levels, including on systems powered by batteries.

For applications needing even smaller modules drawing less power, the Jetson Orin NX 16G shined in its debut in the benchmarks. It delivered up to 3.2x the performance of the prior-generation Jetson Xavier NX processor.
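The article reports some results as percentage gains (63%, 81%) and others as multiples (3.2x). As a rough aid for comparing them, the sketch below converts between the two notations. The helper names are illustrative assumptions. The input figures are the ones quoted above.

```python
def gain_to_factor(percent_gain: float) -> float:
    """Convert a percentage gain (e.g. 81 for +81%) to a speedup factor."""
    return 1.0 + percent_gain / 100.0


def factor_to_gain(factor: float) -> float:
    """Convert a speedup factor (e.g. 3.2 for 3.2x) to a percentage gain."""
    return (factor - 1.0) * 100.0


# Figures quoted in the article for the Jetson family.
for label, pct in [
    ("Jetson AGX Orin performance", 81),
    ("Jetson AGX Orin energy efficiency", 63),
]:
    print(f"{label}: +{pct}% = {gain_to_factor(pct):.2f}x")

print(f"Jetson Orin NX 16G: 3.2x = +{factor_to_gain(3.2):.0f}%")
```

Read this way, an 81% gain is a factor of 1.81x over the earlier result, while 3.2x corresponds to a gain of 220% over the prior-generation module.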

A Broad NVIDIA AI Ecosystem

The MLPerf results show NVIDIA AI is backed by the industry's broadest ecosystem in machine learning.

Ten companies submitted results on the NVIDIA platform in this round. They came from the Microsoft Azure cloud service and system makers including ASUS, Dell Technologies, GIGABYTE, H3C, Lenovo, Nettrix, Supermicro and xFusion.

Their work shows users can get great performance with NVIDIA AI both in the cloud and in servers running in their own data centers.

NVIDIA partners participate in MLPerf because they know it's a valuable tool for customers evaluating AI platforms and vendors. Results in the latest round demonstrate that the performance they deliver today will grow with the NVIDIA platform.

Users Need Versatile Performance

NVIDIA AI is the only platform to run all MLPerf inference workloads and scenarios in data center and edge computing. Its versatile performance and efficiency make users the real winners.

Real-world applications typically employ many neural networks of different kinds that often need to deliver answers in real time.

For example, an AI application may need to understand a user's spoken request, classify an image, make a recommendation and then deliver a response as a spoken message in a human-sounding voice. Each step requires a different type
LINK: https://blogs.nvidia.com/blog/2023/04/05/inference-mlperf-ai/...