Sony Pixel Power calrec Sony

NVIDIA Takes Inference to New Heights Across MLPerf Tests

05/04/2023

MLPerf remains the definitive measurement for AI performance as an independent, third-party benchmark. NVIDIA's AI platform has consistently shown leadership across both training and inference since the inception of MLPerf, including the MLPerf Inference 3.0 benchmarks released today.

Three years ago when we introduced A100, the AI world was dominated by computer vision. Generative AI has arrived, said NVIDIA founder and CEO Jensen Huang.

This is exactly why we built Hopper, specifically optimized for GPT with the Transformer Engine. Today's MLPerf 3.0 highlights Hopper delivering 4x more performance than A100.

The next level of Generative AI requires new AI infrastructure to train large language models with great energy efficiency. Customers are ramping Hopper at scale, building AI infrastructure with tens of thousands of Hopper GPUs connected by NVIDIA NVLink and InfiniBand.

The industry is working hard on new advances in safe and trustworthy Generative AI. Hopper is enabling this essential work, he said.

The latest MLPerf results show NVIDIA taking AI inference to new levels of performance and efficiency from the cloud to the edge.

Specifically, NVIDIA H100 Tensor Core GPUs running in DGX H100 systems delivered the highest performance in every test of AI inference, the job of running neural networks in production. Thanks to software optimizations, the GPUs delivered up to 54% performance gains from their debut in September.

In healthcare, H100 GPUs delivered a 31% performance increase since September on 3D-UNet, the MLPerf benchmark for medical imaging.

Powered by its Transformer Engine, the H100 GPU, based on the Hopper architecture, excelled on BERT, a transformer-based large language model that paved the way for today's broad use of generative AI.

Generative AI lets users quickly create text, images, 3D models and more. It's a capability companies from startups to cloud service providers are rapidly adopting to enable new business models and accelerate existing ones.

Hundreds of millions of people are now using generative AI tools like ChatGPT - also a transformer model - expecting instant responses.

At this iPhone moment of AI, performance on inference is vital. Deep learning is now being deployed nearly everywhere, driving an insatiable need for inference performance from factory floors to online recommendation systems.

L4 GPUs Speed Out of the Gate NVIDIA L4 Tensor Core GPUs made their debut in the MLPerf tests at over 3x the speed of prior-generation T4 GPUs. Packaged in a low-profile form factor, these accelerators are designed to deliver high throughput and low latency in almost any server.

L4 GPUs ran all MLPerf workloads. Thanks to their support for the key FP8 format, their results were particularly stunning on the performance-hungry BERT model.

In addition to stellar AI performance, L4 GPUs deliver up to 10x faster image decode, up to 3.2x faster video processing and over 4x faster graphics and real-time rendering performance.

Announced two weeks ago at GTC, these accelerators are already available from major systems makers and cloud service providers. L4 GPUs are the latest addition to NVIDIA's portfolio of AI inference platforms launched at GTC.

Software, Networks Shine in System Test NVIDIA's full-stack AI platform showed its leadership in a new MLPerf test.

The so-called network-division benchmark streams data to a remote inference server. It reflects the popular scenario of enterprise users running AI jobs in the cloud with data stored behind corporate firewalls.

On BERT, remote NVIDIA DGX A100 systems delivered up to 96% of their maximum local performance, slowed in part because they needed to wait for CPUs to complete some tasks. On the ResNet-50 test for computer vision, handled solely by GPUs, they hit the full 100%.

Both results are thanks, in large part, to NVIDIA Quantum Infiniband networking, NVIDIA ConnectX SmartNICs and software such as NVIDIA GPUDirect.

Orin Shows 3.2x Gains at the Edge Separately, the NVIDIA Jetson AGX Orin system-on-module delivered gains of up to 63% in energy efficiency and 81% in performance compared with its results a year ago. Jetson AGX Orin supplies inference when AI is needed in confined spaces at low power levels, including on systems powered by batteries.

For applications needing even smaller modules drawing less power, the Jetson Orin NX 16G shined in its debut in the benchmarks. It delivered up to 3.2x the performance of the prior-generation Jetson Xavier NX processor.

A Broad NVIDIA AI Ecosystem The MLPerf results show NVIDIA AI is backed by the industry's broadest ecosystem in machine learning.

Ten companies submitted results on the NVIDIA platform in this round. They came from the Microsoft Azure cloud service and system makers including ASUS, Dell Technologies, GIGABYTE, H3C, Lenovo, Nettrix, Supermicro and xFusion.

Their work shows users can get great performance with NVIDIA AI both in the cloud and in servers running in their own data centers.

NVIDIA partners participate in MLPerf because they know it's a valuable tool for customers evaluating AI platforms and vendors. Results in the latest round demonstrate that the performance they deliver today will grow with the NVIDIA platform.

Users Need Versatile Performance NVIDIA AI is the only platform to run all MLPerf inference workloads and scenarios in data center and edge computing. Its versatile performance and efficiency make users the real winners.

Real-world applications typically employ many neural networks of different kinds that often need to deliver answers in real time.

For example, an AI application may need to understand a user's spoken request, classify an image, make a recommendation and then deliver a response as a spoken message in a human-sounding voice. Each step requires a different type
LINK: https://blogs.nvidia.com/blog/2023/04/05/inference-mlperf-ai/...
See more stories from nvidia

Most recent headlines

13/12/2025

YouTube TV to Launch Genre Packages

In a move that will help it offer more flexible and less costly programming options, YouTube TV has announced that it will be launching YouTube TV Plans with mo...

13/12/2025

Magna Systems Finishes UHD, IP-based OB Truck For Singapore Network

SINGAPORE Magna Systems has designed, built and completed what is believed to be the first full UHD and IP-based OB truck in Southeast Asia for a Singapore medi...

12/12/2025

SVG Summit 2025 Preview: Everything You Need to Know for Next Week's Big Show in NYC

SVG Summit 2025 Preview: Everything You Need to Know for Next Week's Big Sho...

12/12/2025

Hailey Gates and Alia Shawkat Welcome You to the Village of Atropia

Hailey Gates at the Atropia premiere (photo by George Pimentel / Shutterstock for Sundance Film Festival)...

12/12/2025

Spotify and ATP Tour Launch First Episode of New Video Series

Last month, Spotify announced a new collaboration with the ATP Tour, the global governing body of men's professional tennis, aimed at bringing the next gene...

12/12/2025

Arkansas TV Drops PBS Affiliation Amid Funding Cuts

CONWAY, Ark. In a notable example of how the elimination of Federal federal funding is forcing public stations to make massive cuts and changes in the way they...

12/12/2025

Wisycom and DPA Microphones Appoint Rene Moerch as Group...

Wisycom and DPA Microphones announce the appointment of Ren Moerch as Group Product Director, Wireless, a strategic leadership role that will guide the combine...

12/12/2025

SMPTE Releases Updated Engineering Report on Artificial I...

SMPTE , the home of media professionals, technologists, and engineers, in conjuncture with the European Broadcasting Union (EBU) and the Entertainment Technolog...

12/12/2025

Keepit and Ingram Micro form strategic relationship in Po...

Keepit, the vendor-independent, cloud-native data protection provider, today announced a strategic go-to-market relationship in Poland with Ingram Micro, a lead...

12/12/2025

Atomos Enhances FUJIFILM GFX ETERNA 55 with RAW Capabilit...

Atomos announced the immediate availability of a new firmware update for its Ninja TX GO and Ninja TX monitor-recorders, unlocking Open Gate 48P RAW recording w...

12/12/2025

Professional Wireless Systems Provides Comprehensive RF S...

Professional Wireless Systems (PWS) once again played a critical role in delivering flawless wireless coordination and support at the 2025 Latin Grammy Awards a...

12/12/2025

AIMS Announces Inaugural IPMX Product Testing and Certifi...

The Alliance for IP Media Solutions (AIMS), together with the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA) and the European Broadc...

12/12/2025

DHD Gears for Hamburg Open 2026 with Latest Audio Product...

DHD audio will demonstrate the latest additions to its range of digital audio production solutions on Booth 321 in Hall B6 at Hamburg Open 2026. The show will b...

12/12/2025

Chaos Brings macOS Support and AI Tools to V-Ray for Blen...

Chaos today announces the release of V-Ray for Blender, update 2, bringing its award-winning rendering technology to even more Blender users by adding support f...

12/12/2025

UltraLEDs Launches Precision LED Tape for Professional Fi...

Lighting specialist UltraLEDs has launched Precision LED Tape, a high-CRI lighting solution designed specifically for professional film, TV, and studio use. P...

12/12/2025

Zixi Appoints Roi Sasson as Vice President Engineering

Zixi, the Emmy Award-winning leader in live broadcast-quality video over IP, today announced that Roi Sasson has joined the company as Vice President, Engineer...

12/12/2025

BitFire and Appear Partner to Advance Cloud and Edge Work...

BitFire (bitfire.tv), the leader in software-defined live production and IP transmission, today announced a strategic partnership with Appear, a leader in high-...

12/12/2025

HPA Announces Tech 2026 Retreat Agenda

LOS ANGELES The Hollywood Professional Association (HPA) today said futurist Robert Tercek, creative technologist Jessie Hughes from Leonardo.AI and Emmy-winnin...

12/12/2025

BitFire, Appear Form Strategic Partnership Integrating IP-Based Solutions

HUDSON, Mass. BitFire and Appear have struck a strategic partnership aimed at offering broadcasters, sports leagues and streaming platforms a faster, more flexi...

12/12/2025

TV Tech, TVBEurope to Explore MXLs Impact on Media Production

The broadcast industry is evolving faster than ever. #IPWorkflows #remoteproduction, and next-gen audio systems are reshaping how teams design, deliver, and sca...

12/12/2025

Wrapbook Acquires TV and Film Production Scheduling Platform Cinapse

LOS ANGELES The payroll and production accounting platform Wrapbook has announced the acquisition of Cinapse, a modern scheduling platform for film and televisi...

12/12/2025

Ross Video Expands South Asian Operations

DEHLI Ross Video has announced that it is expanding and restructuring its commercial and technical teams in the South Asian Association for Regional Cooperation...

12/12/2025

Rise AV Launches Asia Pacific Council and Mentoring Program

LONDON Following the success of its UK launch in January 2025, Rise AV, the global not-for-profit initiative dedicated to supporting and advancing women in the ...

12/12/2025

Tubi To Introduce Matter Casting For Fire TV

SAN FRANCISCO Ad-supported streaming service Tubi next week will launch Matter Casting, a new casting standard that will enable seamless mobile-to-TV viewing di...

12/12/2025

HPA Announces Tech Retreat Highlights

LOS ANGELES The Hollywood Professional Association (HPA) today said futurist Robert Tercek, creative technologist Jessie Hughes from Leonardo.AI and Emmy-winnin...

12/12/2025

Cheers to AI: ADAM Robot Bartender Makes Drinks at Vegas Golden Knights Game

In Las Vegas's T-Mobile Arena, fans of the Golden Knights are getting more than just hockey - they're getting a taste of the future. ADAM, a robot devel...

12/12/2025

President of Ireland Catherine Connolly visit to RT Raidi na Gaeltachta in Casla, Connemara

Uachtar n na h ireann, Catherine Connolly visited RT Raidi na Gaeltachta's...

12/12/2025

TV Host and social media sensation Eric Roberts revealed as sixth contestant for Dancing with the Stars 2026

Ireland AM host Eric Roberts has been revealed as the sixth contestant taking to...

12/12/2025

December 11, 2025

Scripps Research team pioneers an efficient way to stereoselectively add fluorine to drug-like molecules A new method uses a novel catalyst and inexpensive fluo...

11/12/2025

AI for Sustainability: Lessons from Sarajevo

Thomson and the Center for News, Technology and Innovation (CNTI) convened a two-day workshop in Sarajevo bringing together more than 35 journalists, editors, p...

11/12/2025

ESPN's Aims for Spectacular With Heisman Trophy Show

ESPN's Aims for Spectacular With Heisman Trophy ShowEvent firsts include 1080p HDR production airing on both national broadcast and cableBy Dan Daley, Audio...

11/12/2025

SVG Students To Watch: Frankie Patton, University of Colorado

SVG Students To Watch: Frankie Patton, University of ColoradoThe 2025 grad is hitting the ground running as a PA on national broadcastsBy Brandon Costa, Directo...

11/12/2025

SVG Summit 2025 Technology Exhibits Preview, Part 3

SVG Summit 2025 Technology Exhibits Preview, Part 3By SVG Staff Thursday, December 11, 2025 - 7:24 am Print This Story | Subscribe Story Highlights The 2...

11/12/2025

SVG Sit-Down: What Makes Gen Z, X, and Y Fans Tick? Dave Gavant of WSC Sports Goes Inside the 2025 Fan Engagement Survey

SVG Sit-Down: What Makes Gen Z, X, and Y Fans Tick? Dave Gavant of WSC Sports Go...

11/12/2025

SVG Summit 2025 Preview: 5G, MXL, Spectrum Loss, and Outerspace on Tap for Tuesday Tech Talks'

SVG Summit 2025 Preview: 5G, MXL, Spectrum Loss, and Outerspace on Tap for Tues...

11/12/2025

2025 Sports Broadcasting Hall of Fame: David Levy, Turner Titan and Master of All Sports-Media Trades

2025 Sports Broadcasting Hall of Fame: David Levy, Turner Titan and Master of Al...

11/12/2025

SVG Launches Follow the Money' Podcast: Go Inside the Sports Media Biz with Sam McCleery and John Kosner

SVG Launches Follow the Money' Podcast: Go Inside the Sports Media Biz with...

11/12/2025

A Deep Dive Inside Game Creek Video's Bird and Magic Mobile Units, Home to Amazon's NBA on Prime Video'

A Deep Dive Inside Game Creek Video's Bird and Magic Mobile Units, Home to A...

11/12/2025

How Sound Effects for Monsters Funday Football' Emulated the Sonic Soul of Monsters, Inc.'

How Sound Effects for Monsters Funday Football' Emulated the Sonic Soul of ...

11/12/2025

SVG New Sponsor Spotlight: CSP Mobile Productions' Len Chase on Upgrading Truck Fleet to 1080p, HDR, and ST 2110

SVG New Sponsor Spotlight: CSP Mobile Productions' Len Chase on Upgrading Tr...

11/12/2025

Spotify and The Game Awards Debut Gaming-Inspired Spotify Singles From Labrinth, Evanescence x GUNSHIP, and Bilmuri

Having the right song soundtrack your moves can make all the difference when gam...

11/12/2025

Celebrate Taylor Swift's Record-Breaking Year and New Docuseries with Exclusive Playlist Cover Art Stickers

It's been a big year for Taylor Swift. Her highly anticipated album The Life...

11/12/2025

L3Harris Ramps Up Production of Next-Gen Missile Tracking Satellites at Expanded Florida Facility

New satellites for the SDA Tranche 1 Tracking program in production at L3Harris&...

11/12/2025

L3Harris Delivers First Meadowlands Production Unit to US Space Force

The Meadowlands system, a compact and mobile version of the CCS, uses ground-based radio frequency units to disrupt satellite communications....

11/12/2025

L3Harris Demonstrates Interoperable Network to Unify Department of War and U.S. Government Agencies

The L3Harris demonstration united tactical communications devices, counter-UAS c...

11/12/2025

2025: L3Harris Year in Review

Throughout 2025, L3Harris delivered innovative solutions to U.S. and allied warfighters across every domain. With an unrelenting commitment to excellence, our...

11/12/2025

Nielsen reveals exclusive new data and insights in annual Tops of Sports report

A Majority of the World's Population (51%) Identify As Soccer Fans The 2025 MLB postseason notched 58.2 billion viewing minutes, up +24% from the prior y...

11/12/2025

Zixi Names Roi Sasson Vice President, Engineering

WALTHAM, Mass. Video-over-IP software provider Zixi said Roi Sasson has joined the company as vice president, engineering....