
Inference, the work of using AI in applications, is moving into mainstream uses, and it's running faster than ever.
NVIDIA GPUs won all tests of AI inference in data center and edge computing systems in the latest round of the industry's only consortium-based and peer-reviewed benchmarks.
NVIDIA A100 and T4 GPUs swept all data center inference tests. NVIDIA A100 Tensor Core GPUs extended the performance leadership we demonstrated in the first AI inference tests held last year by MLPerf, an industry benchmarking consortium formed in May 2018.
The A100, introduced in May, outperformed CPUs by up to 237x in data center inference, according to the MLPerf Inference 0.7 benchmarks. NVIDIA T4 small form factor, energy-efficient GPUs beat CPUs by up to 28x in the same tests.
To put this into perspective, a single NVIDIA DGX A100 system with eight A100 GPUs now provides the same performance as nearly 1,000 dual-socket CPU servers on some AI applications.
Leadership performance enables cost efficiency in taking AI from research to production. This round of benchmarks also saw increased participation, with 23 organizations submitting - up from 12 in the last round - and with NVIDIA partners using the NVIDIA AI platform to power more than 85 percent of the total submissions.
A100 GPUs, Jetson AGX Xavier Take Performance to the Edge
While A100 is taking AI inference performance to new heights, the benchmarks show that T4 remains a solid inference platform for mainstream enterprise, edge servers and cost-effective cloud instances. In addition, the NVIDIA Jetson AGX Xavier builds on its leadership position in power constrained SoC-based edge devices by supporting all new use cases.
Jetson AGX Xavier joined the A100 and T4 GPUs in leadership performance at the edge. The results also point to our vibrant, growing AI ecosystem, which submitted 1,029 results using NVIDIA solutions representing 85 percent of the total submissions in the data center and edge categories. The submissions demonstrated solid performance across systems from partners including Altos, Atos, Cisco, Dell EMC, Dividiti, Fujitsu, Gigabyte, Inspur, Lenovo, Nettrix and QCT.
Expanding Use Cases Bring AI to Daily Life Backed by broad support from industry and academia, MLPerf benchmarks continue to evolve to represent industry use cases. Organizations that support MLPerf include Arm, Baidu, Facebook, Google, Harvard, Intel, Lenovo, Microsoft, Stanford, the University of Toronto and NVIDIA.
The latest benchmarks introduced four new tests, underscoring the expanding landscape for AI. The suite now scores performance in natural language processing, medical imaging, recommendation systems and speech recognition as well as AI use cases in computer vision.
You need go no further than a search engine to see the impact of natural language processing on daily life.
The recent AI breakthroughs in natural language understanding are making a growing number of AI services like Bing more natural to interact with, delivering accurate and useful results, answers and recommendations in less than a second, said Rangan Majumder, vice president of search and artificial intelligence at Microsoft.
Industry-standard MLPerf benchmarks provide relevant performance data on widely used AI networks and help make informed AI platform buying decisions, he said.
AI Helps Saves Lives in the Pandemic The impact of AI in medical imaging is even more dramatic. For example, startup Caption Health uses AI to ease the job of taking echocardiograms, a capability that helped save lives in U.S. hospitals in the early days of the COVID-19 pandemic.
That's why thought leaders in healthcare AI view models like 3D U-Net, used in the latest MLPerf benchmarks, as key enablers.
We've worked closely with NVIDIA to bring innovations like 3D U-Net to the healthcare market, said Klaus Maier-Hein, head of medical image computing at DKFZ, the German Cancer Research Center.
Computer vision and imaging are at the core of AI research, driving scientific discovery and representing core components of medical care. And industry-standard MLPerf benchmarks provide relevant performance data that helps IT organizations and developers accelerate their specific projects and applications, he added.
Commercially, AI use cases like recommendation systems, also part of the latest MLPerf tests, are already making a big impact. Alibaba used recommendation systems last November to transact $38 billion in online sales on Singles Day, its biggest shopping day of the year.
Adoption of NVIDIA AI Inference Passes Tipping Point AI inference passed a major milestone this year.
NVIDIA GPUs delivered a total of more than 100 exaflops of AI inference performance in the public cloud over the last 12 months, overtaking inference on cloud CPUs for the first time. Total cloud AI Inference compute capacity on NVIDIA GPUs has been growing roughly tenfold every two years.
GPUs in major cloud services now account for more inference performance than CPUs. With the high performance, usability and availability of NVIDIA GPU computing, a growing set of companies across industries such as automotive, cloud, robotics, healthcare, retail, financial services and manufacturing now rely on NVIDIA GPUs for AI inference. They include American Express, BMW, Capital One, Dominos, Ford, GE Healthcare, Kroger, Microsoft, Samsung and Toyota.
Companies across key industry sectors use NVIDIA's AI platform for inference. Why AI Inference Is Hard Use cases for AI are clearly expanding, but AI inference is hard for many reasons.
New kinds of neural networks like generative adversarial networks are constantly being spawned for new use cases and the models are growing exponentially. The best language models for AI now encompass billions of parameters, and research in the field is still young.
These mod
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
05/11/2025
WASHINGTON Despite the ongoing government shutdown, Federal Communications Commission Chairman Brendan Carr has announced a tentative agenda for the agency'...
05/11/2025
The College Football Playoff (CFP), ESPN and TNT Sports have announced kick times and broadcast information for the 2025 CFP First Round, which will launch the ...
05/11/2025
NEW YORK IAB Tech Lab, the global digital advertising technical standards-setting body, has announced the release of device attestation support in the industry ...
05/11/2025
SINGAPORE Appear, an Oslo-based provider of live production technology, is opening a new facility in Singapore as part of the company's expansion into the A...
05/11/2025
DALLAS Parks Associates has released new data showing just how far the dramatic shift to streaming services has gone in recent years. Currently, more than nine ...
04/11/2025
SVG Sit-Down: Why Professional Fight League CEO John Martin Believes Growth Is I...
04/11/2025
SVG All-Stars: David Koppett, Executive Producer, Live Sports and Studio, NESN a...
04/11/2025
From concept to kick-off: How TAMS could transform sports workflows By Paul Markham
Tuesday, October 28, 2025 - 09:43
Print This Story
Techex tx darwin pr...
04/11/2025
College Hoops Preview 2025: The CW Tips Off Third Season of ACC Men's/Women&...
04/11/2025
College Hoops Preview 2025: Big Ten Network Heats Up for Busy Season With 500 Me...
04/11/2025
College Hoops Preview 2025: CBS Sports Readies 300+ Game Broadcasts Across Its P...
04/11/2025
College Hoops Preview 2025: NBC Sports Slate Features 200+ Big Ten, BIG EAST, an...
04/11/2025
College Hoops Preview 2025: ESPN Remote-Ops Team Preps for Massive Slate of 7,40...
04/11/2025
Never-before-seen footage of Selena Quintanilla and her family's band offers...
04/11/2025
Joel Edgerton at Train Dreams Park City premiere (photo by Soul Brother / Shutterstock for Sundance Film Festival)...
04/11/2025
Today, we announced our third quarter 2025 earnings, marking strong momentum as we surpassed 700 million Monthly Active Users and achieved double-digit subscrib...
04/11/2025
Idag rapporterar vi v rt resultat f r det tredje kvartalet 2025, vilket markerar en stark och fortsatt tillv xt d vi passerade 700 miljoner m natliga aktiva an...
04/11/2025
SBS calls for bold, thought-provoking factual ideas: up to $50,000 in developmen...
04/11/2025
Tomorrow's fight will demand networks that deliver both capacity and survivability, the speed to move mission applications at scale, and the resilience to e...
04/11/2025
New York, NY - November 3, 2025 - Neptune BidCo US Inc. (the Issuer or the Co...
04/11/2025
WASHINGTON The National Association of Broadcasters took aim at YouTube TV and its owner Google in a blog post for its heavy hand in deciding what viewers can ...
04/11/2025
HACKENSACK, N.J. The European Broadcasting Union (EBU) has awarded LiveU a five-year contract to deliver 24/7 live news content through its Eurovision News Exch...
04/11/2025
Bob Dylan Awarded Honorary Doctorate from Berklee College of Music The songwriter, performer, and cultural icon is recognized for a six-decade career that redef...
04/11/2025
SAN JOSE, Calif. Roku has launched Roku Ads API, a fully open, self-serve developer platform for connected TV (CTV) advertising. The Roku Ads API gives develope...
04/11/2025
Harmonic (NASDAQ: HLIT) today announced an expanded partnership with Spectrum to extend the company's industry-leading cOS vCMTS and advanced network and o...
04/11/2025
The inauguration of Empresa de Meios Audiovisuais' (EMAV's) first virtual studio in Lisbon marks a major technological milestone for the Portuguese audi...
04/11/2025
ZTransform, a leader in transformational system design, integration, and launch services for broadcasters, sports venues, educational facilities, and corporate ...
04/11/2025
Fred Baumgartner's op-ed (ATSC 3.0: I Cant Imagine Anyone Defending Our Current Adoption Strategy) on the broadcast industry's transition to ATSC 3.0 dr...
04/11/2025
Q&A with Music Alum Andrew van der Paardt The oboist and English horn player reports back from the pit of the New York City Ballet Orchestra, and tells how he...
04/11/2025
Damien Moloney as Jim Bergerac
As filming wraps on the highly anticipated second series of Bergerac(6x60'), UKTV today unveils a selection of first look im...
04/11/2025
Tuesday 4 November 2025
To view this content, please enable our use of cookies....
04/11/2025
Back to All News
Netflix and Embratur launch audiovisual tourism guide at the W...
04/11/2025
Back to All News
Frankenstein' Sightings Grip Hollywood With Halloween Wee...
04/11/2025
From the recent SMPTE Media Technology Summit in Pasadena, with FilmLight Image Engineer, Daniele Siragusano, and Research Engineer, Julius Tschannerl.
Matchin...
04/11/2025
Begins Thursday November 6 on RT One and RT Player at 10:15pm
Camogie: Inside...
04/11/2025
In Berlin on Tuesday, Deutsche Telekom and NVIDIA unveiled the world's first...
04/11/2025
When inspiration strikes, nothing kills momentum faster than a slow tool or a frozen timeline. Creative apps should feel fast and fluid - an extension of imagin...
04/11/2025
Douglas W. Phillips and Steven M. Paul join Scripps Research Board of Directors Finance and biomedical leaders bring decades of experience in investment strateg...
03/11/2025
SVG Sit-Down: Inside the Sports Rights Landscape (and the new IMG) with Andrew D...
03/11/2025
Challenging the norm: How TNT Sports is evolving coverage of the men's and w...
03/11/2025
Inspired storytelling: TNT Sports' Pete Thomas on creating opportunities out...
03/11/2025
NBA 2K League Returns With New Format Featuring NBA Players, Creators, and FansSeason will include online tournaments, in-person events, and open-ladder fan com...
03/11/2025
Live on the Water: The Rowing Channel Pulls Off Historic Production at Head Of T...
03/11/2025
Strategic partnership to expand specialized testing equipment, advance national security and support regional economic growth...
03/11/2025
In less than two weeks during late September and early October, the Federal Communications Commission acted on two proposed rulemakings that could have an enorm...
03/11/2025
Josh Miely is returning to a more hands-on radio and TV role with the National Association of Broadcasters....
03/11/2025
Broadcasters have spent years trying to integrate different vendor technologies in their facilities. As the industry has moved closer to software, that struggle...
03/11/2025
As the malevolent siege against broadcasters' interests intensifies from the far reaches of artificial intelligence misuse to relentless innovation in the m...
03/11/2025
Wheatstone founder and owner Gary Snow will retire from the company by the end of next year....