
Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare earth metals, even the gold, of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003, and price-performance has improved 5,600 times, it reported.
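To put the 7,000x figure in yearly terms, the short sketch below converts it into an average annual growth factor. The 20-year span (2003 to 2023) is an assumption made for illustration; the report's exact measurement window is not given here.

```python
import math

# Figure quoted in the article: GPU performance up roughly 7,000x since 2003.
total_gain = 7_000

# Assumption (not stated in the article): the gain spans about 20 years.
years = 20

# Average yearly multiplier that compounds to the total gain.
annual_factor = total_gain ** (1 / years)

# Time needed to double performance at that average rate.
doubling_years = math.log(2) / math.log(annual_factor)

print(f"average yearly gain: {annual_factor:.2f}x")
print(f"doubling time: {doubling_years:.2f} years")
```

Under that assumption, the average works out to a bit more than a 1.5x improvement per year.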

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

"GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs ... [they have] thereby centrally contributed to the recent progress in AI," Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

"We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs," it said.

"NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years," said Bill Dally, the company's chief scientist, in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.

ChatGPT Spread the News

ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A Red Hat software engineering team put it succinctly in a blog: "GPUs have become the foundation of artificial intelligence."

AI Under the Hood

A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.
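The layered math described above can be sketched in a few lines of plain Python. This is a toy illustration, not how any GPU library is implemented: the function names, the ReLU activation and the tiny weight values are all made up for the example. The point is that every output element of a layer is an independent dot product, which is the kind of work thousands of cores can do at once.

```python
def matvec(weights, vector):
    # Each row produces one output element, and no row depends on another,
    # so parallel hardware could compute all rows at the same time.
    return [sum(w * x for w, x in zip(row, vector)) for row in weights]


def relu(vector):
    # Simple nonlinearity applied between layers (an illustrative choice).
    return [max(0.0, v) for v in vector]


def forward(layers, inputs):
    # Pass the data through the "lasagna": one matrix multiply per layer.
    activations = inputs
    for weights in layers:
        activations = relu(matvec(weights, activations))
    return activations


# Hypothetical two-layer network with hand-picked weights.
layers = [
    [[1.0, -1.0], [0.5, 0.5], [-1.0, 2.0]],  # layer 1: 2 inputs -> 3 outputs
    [[1.0, 1.0, 1.0]],                        # layer 2: 3 inputs -> 1 output
]

output = forward(layers, [2.0, 1.0])
print(output)
```

A real model has the same shape, only with far larger matrices and many more layers.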

Highly Tuned Tensor Cores

Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.
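Why precision is worth adjusting at all can be seen with a small experiment. The sketch below does not reproduce the Transformer Engine; it only rounds one number to two standard floating-point formats and compares the error. The 16-bit and 32-bit formats are used here simply because Python's standard library supports them, and the helper name is hypothetical.

```python
import struct


def round_trip(value, fmt):
    # Packing into an IEEE 754 format and unpacking again rounds the
    # value to that format's precision.
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


value = 0.1

as_fp32 = round_trip(value, "<f")  # 32-bit float, 4 bytes
as_fp16 = round_trip(value, "<e")  # 16-bit float, 2 bytes

err32 = abs(as_fp32 - value)
err16 = abs(as_fp16 - value)

print(f"fp32: {as_fp32!r}, error {err32:.3e}, bytes {struct.calcsize('<f')}")
print(f"fp16: {as_fp16!r}, error {err16:.3e}, bytes {struct.calcsize('<e')}")
```

The narrower format takes half the storage but rounds more coarsely, so choosing precision is a trade between speed and memory on one side and numerical accuracy on the other.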

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand

The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT-4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.
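A quick back-of-the-envelope check shows how these numbers fit together. The sketch treats the two parameter counts as round bounds, and it assumes 2 bytes per parameter (16-bit weights), which is an illustrative choice rather than a figure from the article.

```python
import math

# Round bounds taken from the text above.
params_2018 = 100e6  # "less than 100 million" parameters
params_now = 1e12    # "more than a trillion" parameters

# Overall growth and how many years of 10x-per-year growth it represents.
growth = params_now / params_2018
years_at_10x = math.log10(growth)

# Assumption: each parameter is stored as a 16-bit (2-byte) value.
bytes_per_param = 2
weights_terabytes = params_now * bytes_per_param / 1e12

print(f"growth: {growth:,.0f}x, about {years_at_10x:.0f} years at 10x a year")
print(f"weights alone: about {weights_terabytes:.0f} TB at 2 bytes each")
```

At that size, the weights alone exceed the memory of any single GPU, which is why systems have to scale out.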

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper configuration puts 288 Arm cores, 16 petaflops of AI performance and up to 2.3 terabytes of high-speed memory in a single compute node.
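The system figures above follow from simple multiplication, as the sketch below shows. It uses only numbers quoted in the text; the aggregate for the full 256-superchip system is a naive peak sum that ignores any scaling losses, and the variable names are chosen for the example.

```python
# Per-superchip figures quoted in the text.
superchip_cores = 72
superchip_petaflops = 4

# Four-way Grace Hopper node.
node_superchips = 4
node_cores = node_superchips * superchip_cores
node_petaflops = node_superchips * superchip_petaflops

# DGX GH200 with the maximum of 256 superchips (naive peak sum).
dgx_superchips = 256
dgx_petaflops = dgx_superchips * superchip_petaflops

# Shared memory divided evenly across superchips, in terabytes.
dgx_memory_tb = 144
memory_per_superchip_tb = dgx_memory_tb / dgx_superchips

print(f"four-way node: {node_cores} cores, {node_petaflops} petaflops")
print(f"DGX GH200 peak: {dgx_petaflops} petaflops")
print(f"memory per superchip: {memory_per_superchip_tb} TB")
```

The four-way totals match the 288 cores and 16 petaflops cited, and the full system's peak sum lands at roughly one exaflop of AI performance.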

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront

An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...