Sony Pixel Power calrec Sony

Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare Earth metals - even the gold - of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs [they have] thereby centrally contributed to the recent progress in AI, Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs, it said.

NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.

ChatGPT Spread the News ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A RedHat software engineering team put it succinctly in a blog: GPUs have become the foundation of artificial intelligence.

AI Under the Hood A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.

Highly Tuned Tensor Cores Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper systems configuration puts in a single compute node a whopping 288 Arm cores and 16 petaflops of AI performance with up to 2.3 terabytes of high-speed memory.

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...
See more stories from nvidia

North America Stories

12/06/2026

Study: 2026 Election Cycle to Hit Record $11.6 Billion Ad Spend

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

NAB Elects Leadership at June Board of Directors Meeting

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

FCC Believes World Cup Communication Will Score Highly

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

Broadcasters Back NO FAKES Act

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

Scripps Unveils Coverage Plans For America's 250th Anniversary

Share Copy link Facebook X Linkedin Bluesky Email...

12/06/2026

June 11, 2026

A fentanyl countermeasure that adapts to combat future black-market drugs Scripps Research scientists developed a vaccine that teaches the immune system to rapi...

11/06/2026

HBSs Johannes Franken on Digital Innovations, the Role of the Influencer at the 2026 FIFA World Cup

The immense size of the tourney and its Atlantic-spanning operation also disting...

11/06/2026

Nielsen: Soccer Fandom in North America Tops 136 Million, Up 10.9% in Five Years

Nielsen has released a new soccer fandom consumer research report, The Fans Behind The Game: FIFA World Cup 2026 Edition, examining the soccer audience in the...

11/06/2026

Telemundo Announces All-Day Opening Day Coverage for FIFA World Cup 2026 on June 11

Telemundo will launch its FIFA World Cup 2026 coverage on Thursday, June 11 with...

11/06/2026

Fubo Announces Distribution Agreement With NBCUniversal

FuboTV Inc. has announced a distribution agreement with NBCUniversal. Fubo customers can now stream Telemundo and Universo, with NBC Sports Network (NBCSN), NBC...

11/06/2026

DAZN Announces In-App Features for FIFA World Cup 2026 Coverage in Spain, Italy, and Japan

DAZN has announced its in-app features for FIFA World Cup 2026 coverage in Spain...

11/06/2026

Roblox Report: Sports Engagement on Platform Drives Real-World Fandom and Purchases

Roblox has released the 2026 Roblox Digital Expression Report: Wave 4 - Sports D...

11/06/2026

Andrea Bocelli, David Guetta, Megan Thee Stallion, and EJAE Release Official FIFA World Cup 2026 Anthem DNA'

FIFA has unveiled DNA, the Official FIFA World Cup 2026 Anthem, performed by A...

11/06/2026

ESPN Announces Extensive English- and Spanish-Language World Cup 2026 Coverage

ESPN will provide English- and Spanish-language news and information coverage of FIFA World Cup 2026 across its U.S. media platforms from June 11 through July 1...

11/06/2026

SVG Students To Watch: Teddy Batkin, Rochester Institute of Technology

The latest product of the outstanding RIT Sports Network program, this recent grad from Long Island is carving out a promising path in broadcast engineering In...

11/06/2026

DAZN and DSPORTS Announce Distribution Agreement Across Five Latin American Countries

DAZN has announced a multi-year agreement to make DSPORTS channels available to ...

11/06/2026

Resource Actors Throughout the Years at Sundance Institute's Directors Lab

Laura Dern at the 1986 Sundance Institute Directors Lab (Photo by Eric Edwards) By Lucy Spicer It takes a village to bring together the Sundance Institute lab...

11/06/2026

MTI FILM acquires Mango/New Edit

MTI FILM acquires Mango/New Edit Posted by MTI Film on June 10, 2026 LOS ANGELES, CA - June 2026 - MTI FILM, the multiple Emmy Award winning Hollywood post-p...

11/06/2026

Ungrounded LLM Fabricates Every Detail for Nearly 1 in 5 Movie and TV Titles Tested, New Gracenote Report Finds

Study underscores the need for authoritative content intelligence to build trust...

11/06/2026

PTZOptics, LayerJot Partner on AI-Powered PTZ at InfoComm 2026

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Chyron Unveils PAINT 10.4

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Maxon Brings Real-Time Architectural Visualization to AIA26 With New Redshift for Revit and Archicad Integration Beta

Maxon Brings Real-Time Architectural Visualization to AIA26 With New Redshift fo...

11/06/2026

ABC Kid's Caper Crew Shoots Australian Adventure with Blackmagic Design

ABC Kid's Caper Crew Shoots Australian Adventure with Blackmagic Design Brie Clayton June 11, 2026 0 Comments DP Judd Overton and team bring Wes A...

11/06/2026

PTZOptics and LayerJot demo Visual Reasoning at InfoComm...

PTZOptics, and LayerJot today announced live demonstrations at InfoComm 2026 showing how prompt-based AI, robotic camera control, and high-performance computing...

11/06/2026

Lightware launches GPIO Button to deliver simplified hard...

Lightware, an industry leader in signal management, announces the release of GPIO-Button-10S, a dedicated control interface enabling straightforward press-to-a...

11/06/2026

NABs LeGeyt Urges Congress to Limit NFL's Antitrust Exemption

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Fubo Inks New Distribution Agreement with NBCUniversal

Share Copy link Facebook X Linkedin Bluesky Email...

11/06/2026

Kiloview to Showcase Broadcast-Grade AV-over-IP Solutions...

Kiloview, a leading innovator in AV-over-IP video solutions, will return to InfoComm 2026 (Booth# N8327) with broadcast-grade AV-over-IP solutions designed for ...

11/06/2026

Berklee's Tonya Butler Named Music Business Educator of the Year

Berklee's Tonya Butler Named Music Business Educator of the Year The Music Business Association honored Butler at its annual Bizzy Awards. June 10, 2026 ...

11/06/2026

Ann Mincieli to Receive Honorary Doctorate at Berklee NYC Graduate Commencement

Ann Mincieli to Receive Honorary Doctorate at Berklee NYC Graduate Commencement The five-time Grammy-winning engineer and producer, known for her longstanding...

11/06/2026

Hadewych Minis and Geert van Rampelberg to Star in New Netflix Series Directed by Paula van der Oest

Back to All News Hadewych Minis and Geert van Rampelberg to Star in New Netflix...

11/06/2026

Official Trailer for Anime Adaptation of Thunder 3' Unveiled Ahead of July 9 Premiere

Back to All News Official Trailer for Anime Adaptation of Thunder 3' Unvei...

11/06/2026

Save Big and Play Bigger: GeForce NOW Summer Sale Brings Major Membership Savings

The GeForce NOW summer sale kicked off today with limited-time savings of up to ...

10/06/2026

SVG Sit-Down: Team Whistle's Joe Caporoso on Building World Cup Content Around Fans, Culture, IRL Experiences

DAZN-owned digital-media company launches three fan-first series leaning into cr...

10/06/2026

Clear-Com Appoints Jason Dino as Southwest Regional Sales Manager

Clear-Com has announced the appointment of Jason Dino as Southwest Regional Sales Manager USA, covering Southern California and the Southwest region. Dino joins...

10/06/2026

Caretta Research: 2026 World Cup Revenue Growth Due to More Matches; Rights Revenue Up 32%

An 11% decrease in number of global broadcast deals reflects the organization...

10/06/2026

Women Without Boundaries Awards Are Back!

The Women Without Boundaries Awards recognize women whose work is advancing the future of media, broadcast, AV, workplace technology, digital experience, and re...

10/06/2026

On Eve of World Cup Kickoff, FIFA and HBS Offer Deep Dive into IBC Operations, Commentary, and Ref Cam

Today is match day minus two for FIFA and HBS. On Thursday, there will be two ma...

10/06/2026

SES Supporting World's Biggest Soccer Tournament Broadcast Distribution Worldwide

SES is supporting broadcast distribution of the world's biggest football tou...

10/06/2026

BirdDog Achieves Full NDI 6.3 Compatibility Across Entire Product Line

NDI has announced that BirdDog has become the first hardware manufacturer to achieve full NDI 6.3 compatibility across its complete lineup of cameras, encoders,...

10/06/2026

Emmy Award-Winning Audio Team To Present at SVG Audio Symposium

Vince Caputo and Scott Carter, winners of the 2026 Sports Emmy for Outstanding Post Produced Audio have been announced as presenters for the 2026 SVG Advanced A...

10/06/2026

FOX One Set to Deliver World Cup in 4K; Personalization via AI Drives Experience

FOX One today unveiled a slate of new product features and enhancements designed to elevate the viewing experience for fans on the official streaming platform o...

10/06/2026

PWHL Scales Broadcast Operation in Season 3, Relying on World-Feed Model and Key Vendors

Primary production partners Dome Productions and Raycom Sports once again played...

10/06/2026

NFL Films Application Deadline for Women in Sports Filmmaking Experienceship, Augusts 26-29 in Mount Laurel, Closes June 18

The Women in Sports Filmmaking Experienceship is an immersive professional devel...

10/06/2026

NBAs In-House Broadcast Ops & Engineering Teams Power Global Finals Coverage From NYC, San Antonio

The league has expanded its HSAN architecture for the NBA Finals to manage more ...

10/06/2026

MoonPay X Games League Winter Draft Set for September 16 at Cosm Los Angeles

The inaugural MoonPay X Games League (XGL) Winter Draft will take place Wednesday, September 16, 2026 at Cosm Los Angeles from 7-9 p.m. PT. The event will strea...

10/06/2026

University of Oklahoma and Learfield Extend 30-Year Partnership, Announce Sooner Evolution Center

The University of Oklahoma (OU) Athletics Department and Learfield have announce...

10/06/2026

VSF Releases RIST Satellite-Hybrid Out-of-Band Specification

The Video Services Forum (VSF) has released TR-06-4 Part 8, a new specification for RIST Satellite-Hybrid: Out-of-Band Method. The specification creates a mecha...

10/06/2026

Riedel Artist Intercom Powers Live Neurovascular Conference in Lisbon

Riedel Communications provided the communications infrastructure for the 14th World Live Neurovascular Conference (WLNC) in Lisbon, supporting live medical proc...

10/06/2026

Sundance Film Festival 101: Films by LGBTQ+ Directors

A still from The Doom Generation by Gregg Araki (Courtesy of Sundance Institute) By Lucy Spicer Have you checked out our Sundance Film Festival 101 list yet...