Sony Pixel Power calrec Sony

Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare Earth metals - even the gold - of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs [they have] thereby centrally contributed to the recent progress in AI, Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs, it said.

NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.

ChatGPT Spread the News ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A RedHat software engineering team put it succinctly in a blog: GPUs have become the foundation of artificial intelligence.

AI Under the Hood A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.

Highly Tuned Tensor Cores Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper systems configuration puts in a single compute node a whopping 288 Arm cores and 16 petaflops of AI performance with up to 2.3 terabytes of high-speed memory.

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...
See more stories from nvidia

North America Stories

24/04/2026

Churchill Downs Inc. Acquires Preakness Stakes for $85 Million

Churchill Downs Inc. (CDI) has announced a definitive agreement to acquire the intellectual property of the Preakness Stakes and Black-Eyed Susan Stakes from 1/...

24/04/2026

Inter Miami CF Opens Miami Freedom Park with 26 LED Displays From Daktronics, DCL

Daktronics has partnered with DCL (Design Communications, Ltd.) to design, manuf...

24/04/2026

NAB 2026: Chyron Launches PRIME Translate for Multi-Language Live Production

Chyron has announced PRIME Translate, a workflow solution that produces live content simultaneously in multiple languages within the PRIME platform. The system ...

24/04/2026

Eutelsat Supports Co-op Cable in Launching Expanded DTH Offering Across the Caribbean

Eutelsat has announced a new partnership with Co-op Cable, introducing an expand...

24/04/2026

Pitch Dublin Deploys Panasonic Laser LCD Projectors for Indoor Golf and Hospitality Venue

Pitch Dublin, an indoor golf simulation and hospitality venue on Dawson Street i...

24/04/2026

G&D and VuWall Appoint Mirko Aubel as EVP Sales EMEA/APAC, Eric Hnique as Chief Revenue Officer

G&D and VuWall have announced two senior leadership appointments, effective Apri...

24/04/2026

Victory+ Becomes Exclusive Local Streaming Home for Minnesota Lynx

Victory , the free sports streaming service from A Parent Media Co. Inc. (APMC), has announced a multi-year agreement to become the exclusive local streaming ho...

24/04/2026

SVG Students To Watch: Jason Weitz, University of South Florida

The former business major from Massachusetts has found his home in graphics and bug operation while contributing to live ESPN productions In the live-sports-vi...

24/04/2026

MASN and Spectrum Announce Multiyear Carriage Agreement

The Mid-Atlantic Sports Network (MASN) and Spectrum have announced a multiyear carriage agreement making MASN available to Spectrum customers in areas of southe...

24/04/2026

Case Study: 4Wall Entertainment Powers 2026 NFL Draft on NETGEAR AV Line Network

The NFL Draft is rebuilt from the ground up in a new city every year. The three-day fan festival is expected to draw 500,000 or more attendees, with millions fo...

24/04/2026

Diversified Continues Live Sports Expansion, Supports Mobile TV Group Ops Center Buildout

Diversified has continued expansion of its sports and media capabilities to supp...

24/04/2026

NAB Show 2026 Is In The Books! Our Coverage Continues at SVG's SportsTech@NABShow Blog

NAB reports that the 2026 NAB Show wrapped with more than 58,000 registered atte...

24/04/2026

NAB 2026: Clear-Com Announces Updates to Arcadia Central Station and Eclipse HX with New ARC Architecture

Clear-Com has announced significant updates to its Arcadia Central Station and E...

24/04/2026

NAB Reports 58,000+ Registered Attendees at 2026 NAB Show, Up From 2025 but Down From 2024

NAB reports that the 2026 NAB Show wrapped with more than 58,000 registered atte...

24/04/2026

Ratings Roundup: NHL Sees Best Regular Season Average Since 2013; CBS Sports Secures Most Watched Final Round Masters

Ratings Roundup is a rundown of recent rating news and is derived from press rel...

24/04/2026

Bleacher Reports Live NFL Draft Stream Builds on Record-Breaking Audience With Player-Driven, Interactive Production

B/R NFL Draft Live' refines the digital giant's productions around footb...

24/04/2026

Sportradar Report: The Viewing Experience Is the Product' as Sports Media Enters New Era of Personalization, Data-Driven Storytelling

Study highlights five pillars shaping modern fan engagement as broadcasters reth...

24/04/2026

ESPN/ABC, NFL Network Prepare for Record-Setting NFL Draft Presentation From Pittsburgh

The 2026 event, the first Draft with NFL Network under the ESPN umbrella, will b...

24/04/2026

SVG GameDay, Ep. 12: Detroit Lions' Jessica Shlemon - Motor City Football at Ford Field

In-venue and creative video staffers at the professional and collegiate level ha...

24/04/2026

Van Wagner Brings Warhol-Inspired Pop Art Vision to NFL Draft Videoboard Production

Integral to the Draft production for three almost decades, the company tells the...

24/04/2026

Film Festival Watch: 17 Sundance Institute-Supported Films to Screen at the 2026 Hot Docs Festival

Championing documentaries that illuminate and expand the artform is at the core ...

24/04/2026

NASA Receives L3Harris' Modified Next-Generation Research Aircraft

The NASA 777 aircraft departs the L3Harris facility in Waco, Texas....

24/04/2026

L3Harris Closes $1B Investment from Department of War in Missile Solutions Business

WASHINGTON, April 23, 2026 - L3Harris Technologies (NYSE: LHX) has closed a $1 b...

24/04/2026

Autonomous Logistics is the Marine Corps' Next Combat Advantage

In partnership with Airbus U.S. Space & Defense, Inc., L3Harris Technologies is advancing autonomous aviation for the U.S. Marine Corps' Aerial Logistics Co...

24/04/2026

Open, Connected, Decisive: the L3Harris hC2 Software Suite

L3Harris' hC2 software, powered by Systematic SitaWare, strengthens battlefield decision-making by using open architectures to connect platforms, sensors an...

24/04/2026

AERIS X Airborne Early Warning & Control: The Right Choice for Allied Homeland Defense

Artist rendering of L3Harris' AERIS X next-generation airborne early warning...

24/04/2026

Bitfocus Buttons wins NAB Show Product of the Year Award

Recognition for incredible advances in broadcast and enterprise control...

24/04/2026

Dalet Takes Home The Best in Show Award for Dalia at 2026 NAB

Dalet Takes Home The Best in Show Award for Dalia at 2026 NAB Brie Clayton April 23, 2026 0 Comments Media-aware agentic AI wins big for real-world ef...

24/04/2026

Calrec and Grass Valley Unlock Exceptional Choice and Flexibility for Broadcasters with ImPulseV and AMPP Integration

Calrec and Grass Valley Unlock Exceptional Choice and Flexibility for Broadcaste...

24/04/2026

Bitfocus Buttons wins NAB Show Product of the Year Award

Bitfocus Buttons wins NAB Show Product of the Year Award Brie Clayton April 24, 2026 0 Comments Recognition for incredible advances in broadcast and e...

24/04/2026

NAB Show Reports More Than 58,000 Registered Attendees for 2026

Share Copy link Facebook X Linkedin Bluesky Email...

24/04/2026

Dalet Takes Home The Best in Show Award for Dalia at 2026...

Media-aware agentic AI wins big for real-world efficiencies and time to value Dalet, a leading technology and service provider for media-rich organizations, to...

24/04/2026

Mediagenix Sweeps 2026 NAB Awards With Wins for Product o...

Mediagenix wins for its Scheduling Optimization capabilities that help broadcasters and FAST operators move beyond traditional scheduling automation toward cont...

24/04/2026

Setplex Secures Top Honors with NAB Show Project of the Y...

Setplex today announced that it has taken home the NAB Show Project of the Year Award in the Distribution category for its innovative deployment with UVOtv. Dur...

24/04/2026

farmerswife and Cirkus are Exhibiting MPTS 2026

Media and post-production teams are invited to experience next-level resource planning, project management, and connected media workflows at Stand K59 in The Gr...

23/04/2026

NAB Honors Rob Lowe and John Tesh With Hall of Fame Induction

Share Copy link Facebook X Linkedin Bluesky Email...

23/04/2026

Roku, Samsung Dominate CTV Platform Market in U.S.

Share Copy link Facebook X Linkedin Bluesky Email...

23/04/2026

G&D and VuWall Strengthen International Sales Team

Share Copy link Facebook X Linkedin Bluesky Email...

23/04/2026

The 2026 NAB Show Reports More than 58,000 Attendees

Share Copy link Facebook X Linkedin Bluesky Email...

23/04/2026

SmallHD Monitor Overlay License for Hi-5 and Hi-5 SX deli...

Partnership between ARRI and SmallHD brings new Hi-5 license Configurable monitor overlays adapt to individual working styles Supported by SmallHD monitors ru...

23/04/2026

Jeff Cronenweth ASC Sheds Light on Tron Ares with Astera

Lighting Master Cronenweth ASC brings a unique look to each grid world with the help of Astera Jeff Cronenweth on the set of Disney's TRON: ARES. Photo by...

23/04/2026

ZEISS Supreme Primes Shine in Star-Driven Short Dr Sam

DP Chloe Smolkin ( The Late Show, Kidz Bop ) joins director Danielle Beckmann and writer/actor Raji Ahsan behind the camera for the heartfelt short comedy Dr...

23/04/2026

Tribeca Festival 2026 Expands Tribeca Now, Spotlighting Digital Creators

April 23rd, 2026 Press Materials Available Here TRIBECA FESTIVAL 2026 EXPANDS TRIBECA NOW, SPOTLIGHTING DIGITAL CREATORS Tribeca Becomes First Major Film Fes...

23/04/2026

Strange Things Are Happening at McDonald's: The 'Tales From 85' Happy Meal Arrives Soon in Restaurants Worldwide

Back to All News Strange Things Are Happening at McDonald's: The Tales From...

23/04/2026

How Netflix Adaptations Are Heating Up the Top 10 and Bestseller Lists

Back to All News How Netflix Adaptations Are Heating Up the Top 10 and Bestseller Lists Entertainment 23 April 2026 Global Link copied to clipboard Downlo...

23/04/2026

OpenAI's New GPT-5.5 Powers Codex on NVIDIA Infrastructure - and NVIDIA Is Already Putting It to Work

AI agents have revolutionized developer workflows, and their next frontier is kn...

23/04/2026

Tag, You're It: GeForce NOW Levels Up Game Discovery With Xbox Game Pass and Ubisoft+ Labels

GeForce NOW is doubling down on what matters most: gamers. This week's upgra...

22/04/2026

Live From NAB 2026: Solid State Logics Berny Carpenter on Expanding System T With Virtual DSP, Cloud Workflows

Solid State Logic is advancing its System T platform with a stronger focus on IP...

22/04/2026

Live From NAB 2026: Dolbys Giles Baker on the Growth of Dolby OptiView, Immersive Vision and Audio for Live Sports

From immersive audio to live streaming, Dolby Laboratories is focused on the fut...