Sony Pixel Power calrec Sony

Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare Earth metals - even the gold - of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs [they have] thereby centrally contributed to the recent progress in AI, Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs, it said.

NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.

ChatGPT Spread the News ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A RedHat software engineering team put it succinctly in a blog: GPUs have become the foundation of artificial intelligence.

AI Under the Hood A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.

Highly Tuned Tensor Cores Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper systems configuration puts in a single compute node a whopping 288 Arm cores and 16 petaflops of AI performance with up to 2.3 terabytes of high-speed memory.

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...
See more stories from nvidia

North America Stories

31/03/2026

PSSI Taps Appear X for Live Coverage of FIFA World Cup

Share Copy link Facebook X Linkedin Bluesky Email...

31/03/2026

OpenDrives Shows Off Sports Expertise in Sports Business...

OpenDrives, a leader in software-defined video and rich media storage management solutions, will demonstrate several new innovations at the 2026 NAB Show in Apr...

31/03/2026

SDVI to Showcase Platform Enhancements New Technology Int...

At the 2026 NAB Show, SDVI will demonstrate how its Rally media supply chain management platform continues to give media operations teams the tools they need to...

31/03/2026

Boland Communications Introduces QD4K315HDR10 QD-OLED Ser...

Boland Communications today announced that at the 2026 NAB Show, it will introduce its new QD4K315HDR10, a 31.5-inch QD-OLED monitor delivering exceptional colo...

31/03/2026

NUGEN Audio CEO Dr Paul Tapper to Lead Presentation About...

Dr. Paul Tapper, CEO of NUGEN Audio, will lead an essential industry discussion at the 2026 NAB Show, examining whether dialog intelligibility is poised to beco...

31/03/2026

BCNEXXT at NAB 2026 - Turning Industry Pressure into Oppo...

BCNEXXT will use NAB 2026 to focus on how broadcasters can turn today's operational and market pressures into opportunity. Meeting with customers in the IAB...

31/03/2026

Mediagenix Showcases Semantic Intelligence Powered Title...

Mediagenix, a global leader in smart content solutions to profitably connect the right content to the right audience, will showcase its latest innovations at th...

31/03/2026

DHD Introduces AI-Based Audio Noise Reduction to XD3 IP C...

DHD announces that AI-based audio noise reduction is now available as a powerful new option for its XD3 IP Core. Developed in cooperation with ai-coustics, audi...

31/03/2026

PTZOptics to showcase intelligent video and camera contro...

PTZOptics will showcase its vision for intelligent video and advanced camera automation at NAB Show 2026 (#N1902). Through immersive demonstrations and technolo...

31/03/2026

Experience Commerce Wins Social Media Mandate for ICONIQA...

Experience Commerce, a leading full-service digital marketing agency and part of the Cheil SWA Group, has secured the social media mandate for ICONIQA Hotels & ...

31/03/2026

Fabric to Showcase Cloud-Native Origin Platform and Next-...

Fabric, the entertainment industry's leader in data and operations solutions, will showcase a major expansion of its media technology platform at NAB 2026, ...

31/03/2026

Boland Communications Introduces New QD-OLED Series Monitors

Share Copy link Facebook X Linkedin Bluesky Email...

31/03/2026

Carr Warns NFL Over Streaming Rights, Consumer Costs

Share Copy link Facebook X Linkedin Bluesky Email...

31/03/2026

Farhan Khan Appointed FCC Chief Information Officer

Share Copy link Facebook X Linkedin Bluesky Email...

31/03/2026

Federal Judge Pauses Nexstar/Tegna Merger

Share Copy link Facebook X Linkedin Bluesky Email...

31/03/2026

Boris FX Acquires Vegas Pro, Sound Forge, and Acid Pro

Boris FX Acquires Vegas Pro, Sound Forge, and Acid Pro Jessie Electa Petrov March 30, 2026 0 Comments The acclaimed VFX developer bolsters its award-w...

31/03/2026

Berklee India Exchange Presents Anirudh Varma Collective

Berklee India Exchange Presents Anirudh Varma Collective The collective will present a workshop on Hindustani classical music and a concert in collaboration w...

30/03/2026

NAB 2026: Manifold to Demonstrate 400GbE COTS FPGA Support

Manifold Technologies, a Germany-based provider of cloud infrastructure for live broadcast production, will demonstrate support for 400GbE COTS FPGA accelerator...

30/03/2026

NAB 2026: Boland Communications Introduces QD-OLED Series Monitors

Boland Communications will introduce its QD4K315HDR10, a 31.5-inch QD-OLED monitor, at NAB Show 2026 (Booth C3519, April 18-22). The company is also introducing...

30/03/2026

NAB 2026: PTZOptics to Showcase Move 4K and Horizon Platform

PTZOptics will demonstrate its Move 4K PTZ cameras and Horizon web-based control platform at NAB Show 2026 (Booth N1902). Move 4K with Horizon is now available...

30/03/2026

NAB 2026: Net Insight to Showcase Updated Nimbra Edge

Net Insight will demonstrate the next version of Nimbra Edge, its orchestration and control layer for live media services across multi-domain environments, at N...

30/03/2026

NAB 2026: Appear to Showcase Live Production Processing

Appear ASA will exhibit at NAB Show 2026 (Booth W1531, April 19-22, Las Vegas). The company completed an IPO in November 2025. Our customer-first approach is ...

30/03/2026

NAB 2026: Harmonic Announces New Live Sports Streaming Capabilities

Harmonic has announced new capabilities for its sports streaming platform, covering multiview, programmatic advertising, in-stream advertising, and content wate...

30/03/2026

NAB 2026: Ateme to Showcase GenAI, Agentic AI, and Streaming

Ateme (Booth W1723) will demonstrate broadcast, streaming, and AI-driven media workflow solutions at NAB Show 2026. GenAI and Agentic AI Ateme will demonstrat...

30/03/2026

NAB 2026: Bitmovin's Player Web X Adds Advertising Support, Vertical Video, and Proprietary ABR Algorithm

Bitmovin has announced new capabilities for Player Web X, its web video player, ...

30/03/2026

NAB 2026: Brazil's Minister of Communications and FCC Commissioner To Speak

The 2026 NAB Show (April 18-22, exhibits April 19-22, Las Vegas Convention Center) will host Brazil's Minister of Communications, Frederico de Siqueira Filh...

30/03/2026

NAB 2026: EVS To Showcase Expanded Live Production Ecosystem

EVS will exhibit at NAB Show 2026 (Booth N1841), highlighting new products and updates across its live production portfolio, including the debut of T-Motion med...

30/03/2026

NAB 2026: Solid State Logic To Demonstrate Expanded Virtual System T Platform

Solid State Logic will demonstrate its virtualized System T platform at NAB Show 2026 (Booth C6907). Demonstrations will include the VTE1 virtual DSP engine, ne...

30/03/2026

NAB 2026: Globecast To Showcase Managed Media Services Approach

Globecast will exhibit at NAB Show 2026 (Booth W3335), highlighting its hybrid service model spanning satellite, IP, fiber, and cloud. The company will demonst...

30/03/2026

NAB 2026: IP Showcase Returns as IPMX Moves to Deployment

The Alliance for IP Media Solutions (AIMS), Advanced Media Workflow Association (AMWA), and the Video Services Forum (VSF) have announced that the IP Showcase w...

30/03/2026

NAB 2026: BBright To Demonstrate Single-Stream ST 2110 Playout

At NAB Show 2026 BBright will present a demonstration of its One Stream for the World concept, showing how a single ST 2110 playout stream can simultaneously ...

30/03/2026

NAB 2026: OpenDrives To Demonstrate New Storage and Edge Products

OpenDrives will demonstrate new products at NAB Show 2026, with two locations in the West Hall: a pod (W3443-E) in the Sports Business Hub and a cabana at W1158...

30/03/2026

Behind the Mic: Amazon Prime Hosts 90th Master Tournament With Host Terry Gannon

Behind The Mic provides a roundup of recent news regarding on-air talent, including new deals, departures, and assignments compiled from press releases and repo...

30/03/2026

Op-Ed: Preparing for Agentic AI in Live Sports

The economics of live sports streaming have changed. New rights models, cloud production tools, and lower-cost distribution have made it possible for high schoo...

30/03/2026

Government of Canada Selects MAS for Strategic Tanker Fleet Sustainment

CC-330 Husky. 2024 Eric Desbiens Photography. Used with permission for the announcement and related communications. No residual rights....

30/03/2026

L3Harris Included in MDA Space Solution for RCN ISTAR Program

L3Harris Technologies will provide WESCAM CMX -8 sensor systems for integration on new Uncrewed Aircraft Systems from MDA Space, enhancing the Royal Canadian Na...

30/03/2026

EVS to Debut T-Motion Robotics at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

30/03/2026

SDVI To Feature New Rally Media Supply Chain Management Enhancements

Share Copy link Facebook X Linkedin Bluesky Email...

30/03/2026

Boland Communications Introduces QD4K315HDR10 QD-OLED Series Monitors

Share Copy link Facebook X Linkedin Bluesky Email...

30/03/2026

Mileto Tecnologia accelerates streaming growth with Synam...

Synamedia today announced that Mileto Tecnologia, one of Brazil's largest pay-TV operators, has chosen the Synamedia Go platform to support its rapid OTT ex...

30/03/2026

FOR-A's Software-Defined, AI-Powered Development Advances with Nippon TV and NVIDIA Technology

FOR-A's Software-Defined, AI-Powered Development Advances with Nippon TV and...

30/03/2026

Give Your Astrophotography REAL Depth - After Effects Tutorial

Give Your Astrophotography REAL Depth - After Effects Tutorial Graham Quince March 30, 2026 0 Comments In this tutorial, I talk you through the full w...

30/03/2026

Alfalite returns to NAB Show alongside FOR-A, showcasing LED solutions for broadcast and mission-critical environments

Alfalite returns to NAB Show alongside FOR-A, showcasing LED solutions for broad...

30/03/2026

WideOrbit Announces New Name, New Features for Flagship Radio Automation Software

Introducing WO Aurora WideOrbit is pleased to introduce WO Aurora, a new name fo...

30/03/2026

Netflix Announces the Reunion for Love is Blind: Sweden Season 3 - Premiering April 2

Back to All News Netflix Announces the Reunion for Love is Blind: Sweden Season...

30/03/2026

Netflix unveils new images from the second season of 'Gangs of Galicia'

Back to All News Netflix unveils new images from the second season of Gangs of Galicia Entertainment 30 March 2026 GlobalSpain Link copied to clipboard Do...

30/03/2026

The Latest on Netflix Anime, Unveiled at AnimeJapan 2026

Back to All News The Latest on Netflix Anime, Unveiled at AnimeJapan 2026 Entertainment 30 March 2026 GlobalJapan Link copied to clipboard From romance an...

30/03/2026

Top 10 Reasons Government Meetings Need Transcriptions (and Why It Matters More Than Ever)

Tyngsboro, Mass., March 30, 2026 - City councils, county commissions, school boa...

29/03/2026

Victory+ Turns to Creator Economy, Bringing In Popular Women's Sports Influencer Coach Jackie J to Host Live NWSL Alt-Cast

Cloud-based production, real-time engagement, and creator-driven storytelling ai...

28/03/2026

Globecast Reimagines Managed Media Services for a Hybrid...

Globecast, the leading provider of broadcast, media and entertainment managed services, will showcase its reimagined approach to media operations at the 2026 NA...