Sony Pixel Power calrec Sony

Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare Earth metals - even the gold - of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs [they have] thereby centrally contributed to the recent progress in AI, Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs, it said.

NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.

ChatGPT Spread the News ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A RedHat software engineering team put it succinctly in a blog: GPUs have become the foundation of artificial intelligence.

AI Under the Hood A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.

Highly Tuned Tensor Cores Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper systems configuration puts in a single compute node a whopping 288 Arm cores and 16 petaflops of AI performance with up to 2.3 terabytes of high-speed memory.

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...
See more stories from nvidia

North America Stories

15/04/2026

BBC World Service TV Selects Open Broadcast Systems IP Decoders for Global Distribution

Open Broadcast Systems has announced that BBC World Service has selected its IP ...

15/04/2026

NAB 2026: LiveU Expands Collaboration with Sony to Include File-Based Workflow Integration

LiveU has announced an expansion of its collaboration with Sony Corporation, add...

15/04/2026

NAB 2026: Ateme and NVIDIA Announce Immersive Video Workflow for Apple Vision Pro

Ateme has announced a collaboration with NVIDIA to support live Apple Immersive ...

15/04/2026

Professional Fighters League Renews Multi-Year Partnership with DAZN DACH

The Professional Fighters League (PFL) has announced a multi-year partnership renewal with DAZN DACH, covering Germany, Switzerland, Austria, Liechtenstein, and...

15/04/2026

NAB 2026: Canon Sets New Benchmark with CINE-SERVO 40-1200m Lens; New Remote Camera Controller Supports Up to 200 Cameras

Canon U.S.A. (NAB Booth C3825) today took the lid off of the CINE-SERVO 40-1200m...

15/04/2026

NAB 2026: Panasonic and NEP Group to Demonstrate KAIROS and NEP Platform Integration

Panasonic Video and Audio Systems North America and NEP Group will demonstrate a...

15/04/2026

Exclusive Wasabi Report: AI Spending Is Surging, But ROI Tells a Different Story

For the fourth year running, independent analysts found businesses across all industries and verticals pay roughly the same amount in fees as they spend on stor...

15/04/2026

NBC Sports to Broadcast The Soccer Tournament Live on NBC, Peacock, and NBCSN, May 30-June 1

The Soccer Tournament (TST) has announced a media rights deal with NBC Sports to...

15/04/2026

NAB 2026: JB&A Announces Exhibitors for Pre-NAB 2026 Technology Event

JB&A will host the Pre-NAB 2026 Technology Event on April 17-18 at Flamingo Las Vegas, ahead of NAB Show. The event features hands-on demonstrations and technic...

15/04/2026

NAB 2026: Sennheiser Group to Exhibit with Spectera and AMBEO Updates

The Sennheiser Group will exhibit at NAB Show 2026 (Booth 4931, Central Hall), with demonstrations from Sennheiser, Neumann, and Merging across three areas: Rel...

15/04/2026

NAB 2026: NAB Show 2026 to Feature Expanded AI, Sports, and Creator Economy Programming

NAB Show 2026 will take place April 18-22 at the Las Vegas Convention Center, wi...

15/04/2026

NAB 2026: AI-Media Launches LEXI Text Encoder and LEXI Voice Encoder

AI-Media has announced the LEXI Text Encoder and LEXI Voice Encoder at NAB Show 2026, the company's first new encoder hardware release in more than a decade...

15/04/2026

NAB 2026: Cartoni Debuts New Camera Support Products

Italian camera support manufacturer Cartoni will introduce several new products at NAB Show 2026 (Booth C6540, Central Hall), including the Master 30 OB fluid h...

15/04/2026

NAB 2026: Lawo and swXtch.io Sign MOU to Explore groundSwXtch Integration

Lawo and swXtch.io have announced a memorandum of understanding at NAB Show 2026, under which Lawo will explore incorporating swXtch.io's groundSwXtch softw...

15/04/2026

NAB 2026: CacheFly to Demonstrate New CDN Features

CacheFly will exhibit at NAB Show 2026 (Booth W3129, April 19-22, Las Vegas Convention Center), showcasing three new additions to its content delivery platform:...

15/04/2026

NAB 2026: Synamedia Launches GO Shorts for Mobile-First Short-Form Video

Synamedia has announced GO Shorts, a new module within its Synamedia Go OTT platform that uses AI to convert an operator's existing content library into a s...

15/04/2026

NAB 2026 Preview, Central Hall: Everything You Need To Know Heading Into the Show

The NAB Show kicks off on Saturday, and the SVG and SVG Europe editorial teams a...

15/04/2026

AJA Video Systems to Acquire Video Encoding Software Company Comprimato

AJA Video Systems has announced an agreement to acquire Comprimato, a live video encoding and processing software company. The deal will unite the two companies...

15/04/2026

NBA Playoffs 2026: Prime Vision, Prime Insights Offer New Data-Driven Experiences for NBA Fans

Prime Video Sports' NBA Playoffs coverage, which includes the entire SoFi NB...

15/04/2026

Top Live-Sound-System Manufacturers Team Up To Better Manage Stadium Noise

Just announced, the SDE standard provides a unified method and file format to ensure consistent and reliably comparable noise predictions Sports and entertainm...

15/04/2026

WIDOW: The Mission Software Defining Rotary Strike

Image courtesy of MD Helicopters...

15/04/2026

L3Harris Announces Billion Dollar Expansion to Boost Solid Rocket Motor Production in Orange County, Virginia

Virginia Gov. Abigail Spanberger, L3Harris VP Mark Farley, and state and local l...

15/04/2026

Advancing America's Space Defense: L3Harris Completes Critical Milestone on Way to Delivering GBOSS Capability to the Warfighter

U.S. Space Forces Ground-Based Optical Sensor System upgrade at the Maui Space S...

15/04/2026

Winter Olympics, Super Bowl Power NBCU-Versant to Gold Medal Performance in Nielsen's February Gauge Reports

NBCU-Versant notches 13.1% of TV viewing in February, its best since August 2024...

15/04/2026

Nielsen CMI shows New Zealand's over-65s are a growing, cashed-up and still-working audience brands can't ignore

New data reveals older Kiwis are financially resilient, loyal to local products,...

15/04/2026

Autocue to Mark 2026 NAB Show Debut of its New PTZ Prompter

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Locality Deploys Nielsen's Media Data Engine

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Viant Announces Agreement to Acquire TVision

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Evergent introduces Agentic Revenue Orchestration Platfor...

Evergent introduces its Agentic Revenue Orchestration Platform, transforming how subscription businesses across direct-to-consumer streaming, pay-TV, telecommun...

15/04/2026

CentralCast Delivers Breakthrough Efficiencies to Public...

Harmonic's XOS Media Processor Delivers Exceptional Video Quality to More than Half of U.S. Public Media Viewership Harmonic (NASDAQ: HLIT) today announce...

15/04/2026

DPA N Series Wireless System Unlocks Duplex Gap and Guard...

LONGMONT, COLORADO, APRIL 15, 2026 DPA Microphones N Series Digital Wireless System users in North America can now take full advantage of the system's exc...

15/04/2026

Cobalt Iron Launches Compass Tape Gateway Modernizing IBM...

Cobalt Iron, a leading provider of SaaS-based enterprise data protection, today announced the launch of Compass Tape Gateway (CTG), a transformative enhancemen...

15/04/2026

Disguise to Showcase Cutting-Edge Experience Tech for Sports, Broadcast and More at NAB 2026

Disguise to Showcase Cutting-Edge Experience Tech for Sports, Broadcast and More...

15/04/2026

Arooj Aftab Makes the Music She Wants to Hear

Arooj Aftab Makes the Music She Wants to Hear The singular artist explores the juxtaposition of grief and joy, dark and light, in her distinctive sound. Apri...

15/04/2026

Panasonic, NEP Partner on IP-Based Live Production

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Encompass Digital Media Powers Global Cloud Transformatio...

Interra Systems, a provider of end-to-end quality assurance solutions for the digital media industry, is proud to announce its central role in the digital trans...

15/04/2026

Viaccess-Orca Powers MasOranges Multi-Brand TV Expansion...

VO Integrates Client-Side and Server-Side Ad Technologies with its Secure Video Player and Segmentation Tools, Leveraging Broadpeak's SSAI and BPK SmartLib ...

15/04/2026

TMT Insights Unveils AI at NAB 2026

TMT Insights, a leader in professional services and software development for the media and entertainment (M&E) industry, today announced the launch of TMT AI at...

15/04/2026

Encompass Digital Media and Oracle Cloud Infrastructure E...

Encompass Digital Media, a global leader in managed broadcast and cloud-based media services, today announced an expanded partnership with Oracle Cloud Infrastr...

15/04/2026

Middleman Software Debuts PhaseLock Technology and Expand...

Middleman Software, a leading provider of signaling and synchronization solutions for broadcast and streaming workflows, today announced it will unveil PhaseLoc...

15/04/2026

Deity Announces PR-4 Compact Field Recorder

Deity Microphones today announced the PR-4, a compact six-track field recorder featuring four inputs, 32-bit float recording, advanced routing, and a workflow-f...

15/04/2026

Blackmagic Design Announces Fairlight Live

Blackmagic Design Announces Fairlight Live Brie Clayton April 15, 2026 0 Comments Powerful software-based live audio mixer with support for SMPTE-2110...

15/04/2026

Haivision Launches Falkon X4 5G Transmitters

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

JB&A Spotlights Exhibitors Ahead of Pre-NAB 2026 Technology Event

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Wave Central, EVS Announce SMPTE ST 2110 Interop Collaboration

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Top Reason for Subscribing to Pay TV Is Live News and TV, Survey Finds

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

NAB Leadership Foundation Honors Outstanding Broadcasters

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Wisycom Expands MPR60 With New Multichannel IFB Mode

Share Copy link Facebook X Linkedin Bluesky Email...

15/04/2026

Thomas Riedel Acquires Premium Manufacturer ARRI

Strategic Alignment With the Riedel Group for Innovation and Growth Thomas Riedel, founder and owner of Riedel Communications and the Riedel Group, has acqui...

15/04/2026

Iyuno Deploys SDVI Rally Platform to Accelerate and Scale...

SDVI, the leading platform provider for cloud-native media supply chains, today announced that the company's Rally platform has been deployed by Iyuno Media...