Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare earth metals, even the gold, of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

"GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs ... [they have] thereby centrally contributed to the recent progress in AI," Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

"We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs," it said.

"NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years," said Bill Dally, the company's chief scientist, in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.
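The multiples quoted above are easier to compare as yearly rates. The following is a minimal sketch, assuming smooth compound growth and a 20-year span (2003 to 2023) for the Stanford figure; the function name is illustrative and not taken from either report.

```python
def annual_growth_factor(total_multiple: float, years: float) -> float:
    """Return the constant yearly factor that compounds to total_multiple."""
    return total_multiple ** (1.0 / years)


# 7,000x GPU performance gain since 2003; the 20-year span is an assumption.
stanford = annual_growth_factor(7_000, 20)

# 1,000x inference gain over ten years, per the Hot Chips keynote.
dally = annual_growth_factor(1_000, 10)

print(f"7,000x over 20 years is about {stanford:.2f}x per year")
print(f"1,000x over 10 years is about {dally:.2f}x per year")
```

Under these assumptions, the ten-year inference figure works out to roughly a doubling every year.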

ChatGPT Spread the News

ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark launched.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A Red Hat software engineering team put it succinctly in a blog: "GPUs have become the foundation of artificial intelligence."

AI Under the Hood

A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.
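The "layer upon layer" picture can be sketched in a few lines. This is a toy example with made-up weights, not a real model: each layer is a matrix-vector product plus a bias, followed by a simple nonlinearity (ReLU, chosen here only for illustration).

```python
def matvec(weights, vector):
    """Multiply a matrix (a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in weights]


def relu(vector):
    """Clamp negative values to zero."""
    return [max(0.0, v) for v in vector]


def forward(layers, inputs):
    """Pass the inputs through every (weights, biases) layer in order."""
    activations = inputs
    for weights, biases in layers:
        z = matvec(weights, activations)
        activations = relu([zi + b for zi, b in zip(z, biases)])
    return activations


# Two hypothetical layers: 2 inputs -> 3 hidden values -> 1 output.
layers = [
    ([[1.0, -1.0], [0.5, 0.5], [-1.0, 2.0]], [0.0, 0.0, 0.0]),
    ([[1.0, 1.0, 1.0]], [0.5]),
]

output = forward(layers, [2.0, 1.0])
print(output)
```

Real models stack far more layers with far larger matrices, but the arithmetic in each layer has this same shape.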

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.
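What makes this math suit parallel hardware is that each output value depends only on its own row of weights, so the rows can be processed at the same time. The sketch below uses a CPU thread pool as a stand-in for GPU cores; that is an assumption made for illustration, and in Python it demonstrates the independence of the work rather than any speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def dot(row, vector):
    """Dot product of one weight row with the input vector."""
    return sum(w * x for w, x in zip(row, vector))


def matvec_serial(weights, vector):
    """Compute each output element one after another."""
    return [dot(row, vector) for row in weights]


def matvec_parallel(weights, vector, workers=4):
    """Hand each row to a worker; results come back in row order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda row: dot(row, vector), weights))


weights = [[float(i + j) for j in range(4)] for i in range(8)]
vector = [1.0, 2.0, 3.0, 4.0]

# No row needs another row's result, so both orders of work agree.
print(matvec_parallel(weights, vector) == matvec_serial(weights, vector))
```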

Highly Tuned Tensor Cores

Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.
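The trade-off behind precision selection can be illustrated with a toy heuristic. This is not NVIDIA's algorithm: the function names, the relative-error criterion, and the tolerance are all assumptions, and the formats are limited to the widths Python's `struct` module can pack.

```python
import struct

# Candidate formats, narrowest first (IEEE 754 half, single, double).
FORMATS = [("float16", "<e"), ("float32", "<f"), ("float64", "<d")]


def round_trip(value, fmt):
    """Store a value in the given format and read it back."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


def worst_relative_error(values, fmt):
    """Largest relative rounding error over the nonzero values."""
    errors = [abs(round_trip(v, fmt) - v) / abs(v) for v in values if v != 0.0]
    return max(errors, default=0.0)


def narrowest_format(values, tolerance):
    """Pick the narrowest format whose rounding error stays within tolerance."""
    for name, fmt in FORMATS:
        try:
            if worst_relative_error(values, fmt) <= tolerance:
                return name
        except OverflowError:
            # The value is too large for this format; try a wider one.
            continue
    raise ValueError("no format meets the tolerance")


print(narrowest_format([0.5, 0.25, 1.5], 1e-3))
print(narrowest_format([0.1, 3.14159], 1e-6))
```

Narrower formats move less data per value, which is why choosing the lowest precision that still preserves accuracy pays off.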

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand

The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT-4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.
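As a rough check, the two parameter counts can be related to the 10x-a-year rate. The sketch treats "less than 100 million" and "more than a trillion" as exact bounds, which is an assumption; the true ratio is at least this large.

```python
import math


def years_at_rate(start: float, end: float, yearly_factor: float) -> float:
    """Years of steady compounding needed to grow from start to end."""
    return math.log10(end / start) / math.log10(yearly_factor)


start_params = 100e6  # upper bound for the 2018 model
end_params = 1e12     # lower bound for the current model

growth = end_params / start_params
years = years_at_rate(start_params, end_params, 10)

print(f"Growth of at least {growth:,.0f}x")
print(f"About {years:.1f} years at 10x per year")
```

A 10,000x increase corresponds to about four years of 10x-a-year growth, which is in line with the stated pace.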

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper system configuration packs a whopping 288 Arm cores and 16 petaflops of AI performance, with up to 2.3 terabytes of high-speed memory, into a single compute node.
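The system figures above follow from simple multiplication and division. The sketch below assumes 1 terabyte equals 1,024 gigabytes; the class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Superchip:
    """Per-superchip figures quoted in the text."""
    arm_cores: int = 72
    ai_petaflops: int = 4


def node_totals(chip: Superchip, count: int):
    """Total Arm cores and AI petaflops for a node with `count` superchips."""
    return chip.arm_cores * count, chip.ai_petaflops * count


cores, pflops = node_totals(Superchip(), 4)
print(f"Four-way node: {cores} Arm cores, {pflops} petaflops")

# DGX GH200: 144 terabytes shared across 256 superchips.
# Assumption: 1 TB = 1,024 GB.
per_chip_gb = 144 * 1024 / 256
print(f"Shared memory per superchip: {per_chip_gb:.0f} GB")
print(f"Four superchips: {4 * per_chip_gb:.0f} GB")
```

Four times the per-superchip share comes to about 2,300 gigabytes, close to the "up to 2.3 terabytes" quoted for the four-way node, though the unit convention is an assumption.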

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront

An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...