
GPUs have been called the rare earth metals, even the gold, of artificial intelligence, because they're foundational for today's generative AI era.
Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:
GPUs employ parallel processing.
GPU systems scale up to supercomputing heights.
The GPU software stack for AI is broad and deep.
The net result is that GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference, as well as gains across a wide array of applications that use accelerated computing.
In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003, and price-performance has improved 5,600 times, it reported.
A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.
"GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs ... [they have] thereby centrally contributed to the recent progress in AI," Epoch said on its site.
A 2020 study assessing AI technology for the U.S. government drew similar conclusions.
"We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs," it said.
NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist, in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.
ChatGPT Spread the News
ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.
Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.
For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was launched.
In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.
A Red Hat software engineering team put it succinctly in a blog: "GPUs have become the foundation of artificial intelligence."
AI Under the Hood
A brief look under the hood shows why GPUs and AI make a powerful pairing.
An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.
For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.
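To make the "layers of linear algebra" picture concrete, here is a minimal sketch in plain Python. It is an illustration only, not GPU code: the tiny weight matrices, the ReLU activation and every function name are assumptions chosen for the example. The point it shows is that each output of a layer is an independent dot product, so the rows can be computed in any order or all at once, which is the property a GPU's many cores exploit.

```python
from concurrent.futures import ThreadPoolExecutor


def dot(row, vector):
    """One output value: a weighted sum of the inputs."""
    return sum(w * x for w, x in zip(row, vector))


def layer_serial(weights, inputs):
    """Compute the rows one after another, as a single core would."""
    return [dot(row, inputs) for row in weights]


def layer_parallel(weights, inputs, workers=4):
    """Hand each row to a worker. Python threads do not give a real
    speedup here; they only demonstrate that the rows are independent."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda row: dot(row, inputs), weights))


def relu(values):
    """A common activation function (an assumption for this sketch)."""
    return [max(0.0, v) for v in values]


def forward(layers, inputs, layer_fn=layer_serial):
    """Run the input through each layer of the 'lasagna' in turn."""
    activations = inputs
    for weights in layers:
        activations = relu(layer_fn(weights, activations))
    return activations


# Hypothetical toy model: 3 inputs -> 4 hidden values -> 2 outputs.
layers = [
    [[0.5, -1.0, 0.25], [1.0, 1.0, 1.0], [-0.5, 0.0, 2.0], [0.0, 0.5, 0.5]],
    [[1.0, 0.0, -1.0, 2.0], [0.5, 0.5, 0.5, 0.5]],
]
inputs = [1.0, 2.0, 3.0]

print(forward(layers, inputs))                  # [0.0, 7.0]
print(forward(layers, inputs, layer_parallel))  # same result, rows done concurrently
```

Both calls give the same answer because no row depends on another. A real model has the same structure with millions of rows per layer rather than a handful.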
Highly Tuned Tensor Cores
Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.
In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.
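Why precision needs managing at all can be seen with a small experiment. The sketch below is not the Transformer Engine's algorithm, which this article does not describe. It is a generic illustration of one widely used idea, scaling values before storing them in a narrower number format, and it uses 16-bit floats as a stand-in because that is the narrow format Python's standard library supports. The `choose_scale` heuristic, its `target` parameter and the sample values are assumptions made up for the example.

```python
import math
import struct


def to_half(value):
    """Round a Python float to IEEE 754 half precision and back."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


def choose_scale(values, target=1024.0):
    """Hypothetical heuristic: a power-of-two scale that moves the
    largest magnitude close to (but not above) `target`."""
    largest = max(abs(v) for v in values)
    if largest == 0.0:
        return 1.0
    return 2.0 ** math.floor(math.log2(target / largest))


def roundtrip_scaled(values):
    """Scale up, store in half precision, then scale back down."""
    scale = choose_scale(values)
    return [to_half(v * scale) / scale for v in values]


# Tiny values, such as small gradients, chosen for illustration.
values = [1e-8, 2e-8, 1.5e-8]

naive = [to_half(v) for v in values]
scaled = roundtrip_scaled(values)

print(naive)   # [0.0, 0.0, 0.0]: too small for half precision
print(scaled)  # close to the original values
```

Stored directly, all three values fall below what half precision can represent and become zero. With a suitable scale they survive with a relative error under 0.1 percent, at half the storage of a 32-bit float.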
Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.
Models Grow, Systems Expand
The complexity of AI models is expanding a whopping 10x a year.
The current state-of-the-art LLM, GPT-4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.
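A rough calculation shows what those parameter counts mean for memory. This is a back-of-the-envelope sketch: the parameter counts are the round bounds quoted above ("less than 100 million" and "more than a trillion"), while the two-bytes-per-parameter figure is an assumption (16-bit weights) and the estimate covers weights only, not the extra memory needed during training or inference.

```python
def weight_footprint_gb(num_parameters, bytes_per_parameter=2):
    """Memory for the weights alone, in decimal gigabytes.
    bytes_per_parameter=2 assumes 16-bit weights (an assumption)."""
    return num_parameters * bytes_per_parameter / 1e9


popular_2018_llm = 100_000_000          # upper bound quoted in the text
trillion_parameter_llm = 1_000_000_000_000  # lower bound quoted in the text

growth = trillion_parameter_llm / popular_2018_llm

print(weight_footprint_gb(popular_2018_llm))        # 0.2
print(weight_footprint_gb(trillion_parameter_llm))  # 2000.0
print(growth)                                       # 10000.0
```

Under these assumptions the weights alone grow from about 0.2 gigabytes to about 2 terabytes, a 10,000-fold jump, which is why the largest models have to be spread across many GPUs.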
In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.
For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.
Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper system configuration packs 288 Arm cores and 16 petaflops of AI performance, with up to 2.3 terabytes of high-speed memory, into a single compute node.
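The system figures above follow from simple multiplication of the per-superchip numbers, as this short sketch shows. The `Superchip` class and `aggregate` helper are names invented for the example, and the totals are peak figures: multiplying them out gives an upper bound, since real workloads rarely scale perfectly with the number of chips.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Superchip:
    """Per-superchip figures quoted in the text for the GH200."""
    cpu_cores: int = 72
    ai_petaflops: float = 4.0


def aggregate(chip, count):
    """Peak totals for `count` identical superchips (ideal scaling)."""
    return {
        "cpu_cores": chip.cpu_cores * count,
        "ai_petaflops": chip.ai_petaflops * count,
    }


gh200 = Superchip()

four_way = aggregate(gh200, 4)     # the four-way compute node
dgx_gh200 = aggregate(gh200, 256)  # the full DGX GH200

# 144 terabytes of shared memory spread over 256 superchips.
memory_per_chip_tb = 144 / 256

print(four_way)            # {'cpu_cores': 288, 'ai_petaflops': 16.0}
print(dgx_gh200)           # {'cpu_cores': 18432, 'ai_petaflops': 1024.0}
print(memory_per_chip_tb)  # 0.5625
```

The four-way totals match the 288 cores and 16 petaflops cited above, and 256 superchips work out to roughly an exaflop of peak AI performance.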
And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.
Software Covers the Waterfront
An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.
The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o