
GPUs have been called the rare Earth metals - even the gold - of artificial intelligence, because they're foundational for today's generative AI era.
Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:
GPUs employ parallel processing.
GPU systems scale up to supercomputing heights.
The GPU software stack for AI is broad and deep.
The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.
In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.
A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.
GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs [they have] thereby centrally contributed to the recent progress in AI, Epoch said on its site.
A 2020 study assessing AI technology for the U.S. government drew similar conclusions.
We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs, it said.
NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.
ChatGPT Spread the News ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.
Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.
For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.
In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.
A RedHat software engineering team put it succinctly in a blog: GPUs have become the foundation of artificial intelligence.
AI Under the Hood A brief look under the hood shows why GPUs and AI make a powerful pairing.
An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.
For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.
Highly Tuned Tensor Cores Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.
In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.
Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.
Models Grow, Systems Expand The complexity of AI models is expanding a whopping 10x a year.
The current state-of-the-art LLM, GPT4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.
In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.
For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.
Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper systems configuration puts in a single compute node a whopping 288 Arm cores and 16 petaflops of AI performance with up to 2.3 terabytes of high-speed memory.
And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.
Software Covers the Waterfront An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.
The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
North America Stories
18/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
18/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
18/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
18/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
18/04/2026
New York, NY, April 17, 2026 -- TAG Video Systems has announced an integration with Amazon Web Services (AWS) Elemental MediaConnect Router. The integration bri...
18/04/2026
Pro Sound Effects (PSE), the leading provider of professionally recorded sound effects for film, television, advertising, and technology today announced the lau...
18/04/2026
Fincons Group, a multinational IT business consulting and system integrator firm, announced today that it has achieved Amazon Web Services (AWS) Media & Enterta...
18/04/2026
Adobe extends leadership in video: unleashing new AI-powered creation in Firefly...
17/04/2026
The World Fencing League (WFL) has announced DAZN as its primary global streamin...
17/04/2026
ARRI Releases ALEXA 35 SUP 6.0 and LPS-1 SUP 1.3 Software Updates
ARRI has announced software updates for its ALEXA 35 Live camera and Live Production System L...
17/04/2026
Sennheiser has announced Spectera Studio, an offline system planner for its Spec...
17/04/2026
Deltatre and the German Football Association (DFB) have announced DFB.TV+, a DFB-owned direct-to-consumer streaming service developed and operated by Deltatre. ...
17/04/2026
Global sports marketing agency IMG has announced new senior leadership roles, designed to strengthen how it supports rightsholders and partners in the midst of ...
17/04/2026
NAGRAVISION and Harmonic have announced a watermarking-as-a-service solution for...
17/04/2026
NETGEAR and EVS Broadcast Equipment have announced a global technology partnersh...
17/04/2026
America's broadcasters are launching the NEXTGEN TV Converter Box Program, a new initiative designed to provide millions of American viewers with a low-cost...
17/04/2026
Quantum Corp. will exhibit at NAB Show 2026 (Booth N1726), presenting what it ca...
17/04/2026
Leadership and Staff Announce '20/20 Vision' Playbook...
17/04/2026
TNT Sports has announced a multi-year agreement for U.S. media rights to the FIA World Endurance Championship (WEC). Three events will air on truTV - the 24 Hou...
17/04/2026
Scripps Sports has announced a multi-year broadcast partnership with PBR (Profes...
17/04/2026
Adder Technology, a specialist in connectivity solutions and high performance IP KVM, today announced the latest release of AIM, its IP KVM matrix management so...
17/04/2026
Victory , a free sports streaming platform from A Parent Media Co. Inc. (APMC), has announced a multi-year content distribution partnership with the Dallas Cowb...
17/04/2026
In-venue and creative video staffers at the professional and collegiate level have one major thing in common: the intensity and attention to detail ramps up dur...
17/04/2026
Quickplay has announced the full-scale deployment of Gray Media's streaming ...
17/04/2026
Clear-Com has introduced the FreeSpeak Cell cellular-based wireless intercom system that uses LTE and 5G infrastructure to support large-scale production commun...
17/04/2026
The 2026 NAB Show kicks off Saturday, April 18, with the show floor and exhibits opening April 19-22 at the Las Vegas Convention Center. The show features more ...
17/04/2026
Ratings Roundup is a rundown of recent rating news and is derived from press rel...
17/04/2026
L3Harris towed array systems provide U.S. Navy submarines with extended acoustic...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Cobalt Digital and SineSix Media Partner to Transform Accessibility Compliance into a More Engaging Viewer Experience at NAB Show 2026
Collaboration integrates...
17/04/2026
Strengthening Appear's North America team with a new Vice President of Business Development
Appear ASA (Appear, OSE: APR), a global leader in live producti...
17/04/2026
New framework helps broadcasters, streaming platforms, and sports organizations apply AI to live video for monetization, metadata, highlights, and downstream wo...
17/04/2026
New whitepaper gives broadcasters and OTT operators independent, codec-by-codec evidence that the VisualOn Optimizer transforms viewer quality of experience.
V...
17/04/2026
Blackmagic Design Announces New Blackmagic URSA Cine Immersive 100G
Brie Clayton April 17, 2026
0 Comments
World's first immersive cinema camera f...
17/04/2026
NAB 2026: Vubiquity and Eluvio Showcase Streaming Solution that Significantly Re...
17/04/2026
What Makes a Good Marathon Running Playlist? We asked Xander Dawson, an eighth-semester saxophone major at Boston Conservatory running the Boston Marathon thi...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
At NAB Show 2026, PTZOptics (Booth N1902) will showcase a live sports streaming demo created in collaboration with Moondream, offering a new look at how Visual ...
17/04/2026
The Eindhoven University of Technology is a research university in the Netherlands spanning 25 buildings, specialising in engineering, science and technology. D...
17/04/2026
Documentary Editor - US, Remote
Brie Clayton April 17, 2026
0 Comments
Documentary Editor
April 13, 2026Freelance Video Cameraman - Los......
17/04/2026
Blackmagic Design Announces New Blackmagic URSA Cine 12K LF 100G
Brie Clayton April 17, 2026
0 Comments
Revolutionary digital film camera brings cinem...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
17/04/2026
Dubformer, the AI-powered dubbing platform, and Adapt, a global leader in AI-driven localization and dubbing solutions, have enabled Goalcast, the inspirational...