Sony Pixel Power calrec Sony

Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare Earth metals - even the gold - of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs [they have] thereby centrally contributed to the recent progress in AI, Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs, it said.

NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.

ChatGPT Spread the News ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A RedHat software engineering team put it succinctly in a blog: GPUs have become the foundation of artificial intelligence.

AI Under the Hood A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.

Highly Tuned Tensor Cores Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper systems configuration puts in a single compute node a whopping 288 Arm cores and 16 petaflops of AI performance with up to 2.3 terabytes of high-speed memory.

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...
See more stories from nvidia

North America Stories

11/07/2025

2025 Sundance Institute Producers Lab Fellows Announced

PARK CITY, UTAH, July 11, 2025 - The nonprofit Sundance Institute announced today the 11 producers chosen for its annual Producers Labs, returning to Ucross Fou...

11/07/2025

L3Harris Delivers First P-8A Poseidon Aircraft to US Navy

L3Harris Technologies President of Intelligence, Surveillance and Reconnaissance Jason Lambert and General Manager of L3Harris Waco facility Sean Ling held a ce...

11/07/2025

WETA Launches WETA+ Free Streaming Service

ARLINGTON, Va. WETA, the flagship public media station in the national capital area, has launched WETA+, a new streaming service tailored for the local Washingt...

11/07/2025

TV Tech's Top Regulatory Stories of 2025

The Federal Communications Commission has emerged as one of the central players in the broadcast TV landscape in 2025, with its deregulatory policies sparking h...

11/07/2025

Calrec to Feature Suite of Interconnected Audio Solutions at IBC2025

Calrec will introduce usability, customization and system enhancements across its entire range of Argo consoles during IBC2025, Sept. 12-15, at the RAI Amsterda...

11/07/2025

Encompass Supports DAZN's Coverage of 2025 FIFA Club World Cup

LONDON Encompass Digital Media said it will support live and on-demand viewing of the 2025 FIFA Club World Cup across multiple global regions for sports enterta...

11/07/2025

SBE Survey: Certified Broadcast Engineers Earn More

Two-thirds of broadcast engineers reaped the benefits of a pay raise within the last year....

11/07/2025

SmallHD Unveils Quantum 27 OLED Monitor

CARY, N.C. SmallHD has launched the Quantum 27, a new 26.5-inch Quantum-Dot OLED monitor designed to deliver postproduction image quality in a compact, set-frie...

11/07/2025

Tegna Will Pay $225K to Settle FCC Investigation

The Federal Communications Commission's Enforcement Bureau and Tegna have entered into a consent decree that will settle an investigation into the accidenta...

11/07/2025

Sens. Markey, Lujn Again Call for FCC Vote on Paramount-Skydance Merger

WASHINGTON Following news in early July that Paramount had settled President Donald Trump's lawsuit, Sens. Edward J. Markey (D-Mass.) and Ben Ray Luj n (D-N...

11/07/2025

Model/Actriz Performs Lead Single Cinderella on The Late Show with Stephen Colbert

Model/Actriz Performs Lead Single Cinderella on The Late Show with Stephen Colbe...

11/07/2025

Behind the Mic: Amazon Prime Preps for First Season of NBA Action; MSG Networks Adjusts Broadcast Booths for Rangers, Devils

Behind the Mic: Amazon Prime Preps for First Season of NBA Action; MSG Networks ...

11/07/2025

SVG New Sponsor Spotlight: Suite Studios' Craig Hering on Adapting to Clients' Needs With Scalable Cloud-Based Storage

SVG New Sponsor Spotlight: Suite Studios' Craig Hering on Adapting to Client...

11/07/2025

2025 SVG Content Management Forum Breaks Down AI's Impact, Continued Transition to the Cloud

2025 SVG Content Management Forum Breaks Down AI's Impact, Continued Transit...

11/07/2025

A Journey HOME: University of Nebraska's HuskerVision Goes IP

A Journey HOME: University of Nebraska's HuskerVision Goes IP Leaders from the HuskerVision and Lawo share their IP learnings By SVG Staff Friday, July 1...

11/07/2025

CMSI, Remote Picture Labs, Ace ESPN's Cloud-Based Editing Efforts for Wimbledon

CMSI, Remote Picture Labs, Ace ESPN's Cloud-Based Editing Efforts for Wimble...

11/07/2025

Netflix Enters the Live-Boxing-Production Ring for Round 2 With Historic Taylor-Serrano 3 Card at MSG

Netflix Enters the Live-Boxing-Production Ring for Round 2 With Historic Taylor-...

11/07/2025

'Too Hot to Handle: Italy' Is Coming on July 18 Only on Netflix

Back to All News Too Hot to Handle: Italy Is Coming on July 18 Only on Netflix Entertainment 11 July 2025 GlobalItaly Link copied to clipboard July 11, 20...

11/07/2025

Netflix Will Release 'Death Inc.' Seasons 1, 2 and 3

Back to All News Netflix Will Release Death Inc. Seasons 1, 2 and 3 Entertainment 11 July 2025 GlobalSpain Link copied to clipboard Season 1 Season 2 Se...

11/07/2025

A Gaming GPU Helps Crack the Code on a Thousand-Year Cultural Conversation

Ceramics - the humble mix of earth, fire and artistry - have been part of a global conversation for millennia. From Tang Dynasty trade routes to Renaissance pa...

10/07/2025

Nielsen Appoints Richard Pacheco as Head of Global Partnerships

NEW YORK - July 10, 2025 - Nielsen, the global leader in audience measurement, data and analytics, today announced that it appointed Richard Pacheco as head of ...

10/07/2025

Sponsored: Robotic Deployments Are Transforming Local News

Local newscasts don't exist in a vacuum. News directors and station management constantly evaluate what's working, what isn't and perhaps most impor...

10/07/2025

Stuttgart Media University Upgrades Studio with Lawo mc56

Lawo has announced that Stuttgart Media University (Hochschule der Medien, HdM) has comprehensively modernized its central recording studio after selecting an I...

10/07/2025

SMPTE Opens Early Bird Registration for Media Technology Summit

The Society of Motion Picture and Television Engineers (SMPTE) has opened early-bird registration for the Media Technology Summit, which will take place in a ne...

10/07/2025

TNDV Television Launches Aspiration 35 to Support Cinematic Workflows

NASHVILLE, Tenn. TNDV Television has launched Aspiration 35, a new version of its 40-foot Aspiration truck reimagined for cinematic multicamera productions....

10/07/2025

Key Code Education Launches Beginner, Intermediate Training Courses

BURBANK, Calif. Key Code Education, a provider of instructor-led postproduction training, is growing its curriculum with new programs for beginner and intermedi...

10/07/2025

Actus Digital to Show Actus X Intelligent Monitoring With AI at IBC2025

HACKENSACK, N.J. Actus Digital will demonstrate how broadcasters can transform compliance monitoring from a necessary expense into a strategic revenue driver at...

10/07/2025

Comments on FCC Ownership Rules Due in August

The Federal Register has published a summary of the Federal Communications Commission's Public Notice seeking comments on its ownership rules that lists a d...

10/07/2025

Netflix Presents the Official Trailer for 'Superestar'

Back to All News Netflix Presents the Official Trailer for SuperestarPlay Video Play Video Entertainment 10 July 2025 GlobalSpain Link copied to clipboard...

10/07/2025

From Terabytes to Turnkey: AI-Powered Climate Models Go Mainstream

In the race to understand our planet's changing climate, speed and accuracy are everything. But today's most widely used climate simulators often strugg...

10/07/2025

Indonesia on Track to Achieve Sovereign AI Goals With NVIDIA, Cisco and IOH

As one of the world's largest emerging markets, Indonesia is making strides toward its Golden 2045 Vision - an initiative tapping digital technologies and...

10/07/2025

5G for All? What the DFL's Use of Easy5G, RefCam Could Mean for Events in the Future

5G for all? What the DFL's use of Easy5G and RefCam could mean for events in...

10/07/2025

Save the Date: PGA TOUR Studios Welcomes SVG Remote Production Summit on Oct 14-15

Save the Date: PGA TOUR Studios Welcomes SVG Remote Production Summit on Oct 14-...

10/07/2025

Cloud on the Road: How Remote-Production-Service Providers Are Adapting to a New Era

Cloud on the Road: How Remote-Production-Service Providers Are Adapting to a New...

10/07/2025

Seattle Kraken's Ryan Schaber on the NHL Team Taking Live Game Productions In-House

Seattle Kraken's Ryan Schaber on the NHL Team Taking Live Game Productions I...

10/07/2025

FOX Sports Reboots Small Control Room in Los Angeles as Hub for Vertical-First Production

FOX Sports Reboots Small Control Room in Los Angeles as Hub for Vertical-First P...

10/07/2025

SVG Sit-Down: MSE's Zach Leonsis, ViewLift's Rick Allen Go Deep on Joint Venture Targeting Local-Sports-Media Market

SVG Sit-Down: MSE's Zach Leonsis, ViewLift's Rick Allen Go Deep on Joint...

10/07/2025

Bringing Culture Into Focus on My Brilliant Career': First Nations Voices Reshaping Storytelling on Set

Back to All News Bringing Culture Into Focus on My Brilliant Career': Firs...

10/07/2025

Daktronics and Grass Valley Announce Strategic Partnership to Deliver End-to-End Venue Solutions

Strategic Alliance Combines Daktronics' LED Display and Content Management S...

10/07/2025

Reach the PEAK' on GeForce NOW

Grab a friend and climb toward the clouds - PEAK is now available on GeForce NOW, enabling members to try the hugely popular indie hit on virtually any device. ...

10/07/2025

How to Run Coding Assistants for Free on RTX AI PCs and Workstations

Coding assistants or copilots - AI-powered assistants that can suggest, explain and debug code - are fundamentally changing how software is developed for both e...

09/07/2025

Through Their Lens: What Cinematographer Jomo Fray Saw at the 2025 Directors Lab

By Bailey Pennick There's something arresting about the way Jomo Fray captures the world. The cinematographer, now best known for his unparalleled work on ...

09/07/2025

How Sencore is Upgrading IPTV for the Hospitality Industry

Key Highlights Centralized management interface for full control, monitoring, and diagnostics Scalable, multi-site OTT decryption and distribution Secure int...

09/07/2025

L3Harris Appoints Rob Mitrevski to Lead Enterprise Pursuit of Golden Dome

MELBOURNE, Fla., July 9, 2025 - L3Harris Technologies (NYSE: LHX) has appointed Rob Mitrevski as President, Golden Dome Strategy and Integration, a new role cre...

09/07/2025

TAM Ireland awards programme data harmonisation contract to MetaBroadcast and Nielsen's Gracenote

Collaboration will result in improved data quality and understanding of genre-le...

09/07/2025

NAB Slams NextGen TV Critics for Protecting Their Turf

The National Association of Broadcasters is hitting back at critics who oppose its proposal to phase out the current ATSC 1.0 DTV over-the-air standard and tran...

09/07/2025

Zeam Launches on LG Smart TVs

Zeam Media's hyperlocal streaming platform Zeam has announced a new distribution deal with LG that will bring the streaming service to LG smart TVs and devi...

09/07/2025

TAM Ireland awards programme data harmonisation contract...

MetaBroadcast, the UK's leading metadata management specialist, announced today that it was awarded a three-year contract from TAM Ireland (Television Audie...

09/07/2025

Bitfocus transforms complex control for any media applica...

More than 700 professional devices and applications already integrated through open software Bitfocus, the specialist in media control and monitoring, is show...

09/07/2025

Actus Digital Transforms Broadcast Compliance with AI-Pow...

Actus Digital, a LiveU company, will demonstrate how broadcasters can transform compliance monitoring from a necessary expense into a strategic revenue driver a...