Sony Pixel Power calrec Sony

Why GPUs Are Great for AI

04/12/2023

GPUs have been called the rare Earth metals - even the gold - of artificial intelligence, because they're foundational for today's generative AI era.

Three technical reasons, and many stories, explain why that's so. Each reason has multiple facets well worth exploring, but at a high level:

GPUs employ parallel processing.

GPU systems scale up to supercomputing heights.

The GPU software stack for AI is broad and deep.

The net result is GPUs perform technical calculations faster and with greater energy efficiency than CPUs. That means they deliver leading performance for AI training and inference as well as gains across a wide array of applications that use accelerated computing.

In its recent report on AI, Stanford's Human-Centered AI group provided some context. GPU performance has increased roughly 7,000 times since 2003 and price per performance is 5,600 times greater, it reported.

A 2023 report captured the steep rise in GPU performance and price/performance. The report also cited analysis from Epoch, an independent research group that measures and forecasts AI advances.

GPUs are the dominant computing platform for accelerating machine learning workloads, and most (if not all) of the biggest models over the last five years have been trained on GPUs [they have] thereby centrally contributed to the recent progress in AI, Epoch said on its site.

A 2020 study assessing AI technology for the U.S. government drew similar conclusions.

We expect [leading-edge] AI chips are one to three orders of magnitude more cost-effective than leading-node CPUs when counting production and operating costs, it said.

NVIDIA GPUs have increased performance on AI inference 1,000x in the last ten years, said Bill Dally, the company's chief scientist in a keynote at Hot Chips, an annual gathering of semiconductor and systems engineers.

ChatGPT Spread the News ChatGPT provided a powerful example of how GPUs are great for AI. The large language model (LLM), trained and run on thousands of NVIDIA GPUs, runs generative AI services used by more than 100 million people.

Since its 2018 launch, MLPerf, the industry-standard benchmark for AI, has provided numbers that detail the leading performance of NVIDIA GPUs on both AI training and inference.

For example, NVIDIA Grace Hopper Superchips swept the latest round of inference tests. NVIDIA TensorRT-LLM, inference software released since that test, delivers up to an 8x boost in performance and more than a 5x reduction in energy use and total cost of ownership. Indeed, NVIDIA GPUs have won every round of MLPerf training and inference tests since the benchmark was released in 2019.

In February, NVIDIA GPUs delivered leading results for inference, serving up thousands of inferences per second on the most demanding models in the STAC-ML Markets benchmark, a key technology performance gauge for the financial services industry.

A RedHat software engineering team put it succinctly in a blog: GPUs have become the foundation of artificial intelligence.

AI Under the Hood A brief look under the hood shows why GPUs and AI make a powerful pairing.

An AI model, also called a neural network, is essentially a mathematical lasagna, made from layer upon layer of linear algebra equations. Each equation represents the likelihood that one piece of data is related to another.

For their part, GPUs pack thousands of cores, tiny calculators working in parallel to slice through the math that makes up an AI model. This, at a high level, is how AI computing works.

Highly Tuned Tensor Cores Over time, NVIDIA's engineers have tuned GPU cores to the evolving needs of AI models. The latest GPUs include Tensor Cores that are 60x more powerful than the first-generation designs for processing the matrix math neural networks use.

In addition, NVIDIA Hopper Tensor Core GPUs include a Transformer Engine that can automatically adjust to the optimal precision needed to process transformer models, the class of neural networks that spawned generative AI.

Along the way, each GPU generation has packed more memory and optimized techniques to store an entire AI model in a single GPU or set of GPUs.

Models Grow, Systems Expand The complexity of AI models is expanding a whopping 10x a year.

The current state-of-the-art LLM, GPT4, packs more than a trillion parameters, a metric of its mathematical density. That's up from less than 100 million parameters for a popular LLM in 2018.

In a recent talk at Hot Chips, NVIDIA Chief Scientist Bill Dally described how single-GPU performance on AI inference expanded 1,000x in the last decade. GPU systems have kept pace by ganging up on the challenge. They scale up to supercomputers, thanks to their fast NVLink interconnects and NVIDIA Quantum InfiniBand networks.

For example, the DGX GH200, a large-memory AI supercomputer, combines up to 256 NVIDIA GH200 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of shared memory.

Each GH200 superchip is a single server with 72 Arm Neoverse CPU cores and four petaflops of AI performance. A new four-way Grace Hopper systems configuration puts in a single compute node a whopping 288 Arm cores and 16 petaflops of AI performance with up to 2.3 terabytes of high-speed memory.

And NVIDIA H200 Tensor Core GPUs announced in November pack up to 288 gigabytes of the latest HBM3e memory technology.

Software Covers the Waterfront An expanding ocean of GPU software has evolved since 2007 to enable every facet of AI, from deep-tech features to high-level applications.

The NVIDIA AI platform includes hundreds of software libraries and apps. The CUDA programming language and the cuDNN-X library for deep learning provide a base on top of which developers have created software like NVIDIA NeMo, a framework to let users build, customize and run inference on their o
LINK: https://blogs.nvidia.com/blog/why-gpus-are-great-for-ai/...
See more stories from nvidia

North America Stories

18/09/2025

The Gauge: Poland | August 2025

August, the last month of the summer vacation, brought stability to the amount of time spent in front of the TV screen. The data shows that Polish viewers watch...

18/09/2025

FCC Commissioner Gomez Blasts ABC for Suspending Jimmy Kimmel

WASHINGTON The controversy over ABC's decision to suspend Jimmy Kimmel, continues to heat up, with FCC Commissioner Anna M. Gomez blasting ABCs decision to ...

18/09/2025

FOR-A America Appoints Jo Aun to Lead US Product Developm...

FOR-A America named Jo Aun as Senior Manager of Product Management and Engineering, reporting directly to Satoshi Kanemura, President and COO. In this new posit...

18/09/2025

MASV Highlights Partner Integrations Fueling Modern Media...

MASV (massive.io), the fastest and most reliable large file transfer platform for media professionals and an IDC Innovator 2025 for Media & Entertainment*, toda...

18/09/2025

Dish TV India partners with ThinkAnalytics to power AI-dr...

ThinkAnalytics, the global leader in video content discovery and persaonalization, today announced that Dish TV India, the leading content distribution company,...

18/09/2025

DirecTV Adds Three Sports Channels to Its FAST Streaming Offering

DirecTV continues to expand its women's sports offerings by adding Sports Fanatics' and Whoopi Goldberg's Free Ad-Supported Television (FAST) All Wo...

18/09/2025

Fall Football Kicks Off With Native HDR Production, NextGen TV Telecast

WASHINGTON NextGen TV viewers are beginning to see what high dynamic range (HDR) brings to their football enjoyment with the launch of the sport's fall seas...

18/09/2025

FCC Extends Deadline for EEO Audit Replies

WASHINGTON After issuing audit letters seeking Equal Employment Opportunity data from a randomly selected group of TV and radio stations in August, the Federal ...

18/09/2025

Warner Bros. Discovery Signs New Deal with Nielsen

NEW YORK Warner Bros. Discovery and Nielsen have signed a new, long-term, multi-year deal that covers measurement for all Warner Bros. Discovery platforms acros...

18/09/2025

CIMM Launches Startup Program and Innovation Showcase

NEW YORK As part of its ongoing efforts to develop better measurement solutions, the Coalition for Innovative Media Measurement (CIMM) announced the launch of t...

18/09/2025

UPDATED: ABC Takes 'Jimmy Kimmel Live! Off the Air Indefinitely

IRVING, Texas Following threats from the Federal Communications Commission and the announcement that the nations largest station group, Nexstar Media Group, wou...

18/09/2025

GeForce NOW Unleashes Dying Light: The Beast' in the Cloud

GeForce NOW is packing a monstrous punch this week. Dying Light: The Beast, the latest adrenaline fueled chapter in Techland's parkour meets survival horror...

17/09/2025

Tech Focus: Audio Training, Part 2 - Manufacturers Offer Extensive Online Learning

Tech Focus: Audio Training, Part 2 - Manufacturers Offer Extensive Online Learni...

17/09/2025

Tech Focus: Audio Training, Part 1 - A1 Shortage Remains a Major-League Challenge for Sports Broadcasting

Tech Focus: Audio Training, Part 1 - A1 Shortage Remains a Major-League Challeng...

17/09/2025

The Gauge: Mexico August 2025

During August, streaming's share of TV viewing in Mexico showed an increase of 0.4% compared to the previous month, accounting for 25% of TV viewing. Discl...

17/09/2025

Jo Aun Joins FOR-A America as Senior Manager, Product Engineering

CYPRESS, Calif. FOR-A America has named Jo Aun as senior manager of product engineering, a new role responsible for guiding the planning, development and rollou...

17/09/2025

PlayBox Neo and CIS Group Power CazeTV with a seamless Pl...

PlayBox Neo, in partnership with CIS Group, a leading provider of media and broadcast technology solutions, has successfully deployed PlayBox Neo's Dual Cha...

17/09/2025

Energy Regulatory Agency Underscores Commitment with Ene...

In a relationship that mirrors societal advances in sustainability, Brightline Lighting and the Federal Energy Regulatory Commission (FERC) Headquarters have en...

17/09/2025

Clear-Com Powers Star-Studded Communications at Houston A...

Clear-Com is proud to support the world-class productions of Alley Theatre, one of the oldest and largest nonprofit resident theatres in the United States. With...

17/09/2025

Arch Platform Technologies Announces Strategic Collaborat...

Arch Platform Technologies (www.archpt.io), a pioneer in automated, scalable cloud infrastructure for high-performance workflows, today announced a Strategic Co...

17/09/2025

With over 39bn EUR in assets under management and record-...

Over 300 selected decision-makers from start-ups, corporates, and VC funds worldwide will gather for the third edition of the event, united by a single goal: to...

17/09/2025

Telestream Celebrates Award Win at IBC2025

Telestream, a global leader in media workflow technologies, is excited to announce that its flagship Vantage platform and its next-generation AI capabilities re...

17/09/2025

Mediagenix Celebrates Triple Best of Show Wins at IBC2025...

Mediagenix, a global leader in smart content solutions that profitably connect the right content to the right audience, proudly announces its three Best of Show...

17/09/2025

PlayBox Neo Appoints Transtel Universal as Top Reseller P...

In a move to further establish a firm foothold across South East Asia, PlayBox Neo, the well-respected name in broadcast playout and channel branding, has appoi...

17/09/2025

Wisycom Unveils Two New Solutions at IBC 2025

Wisycom, a global leader in advanced wireless audio solutions, announced two major wireless solutions at IBC 2025 (Stand 8.D30). This includes the Portable RF-o...

17/09/2025

Six Berklee Alumni Win Emmy Awards

Six Berklee Alumni Win Emmy Awards The recipients were recognized for their contributions to acclaimed programs Severance, The Studio, The Penguin, SNL50: The...

17/09/2025

Applications Open for Berklee in Santo Domingo

Applications Open for Berklee in Santo Domingo The weeklong contemporary music program will run January 5-10, 2026. By Colette Greenstein September 17, 2025 ...

17/09/2025

Ukrainian Students Find Creative Consonance' at Berklee Valencia

Ukrainian Students Find Creative Consonance' at Berklee Valencia Through ELIA's UAx Platform, six students from Kyiv joined Berklee Valencia for a week...

17/09/2025

Meet Kenna Hilburn, Avids New Incoming Chief Product Officer

Earlier this year Avid announced Kenna Hilburn as its new senior vice president of product. Recently Hilburn was promoted to Avids new Chief Product Officer, su...

17/09/2025

Fox TV Stations Join Madhive's Local Live Sports Marketplace

NEW YORK Madhive has announced that the Fox Television Stations have joined its Live Sports Marketplace....

17/09/2025

Sony Electronics Partners with Newhouse School at Syracuse University

SYRACUSE, N.Y. Sony Electronics has announced that it is partnering with the Newhouse School at Syracuse University to provide state-of-the-art equipment, hands...

17/09/2025

Roku's First TV Smart Projector Now Available in the U.S.

SAN JOSE, Calif. Roku has announced that the first smart projector using its Roku TV operating system, the Aurzen Roku TV Smart Projector D1R Cube, is now avail...

17/09/2025

Meet the Streamlabs Streaming Assistant, Accelerated by NVIDIA RTX

Today's creators are equal parts entertainer, producer and gamer, juggling game commentary, scene changes, replay clips, chat moderation and technical troub...

17/09/2025

FOR-A America Appoints Jo Aun to Lead U.S. Product Development

Jo Returns to FOR-A as Senior Manager of Product Management and Engineering...

16/09/2025

SVG All-Stars: Leigh Michaud, Manager, Remote Operations, ESPN

SVG All-Stars: Leigh Michaud, Manager, Remote Operations, ESPNThe UConn grad rose from ESPN's mailroom to become one of its most valuable ops leadersBy Bran...

16/09/2025

Live From IBC 2025: Friday's Latest From Halls 1-4, Outdoor Exhibits in Amsterdam

Live From IBC 2025: Friday's Latest From Halls 1-4, Outdoor Exhibits in Amst...

16/09/2025

Live From IBC 2025: Saturday's Latest From Halls 5-7 in Amsterdam

Live From IBC 2025: Saturday's Latest From Halls 5-7 in Amsterdam By SVG Staff Friday, September 12, 2025 - 17:00 Print This Story The SVG Europe and ...

16/09/2025

Live From IBC 2025: Sunday's Latest From Halls 8-10 in Amsterdam

Live From IBC 2025: Sunday's Latest From Halls 8-10 in Amsterdam By SVG Staff Saturday, September 13, 2025 - 17:00 Print This Story The SVG Europe and...

16/09/2025

Live From IBC 2025: Monday's Latest From Halls 11-14 in Amsterdam

Live From IBC 2025: Monday's Latest From Halls 11-14 in Amsterdam By SVG Staff Sunday, September 14, 2025 - 17:00 Print This Story The SVG Europe and ...

16/09/2025

Amazon Prime Video Picks Up Four Hours of Early-Round Masters Coverage in 2026

Amazon Prime Video Picks Up Four Hours of Early-Round Masters Coverage in 2026 By Jason Dachman, Editorial Director, U.S. Tuesday, September 16, 2025 - 10:15...

16/09/2025

VERSANT Inks Deal for League One Volleyball as Women's Sports Rights Slate Grows

VERSANT Inks Deal for League One Volleyball as Women's Sports Rights Slate G...

16/09/2025

ESPN VP, Corporate Communications, Katina Arnold Named SVP, Disney Advertising Communications

ESPN VP, Corporate Communications, Katina Arnold Named SVP, Disney Advertising C...

16/09/2025

IBC 2025 in Review: SVG Europe's Full Collection of Video Interviews From the Show Floor

IBC 2025 in Review: SVG Europe's Full Collection of Video Interviews From th...

16/09/2025

One Enterprise, One Mission: Aligning the Supply Chain to the Warfighter

At DSEI 2025, James Dunne of L3Harris Maritime UK chaired a panel on aligning the supply chain to the warfighter, where leaders discussed modernising support fo...

16/09/2025

Football and Back-to-School Dynamics Spark First Gains Since April for Traditional TV

College Football Scores Top Telecast in August with 16M+ Viewers on FOX, Followe...

16/09/2025

Index Exchange and Gracenote Team to Enhance Contextual Intelligence in Programmatic Streaming TV

Collaboration marks the first SSP integration of Gracenote IDs, enabling show-le...

16/09/2025

IBC2025 Attracts 43,858 Visitors

AMSTERDAM The organizers of IBC2025 are reporting that 43,858 visitors from more than 170 countries attended the event, which had more than 1,300 exhibitors and...

16/09/2025

Wooden Camera Releases Accessory Collection for FUJIFILMs...

Wooden Camera announces the release of its new Accessory Collection for the FUJIFILM GFX ETERNA 55. The highlights of this collection include vital power soluti...

16/09/2025

AntonBauer Launches Free Cloud Platform for Smarter Batte...

Anton/Bauer, a leading manufacturer of mobile power solutions for broadcast and cinematic equipment, has announced the launch of Anton/Bauer Fleet Management, a...

16/09/2025

Teradek Launches Prism Jetpack - A New Era of 5G Video Co...

Teradek, a leading provider of video transmission and live production solutions, today announced the launch of Prism Jetpack, a groundbreaking 5G video contribu...