TOPS of the Class: Decoding AI Performance on RTX AI PCs and Workstations

12/06/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for RTX PC users.

The era of the AI PC is here, and it's powered by NVIDIA RTX and GeForce RTX technologies. With it comes a new way to evaluate performance for AI-accelerated tasks, and a new language that can be daunting to decipher when choosing between the desktops and laptops available.

While PC gamers understand frames per second (FPS) and similar stats, measuring AI performance requires new metrics.

Coming Out on TOPS

The first baseline is TOPS, or trillions of operations per second. Trillions is the important word here - the processing numbers behind generative AI tasks are absolutely massive. Think of TOPS as a raw performance metric, similar to an engine's horsepower rating. More is better.

Compare, for example, the recently announced Copilot+ PC lineup by Microsoft, which includes neural processing units (NPUs) able to perform upwards of 40 TOPS. Performing 40 TOPS is sufficient for some light AI-assisted tasks, like asking a local chatbot where yesterday's notes are.

But many generative AI tasks are more demanding. NVIDIA RTX and GeForce RTX GPUs deliver unprecedented performance across all generative tasks - the GeForce RTX 4090 GPU offers more than 1,300 TOPS. This is the kind of horsepower needed to handle AI-assisted digital content creation, AI super resolution in PC gaming, generating images from text or video, querying local large language models (LLMs) and more.
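To put those two figures side by side, here is a minimal sketch that converts TOPS into raw operations per second and compares the quoted numbers. The 1,300 and 40 values are the article's rounded floors ("more than" and "upwards of"), so the ratio is a rough guide to peak capability rather than a measured result, and the function names are illustrative only.

```python
# Rough comparison of peak TOPS figures quoted in the article.
# TOPS = trillions of operations per second, so 1 TOPS = 1e12 ops/s.

def tops_to_ops_per_second(tops: float) -> float:
    """Convert a TOPS rating into raw operations per second."""
    return tops * 1e12


def tops_ratio(tops_a: float, tops_b: float) -> float:
    """Return how many times more peak operations per second A offers than B."""
    if tops_b <= 0:
        raise ValueError("tops_b must be positive")
    return tops_a / tops_b


# Rounded figures from the article text, not benchmark measurements.
RTX_4090_TOPS = 1300
COPILOT_PLUS_NPU_TOPS = 40

ratio = tops_ratio(RTX_4090_TOPS, COPILOT_PLUS_NPU_TOPS)
print(f"Peak TOPS ratio: {ratio:.1f}x")  # Peak TOPS ratio: 32.5x
```

Like horsepower, this is a ceiling rather than a guarantee: real workloads also depend on memory and software, which the following sections cover.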

Insert Tokens to Play

TOPS is only the beginning of the story. LLM performance is measured in the number of tokens generated by the model.

Tokens are the output of the LLM. A token can be a word in a sentence, or even a smaller fragment like punctuation or whitespace. Performance for AI-accelerated tasks can be measured in tokens per second.
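As a minimal sketch of the metric, tokens per second is simply the number of generated tokens divided by the wall-clock time taken to produce them. The `generate` callable below is a hypothetical stand-in for whatever produces tokens in a real application; it is not the API of any particular LLM runtime.

```python
import time
from typing import Callable, List


def tokens_per_second(num_tokens: int, elapsed_seconds: float) -> float:
    """Throughput: generated tokens divided by wall-clock seconds."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_tokens / elapsed_seconds


def measure(generate: Callable[[], List[str]]) -> float:
    """Time one call to a token-producing function and report tokens per second."""
    start = time.perf_counter()
    tokens = generate()
    elapsed = time.perf_counter() - start
    return tokens_per_second(len(tokens), elapsed)


def fake_generate() -> List[str]:
    """Hypothetical stand-in for a model: sleeps briefly, then returns word-level tokens."""
    time.sleep(0.01)
    return "The era of the AI PC is here".split()


print(f"{tokens_per_second(120, 4.0):.1f} tokens/s")  # 30.0 tokens/s
print(f"{measure(fake_generate):.0f} tokens/s (simulated)")
```

Real tokenizers usually split text into sub-word fragments rather than whole words, so token counts from a real model will differ from the word split used here.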

Another important factor is batch size, or the number of inputs processed simultaneously in a single inference pass. As an LLM will sit at the core of many modern AI systems, the ability to handle multiple inputs (e.g. from a single application or across multiple applications) will be a key differentiator. While larger batch sizes improve performance for concurrent inputs, they also require more memory, especially when combined with larger models.
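The trade-off between batch size and memory can be sketched with some back-of-the-envelope arithmetic. This is a simplified model under stated assumptions: each inference pass emits one token per sequence in the batch, and memory use is the model weights plus a fixed amount per sequence. The 8GB weights and 0.5GB per-sequence figures are hypothetical placeholders, not measurements of any real model.

```python
def aggregate_tokens_per_second(batch_size: int, step_seconds: float) -> float:
    """Tokens per second across the batch, assuming one token per sequence per pass."""
    if batch_size < 1 or step_seconds <= 0:
        raise ValueError("batch_size must be >= 1 and step_seconds > 0")
    return batch_size / step_seconds


def estimated_memory_gb(weights_gb: float, per_sequence_gb: float, batch_size: int) -> float:
    """Simplified memory model: fixed weights plus a per-sequence cost."""
    return weights_gb + per_sequence_gb * batch_size


def largest_batch_that_fits(vram_gb: float, weights_gb: float, per_sequence_gb: float) -> int:
    """Largest batch size whose estimated memory fits in the available VRAM."""
    free_gb = vram_gb - weights_gb
    if free_gb < 0:
        return 0
    return int(free_gb // per_sequence_gb)


# Hypothetical model: 8GB of weights, 0.5GB of working memory per sequence.
WEIGHTS_GB = 8.0
PER_SEQUENCE_GB = 0.5

for batch in (1, 4, 8):
    mem = estimated_memory_gb(WEIGHTS_GB, PER_SEQUENCE_GB, batch)
    print(f"batch={batch}: ~{mem:.1f}GB")

# 24GB is the article's upper VRAM figure for GeForce RTX GPUs.
print(largest_batch_that_fits(24.0, WEIGHTS_GB, PER_SEQUENCE_GB))  # 32
```

In practice the time per pass also grows somewhat with batch size, so aggregate throughput rises less than linearly; the sketch ignores that effect.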

The more you batch, the more (time) you save.

RTX GPUs are exceptionally well-suited for LLMs due to their large amounts of dedicated video random access memory (VRAM), Tensor Cores and TensorRT-LLM software.

GeForce RTX GPUs offer up to 24GB of high-speed VRAM, and NVIDIA RTX GPUs up to 48GB, which can handle larger models and enable higher batch sizes. RTX GPUs also take advantage of Tensor Cores - dedicated AI accelerators that dramatically speed up the computationally intensive operations required for deep learning and generative AI models. That maximum performance is easily accessed when an application uses the NVIDIA TensorRT software development kit (SDK), which unlocks the highest-performance generative AI on the more than 100 million Windows PCs and workstations powered by RTX GPUs.

The combination of memory, dedicated AI accelerators and optimized software gives RTX GPUs massive throughput gains, especially as batch sizes increase.

Text-to-Image, Faster Than Ever

Measuring image generation speed is another way to evaluate performance. One of the most straightforward ways uses Stable Diffusion, a popular image-based AI model that allows users to easily convert text descriptions into complex visual representations.

With Stable Diffusion, users can quickly create and refine images from text prompts to achieve their desired output. When using an RTX GPU, these results can be generated faster than processing the AI model on a CPU or NPU.

That performance is even higher when using the TensorRT extension for the popular Automatic1111 interface. RTX users can generate images from prompts up to 2x faster with the SDXL Base checkpoint - significantly streamlining Stable Diffusion workflows.

ComfyUI, another popular Stable Diffusion user interface, added TensorRT acceleration last week. RTX users can now generate images from prompts up to 60% faster, and can even convert these images to videos using Stable Video Diffusion up to 70% faster with TensorRT.

TensorRT acceleration can be put to the test in the new UL Procyon AI Image Generation benchmark, where it delivers speedups of 50% on a GeForce RTX 4080 SUPER GPU compared with the fastest non-TensorRT implementation.

TensorRT acceleration will soon be released for Stable Diffusion 3 - Stability AI's new, highly anticipated text-to-image model - boosting performance by 50%. Plus, the new TensorRT-Model Optimizer enables accelerating performance even further. This results in a 70% speedup compared with the non-TensorRT implementation, along with a 50% reduction in memory consumption.
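To translate these percentages into waiting time, here is a small sketch. It assumes that "X% faster" means throughput is multiplied by 1 + X/100, so a 50% speedup cuts a 30-second job to 20 seconds and "2x faster" (a 100% speedup) halves it. That reading is an assumption; the article does not define the term. The 30-second baseline is a hypothetical example, not a benchmark result.

```python
def time_after_speedup(baseline_seconds: float, speedup_percent: float) -> float:
    """Time per image after a speedup, reading 'X% faster' as (1 + X/100)x throughput."""
    if speedup_percent <= -100:
        raise ValueError("speedup_percent must be greater than -100")
    return baseline_seconds / (1 + speedup_percent / 100)


def memory_after_reduction(baseline_gb: float, reduction_percent: float) -> float:
    """Memory use after a percentage reduction."""
    return baseline_gb * (1 - reduction_percent / 100)


BASELINE_SECONDS = 30.0  # hypothetical time per image without TensorRT

for percent in (50, 60, 70, 100):
    seconds = time_after_speedup(BASELINE_SECONDS, percent)
    print(f"{percent}% faster: {seconds:.1f}s per image")
```

Under this reading, a 70% speedup saves less time than the number might suggest: the job still takes roughly 59% of the original time, not 30%.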

Of course, seeing is believing - the true test is in the real-world use case of iterating on an original prompt. Users can refine image generation by tweaking prompts significantly faster on RTX GPUs, taking seconds per iteration compared with minutes on a MacBook Pro M3 Max. Plus, users get both speed and security with everything remaining private when running locally on an RTX-powered PC or workstation.

The Results Are in and Open Sourced

But don't just take our word for it. The team of AI researchers and engineers behind the open-source Jan.ai recently integrated TensorRT-LLM into its local chatbot app, then tested these optimizations for themselves.

Source: Jan.ai

The researchers tested its implementation of TensorRT-LLM against the open-source llama.cpp inference engine across a variety of GPUs and CPUs used by the community. They found that TensorRT is 30-70% faster than llam
LINK: https://blogs.nvidia.com/blog/ai-decoded-tops/...