Sony Pixel Power calrec Sony

TOPS of the Class: Decoding AI Performance on RTX AI PCs and Workstations

12/06/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for RTX PC users.

The era of the AI PC is here, and it's powered by NVIDIA RTX and GeForce RTX technologies. With it comes a new way to evaluate performance for AI-accelerated tasks, and a new language that can be daunting to decipher when choosing between the desktops and laptops available.

While PC gamers understand frames per second (FPS) and similar stats, measuring AI performance requires new metrics.

Coming Out on TOPS The first baseline is TOPS, or trillions of operations per second. Trillions is the important word here - the processing numbers behind generative AI tasks are absolutely massive. Think of TOPS as a raw performance metric, similar to an engine's horsepower rating. More is better.

Compare, for example, the recently announced Copilot+ PC lineup by Microsoft, which includes neural processing units (NPUs) able to perform upwards of 40 TOPS. Performing 40 TOPS is sufficient for some light AI-assisted tasks, like asking a local chatbot where yesterday's notes are.

But many generative AI tasks are more demanding. NVIDIA RTX and GeForce RTX GPUs deliver unprecedented performance across all generative tasks - the GeForce RTX 4090 GPU offers more than 1,300 TOPS. This is the kind of horsepower needed to handle AI-assisted digital content creation, AI super resolution in PC gaming, generating images from text or video, querying local large language models (LLMs) and more.

Insert Tokens to Play TOPS is only the beginning of the story. LLM performance is measured in the number of tokens generated by the model.

Tokens are the output of the LLM. A token can be a word in a sentence, or even a smaller fragment like punctuation or whitespace. Performance for AI-accelerated tasks can be measured in tokens per second.

Another important factor is batch size, or the number of inputs processed simultaneously in a single inference pass. As an LLM will sit at the core of many modern AI systems, the ability to handle multiple inputs (e.g. from a single application or across multiple applications) will be a key differentiator. While larger batch sizes improve performance for concurrent inputs, they also require more memory, especially when combined with larger models.

The more you batch, the more (time) you save. RTX GPUs are exceptionally well-suited for LLMs due to their large amounts of dedicated video random access memory (VRAM), Tensor Cores and TensorRT-LLM software.

GeForce RTX GPUs offer up to 24GB of high-speed VRAM, and NVIDIA RTX GPUs up to 48GB, which can handle larger models and enable higher batch sizes. RTX GPUs also take advantage of Tensor Cores - dedicated AI accelerators that dramatically speed up the computationally intensive operations required for deep learning and generative AI models. That maximum performance is easily accessed when an application uses the NVIDIA TensorRT software development kit (SDK), which unlocks the highest-performance generative AI on the more than 100 million Windows PCs and workstations powered by RTX GPUs.

The combination of memory, dedicated AI accelerators and optimized software gives RTX GPUs massive throughput gains, especially as batch sizes increase.

Text-to-Image, Faster Than Ever Measuring image generation speed is another way to evaluate performance. One of the most straightforward ways uses Stable Diffusion, a popular image-based AI model that allows users to easily convert text descriptions into complex visual representations.

With Stable Diffusion, users can quickly create and refine images from text prompts to achieve their desired output. When using an RTX GPU, these results can be generated faster than processing the AI model on a CPU or NPU.

That performance is even higher when using the TensorRT extension for the popular Automatic1111 interface. RTX users can generate images from prompts up to 2x faster with the SDXL Base checkpoint - significantly streamlining Stable Diffusion workflows.

ComfyUI, another popular Stable Diffusion user interface, added TensorRT acceleration last week. RTX users can now generate images from prompts up to 60% faster, and can even convert these images to videos using Stable Video Diffuson up to 70% faster with TensorRT.

TensorRT acceleration can be put to the test in the new UL Procyon AI Image Generation benchmark, which delivers speedups of 50% on a GeForce RTX 4080 SUPER GPU compared with the fastest non-TensorRT implementation.

TensorRT acceleration will soon be released for Stable Diffusion 3 - Stability AI's new, highly anticipated text-to-image model - boosting performance by 50%. Plus, the new TensorRT-Model Optimizer enables accelerating performance even further. This results in a 70% speedup compared with the non-TensorRT implementation, along with a 50% reduction in memory consumption.

Of course, seeing is believing - the true test is in the real-world use case of iterating on an original prompt. Users can refine image generation by tweaking prompts significantly faster on RTX GPUs, taking seconds per iteration compared with minutes on a Macbook Pro M3 Max. Plus, users get both speed and security with everything remaining private when running locally on an RTX-powered PC or workstation.

The Results Are in and Open Sourced But don't just take our word for it. The team of AI researchers and engineers behind the open-source Jan.ai recently integrated TensorRT-LLM into its local chatbot app, then tested these optimizations for themselves.

Source: Jan.ai The researchers tested its implementation of TensorRT-LLM against the open-source llama.cpp inference engine across a variety of GPUs and CPUs used by the community. They found that TensorRT is 30-70% faster than llam
LINK: https://blogs.nvidia.com/blog/ai-decoded-tops/...
See more stories from nvidia

North America Stories

31/10/2025

FanDuel Sports Network To Deliver Selected Live NBA, NHL Games to Major Streaming Services for In-Market Viewers

FanDuel Sports Network To Deliver Selected Live NBA, NHL Games to Major Streamin...

31/10/2025

NBC Jumps Out of the Gate in Extended Breeder's Cup Deal With Dual Drones, Jockey Cams, RF Super-Mo

NBC Jumps Out of the Gate in Extended Breeder's Cup Deal With Dual Drones, J...

31/10/2025

Nexstar Extends Chairman and CEO Perry Sook Through 2029

IRVING, Texas As station groups move into an era that promises rapid tech, regulatory and economic changes, Nexstar Media Group said its board has extended chai...

31/10/2025

Late Night Thrives on Social Media With Billions of Views in 2025

While some analysts have questioned the ongoing economic viability of broacast-TV late night shows amid ongoing declines in linear viewing, new data from Tubula...

31/10/2025

Disney Programming Dropped From YouTube TV

The contentious contract negotiations between The Walt Disney Co. and YouTube TV have resulted in a blackout of Disney-owned programming on the pay TV operator....

31/10/2025

tvONE Integrates CALICO PRO Video Processing With Matrox ConvertIP Series

CINCINNATI Video conversion and AV signal distribution specialist tvONE and Matrox Video have struck a strategic partnership, combining CALICO PRO's video p...

31/10/2025

IAB Urges Standards for CTV Ad Measurement

NEW YORK The Interactive Advertising Bureau (IAB) today released a new industry guide that discusses the urgency of adopting new standards that will help advert...

31/10/2025

Late Night Shows Thrive on Social Media with Billions of Views in 2025

While some analysts have questioned the ongoing economic viability of late night shows on broadcast TV amid ongoing declines in linear viewing, new data from Tu...

31/10/2025

Berklee Celebrates the Inauguration of President Jim Lucchese

Berklee Celebrates the Inauguration of President Jim Lucchese In his inaugural address, Lucchese highlighted Berklee's power to connect, create, and heal ...

31/10/2025

Family, Food, and Films: Netflix's 'Dining with the Kapoors' Arrives November 21

Back to All News Family, Food, and Films: Netflix's Dining with the Kapoors...

31/10/2025

Korea Joins AI Industrial Revolution: NVIDIA CEO Jensen Huang Unveils Historic Partnership at APEC Summit

Amidst Gyeongju, South Korea's ancient temples and modern skylines, Jensen H...

30/10/2025

Midwich Secures UK & Ireland Distribution Deal with X2O Media To Revolutionize Hybrid Learning

Midwich has signed a UK and Ireland distribution deal with X2O Media, a worldwid...

30/10/2025

SVG Students To Watch: Sam Newitt, Kansas State University

SVG Students To Watch: Sam Newitt, Kansas State UniversityThe South Dakota native thrives in many roles behind the scenes at K-StateHD.TVBy Brandon Costa, Direc...

30/10/2025

SVG Sit-Down: Swerve Sports' Christy Tanner Explores the Young FAST Channel's Early Success

SVG Sit-Down: Swerve Sports' Christy Tanner Explores the Young FAST Channel&...

30/10/2025

SVG Campus Shot Callers: Andy Liebsch, Senior Director, Video Services, Kansas State University

SVG Campus Shot Callers: Andy Liebsch, Senior Director, Video Services, Kansas S...

30/10/2025

Diversified Names Paul Lidsky CEO, Expanding Leadership Role After Serving as Board Chairman

Diversified Names Paul Lidsky CEO, Expanding Leadership Role After Serving as Bo...

30/10/2025

NBA, Cosm Enter Long-Term Partnership for Shared Reality Production, Distribution

NBA, Cosm Enter Long-Term Partnership for Shared Reality Production, Distributio...

30/10/2025

FanDuel Sports Network to Deliver Select Live NBA, NHL Games to Major Streaming Services for In-Market Viewers

FanDuel Sports Network to Deliver Select Live NBA, NHL Games to Major Streaming ...

30/10/2025

If I Had Legs, I'd Kick You, East of Wall, and More Sundance Institute-Supported Films Nominated for 35th Gotham Awards

As the year comes to a close, we can feel the invigorating wind sweeping in for ...

30/10/2025

Give Me the Backstory: Get to Know Max Walker-Silverman, the Writer-Director of Rebuilding

By Bailey Pennick One of the most exciting things about the Sundance Film Festi...

30/10/2025

Remarks for the 2025 APEC CEO Roundtable

Jon Rambeau, President of Integrated Mission Systems at L3Harris Technologies, speaks about industrial collaboration at the Asia-Pacific Economic Cooperation (A...

30/10/2025

L3Harris Technologies Reports Strong Third Quarter 2025 Results, Increases 2025 Guidance

MELBOURNE, Fla., October 30, 2025 - L3Harris Technologies (NYSE: LHX) reports th...

30/10/2025

FCC's Brendan Carr Issues Draft Proposal for More C-Band Spectrum Sales

WASHINGTON Federal Communications Commission Chair Brendan Carr said he has circulated a proposal for the agency to auction additional midband spectrum in the U...

30/10/2025

Diversified Names Paul Lidsky as CEO

PLANO, Texas Technology solutions provider Diversified has named Paul Lidsky as CEO, tasked with guiding the company's next stage of growth, driving market ...

30/10/2025

Interra Adds Stream Recording, BATON Integration to ORION

CUPERTINO, Calif. Interra Systems today unveiled ORION stream recording support and seamless integration with BATON Media Player, a combination that lets broadc...

30/10/2025

InterDigital Buys AI-Driven Video Codec Startup Deep Render

WILMINGTON, Del. InterDigital today announced the acquisition of Deep Render, an artificial intelligence startup with a team of AI experts focused on video code...

30/10/2025

TAG Video Systems Earns Two ESG Recognitions

NEW YORK TAG Video Systems has earned a higher-rated Digital Product Passport (DPP) Committed to Sustainability badge and the Aclymate Climate Wise Silver Tier ...

30/10/2025

Nexstar Extends Employment Agreement with Perry Sook Through 2029

IRVING, Texas As station groups move into an era that promises rapid tech, regulatory and economic changes, the Nexstar Media Group, Inc. has announced that its...

30/10/2025

Samba TV: 60% Of TV Time Spent Viewing Streaming Content

Television viewers are spending more time watching streaming content than linear TV, but sports continues to be a bright spot for broadcasters, according to the...

30/10/2025

Operative Media Names Mike Napadano as CEO

NEW YORK Advertising technology company Operative Media has named Mike Napadano as its new CEO....

30/10/2025

Walmart Selects Marshall Cameras to Power New Campus Broa...

Walmart Inc. has chosen Marshall Electronics cameras for use across its brand-new corporate campus studios and event center. The installation includes Marshall ...

30/10/2025

NETGEAR Academy Expands Into Industry-Wide IP Training Pl...

NETGEAR, Inc. (NASDAQ: NTGR), a global leader in intelligent networking solutions designed to power extraordinary experiences, today announced the launch of its...

30/10/2025

Clear-Com Gen-IC Virtual Intercom Connects Students World...

Clear-Com recently contributed its award-winning Gen-IC virtual intercom solution to power real-time communications for On-Air Student TV, a 24-hour global st...

30/10/2025

Maxon Strengthens Growth Strategy with Appointment of Kse...

Maxon, maker of powerful, approachable software solutions for creators working in 2D and 3D design, motion graphics, visual effects, and more, today announced t...

30/10/2025

Studio Technologies Dante Enabled Model 394 GPI Interface...

Studio Technologies, a leading manufacturer of high-quality audio, video, and fiber-optic solutions, announces that its new Model 394 GPI Interface and Model 39...

30/10/2025

Astro selects Broadpeak for high performance streaming an...

Broadpeak , a leader in streaming and monetization at scale, has been selected by leading Malaysian content and entertainment company Astro to enable two major ...

30/10/2025

Riedel Communications Appoints Ulrich Voigt as Director L...

Riedel Communications is pleased to announce that Ulrich Voigt has joined the company as Director Live Production Solutions, taking over the SimplyLive business...

30/10/2025

LiveU and Kinetiq Launch Cloud Native Watermarking Integr...

LiveU, the global leader in live IP-video contribution, production, and distribution, today announced a new partnership with Kinetiq, the AI-powered platform un...

30/10/2025

FCC Plans Nov. 20 Open Meeting, Provides Shutdown Update

WASHINGTON Federal Communications Commission Chair Brendan Carr has called for an end to the government shutdown while providing some updates on the agency'...

30/10/2025

Carr Issues Draft Proposal for More C-Band Spectrum Sales

WASHINGTON Federal Communications Commission Chair Brendan Carr has announced that he has circulated a proposal for the FCC to auction additional mid-band spect...

30/10/2025

October 29, 2025

Scripps Research professor awarded $3.2 million to advance type 1 diabetes research Support from the National Institute of Diabetes and Digestive and Kidney Dis...

30/10/2025

AI-Powered Mobile Clinics Deliver Breast Cancer Screening to India's Rural Communities

An unassuming van driving around rural India uses powerful AI technology that...

30/10/2025

Join the Resistance: ARC Raiders' Launches in the Cloud

Get ready, raiders - the wait is over. ARC Raiders is dropping onto GeForce NOW and bringing the fight from orbit to the screen. To celebrate the launch, gamer...

29/10/2025

MLS, EDGE Sound Research To Debut Immersive Embodied Sound' at LAFC vs. Austin FC Playoff Match

MLS, EDGE Sound Research To Debut Immersive Embodied Sound' at LAFC vs. Aus...

29/10/2025

SVG Remote Production Forum 2025: All Sessions Now Available to Watch on SVG PLAY

SVG Remote Production Forum 2025: All Sessions Now Available to Watch on SVG PLA...

29/10/2025

World Series 2025: How Audio Is Transported Around the Sites and Beyond

World Series 2025: How Audio Is Transported Around the Sites and BeyondThe signals also move not just between two countries but around the globeBy Dan Daley, Au...

29/10/2025

Inside the Archives: Celebrating Archives Month Through Sundance Film Festival Films

A still from 306 Hollywood, a film by sibling filmmakers Jonathan Bogar n and El...

29/10/2025

Riedel Names Ulrich Voigt Director of Live Production Solutions

WUPPERTAL, Germany Riedel Communications has hired Ulrich Voigt as director, live production solutions, taking over the leadership of its SimplyLive business fr...

29/10/2025

Sinclair Taps Mark Martin to Lead Stations in Oklahoma

OKLAHOMA CITY and TULSA, Okla. Sinclair has named Mark Martin as vice president and general manager of KOKH-KOCB Oklahoma City and KTUL Tulsa....