Sony Pixel Power calrec Sony

TOPS of the Class: Decoding AI Performance on RTX AI PCs and Workstations

12/06/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for RTX PC users.

The era of the AI PC is here, and it's powered by NVIDIA RTX and GeForce RTX technologies. With it comes a new way to evaluate performance for AI-accelerated tasks, and a new language that can be daunting to decipher when choosing between the desktops and laptops available.

While PC gamers understand frames per second (FPS) and similar stats, measuring AI performance requires new metrics.

Coming Out on TOPS The first baseline is TOPS, or trillions of operations per second. Trillions is the important word here - the processing numbers behind generative AI tasks are absolutely massive. Think of TOPS as a raw performance metric, similar to an engine's horsepower rating. More is better.

Compare, for example, the recently announced Copilot+ PC lineup by Microsoft, which includes neural processing units (NPUs) able to perform upwards of 40 TOPS. Performing 40 TOPS is sufficient for some light AI-assisted tasks, like asking a local chatbot where yesterday's notes are.

But many generative AI tasks are more demanding. NVIDIA RTX and GeForce RTX GPUs deliver unprecedented performance across all generative tasks - the GeForce RTX 4090 GPU offers more than 1,300 TOPS. This is the kind of horsepower needed to handle AI-assisted digital content creation, AI super resolution in PC gaming, generating images from text or video, querying local large language models (LLMs) and more.

Insert Tokens to Play TOPS is only the beginning of the story. LLM performance is measured in the number of tokens generated by the model.

Tokens are the output of the LLM. A token can be a word in a sentence, or even a smaller fragment like punctuation or whitespace. Performance for AI-accelerated tasks can be measured in tokens per second.

Another important factor is batch size, or the number of inputs processed simultaneously in a single inference pass. As an LLM will sit at the core of many modern AI systems, the ability to handle multiple inputs (e.g. from a single application or across multiple applications) will be a key differentiator. While larger batch sizes improve performance for concurrent inputs, they also require more memory, especially when combined with larger models.

The more you batch, the more (time) you save. RTX GPUs are exceptionally well-suited for LLMs due to their large amounts of dedicated video random access memory (VRAM), Tensor Cores and TensorRT-LLM software.

GeForce RTX GPUs offer up to 24GB of high-speed VRAM, and NVIDIA RTX GPUs up to 48GB, which can handle larger models and enable higher batch sizes. RTX GPUs also take advantage of Tensor Cores - dedicated AI accelerators that dramatically speed up the computationally intensive operations required for deep learning and generative AI models. That maximum performance is easily accessed when an application uses the NVIDIA TensorRT software development kit (SDK), which unlocks the highest-performance generative AI on the more than 100 million Windows PCs and workstations powered by RTX GPUs.

The combination of memory, dedicated AI accelerators and optimized software gives RTX GPUs massive throughput gains, especially as batch sizes increase.

Text-to-Image, Faster Than Ever Measuring image generation speed is another way to evaluate performance. One of the most straightforward ways uses Stable Diffusion, a popular image-based AI model that allows users to easily convert text descriptions into complex visual representations.

With Stable Diffusion, users can quickly create and refine images from text prompts to achieve their desired output. When using an RTX GPU, these results can be generated faster than processing the AI model on a CPU or NPU.

That performance is even higher when using the TensorRT extension for the popular Automatic1111 interface. RTX users can generate images from prompts up to 2x faster with the SDXL Base checkpoint - significantly streamlining Stable Diffusion workflows.

ComfyUI, another popular Stable Diffusion user interface, added TensorRT acceleration last week. RTX users can now generate images from prompts up to 60% faster, and can even convert these images to videos using Stable Video Diffuson up to 70% faster with TensorRT.

TensorRT acceleration can be put to the test in the new UL Procyon AI Image Generation benchmark, which delivers speedups of 50% on a GeForce RTX 4080 SUPER GPU compared with the fastest non-TensorRT implementation.

TensorRT acceleration will soon be released for Stable Diffusion 3 - Stability AI's new, highly anticipated text-to-image model - boosting performance by 50%. Plus, the new TensorRT-Model Optimizer enables accelerating performance even further. This results in a 70% speedup compared with the non-TensorRT implementation, along with a 50% reduction in memory consumption.

Of course, seeing is believing - the true test is in the real-world use case of iterating on an original prompt. Users can refine image generation by tweaking prompts significantly faster on RTX GPUs, taking seconds per iteration compared with minutes on a Macbook Pro M3 Max. Plus, users get both speed and security with everything remaining private when running locally on an RTX-powered PC or workstation.

The Results Are in and Open Sourced But don't just take our word for it. The team of AI researchers and engineers behind the open-source Jan.ai recently integrated TensorRT-LLM into its local chatbot app, then tested these optimizations for themselves.

Source: Jan.ai The researchers tested its implementation of TensorRT-LLM against the open-source llama.cpp inference engine across a variety of GPUs and CPUs used by the community. They found that TensorRT is 30-70% faster than llam
LINK: https://blogs.nvidia.com/blog/ai-decoded-tops/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

05/02/2026

Teads, Google TV Partner To Grow CTV HomeScreen Ad Availability

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Advanced Systems Group Appoints Industry Veteran Derek Pe...

Advanced Systems Group, LLC (ASG), a technology and services provider for media creatives and content owners, announced the appointment of Derek Pezzotti as Sen...

05/02/2026

Taurus Technologies Elevates Podcast Production with Brig...

Taurus Technologies, a Dallas-area professional AV systems integrator, has upgraded its in-house podcast studio with Brightline Lighting's AV/720 low-voltag...

05/02/2026

NBC Sports Selects Production Infrastructure and Signal P...

NBC Universal to Present XXV Olympic Winter Games Feb. 6-22 and Milan Cortina Paralympics March 6-15 NBC Sports to Utilize Grass Valley's Frame Rate Conver...

05/02/2026

Atomos Unveils All New Shogun AV-19

Atomos today announced Shogun AV-19, a rack-mountable, 19-inch 4K HDR monitor-recorder-switcher designed for professional live production, broadcast, and video ...

05/02/2026

Vizrt revolutionizes corporate communications with AI-pow...

Vizrt, the leader in live production technology, revolutionizing viewer experience and engagement, today introduces two brand new solutions in partnership with ...

05/02/2026

Appear Appoints Simon Frost as Chief Marketing Officer to...

Appear, a global leader in live production technology, today announced the appointment of Simon Frost in a newly created role as Chief Marketing Officer (CMO). ...

05/02/2026

Noah Chamis ICLS Illuminates Only Murders in the Building...

New York gaffer Noah Chamis, ICLS ( You Deserve Each Other , The Half of It , Project Runway ) practices a mix of technical precision and creative play in his...

05/02/2026

NBC Sports Deploys Audio-Technica Microphones for Winter Olympics

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Hemisphere Media Group, Entravision Launch WAPA Orlando

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

SMT Providing Timing And Production Data Services for Winter Olympics

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

GeForce NOW Celebrates Six Years of Streaming With 24 Games in February

Break out the cake and green sprinkles - GeForce NOW is turning six. Since launch, members have streamed over 1 billion hours, and the party's just getting...

04/02/2026

Save the Date: SVG Regional Sports Production Summit Heads to Denver June 29-30

The 11th-annual Summit will not only the unprecedented headwinds facing the business, but also the groundbreaking opportunities for the future....

04/02/2026

2026 Grammy Awards Audio Team Collaborates for Live Broadcast

Just moments before the 2026 Grammy Awards kicked off, members of the event's audio team assembled for a group photo at the base of the stage inside Los Ang...

04/02/2026

Riedel Connects Live Surgery and Medical Professionals at VISAR 2025

At the Vienna Interdisciplinary Symposium on Aortic Repair (VISAR), Riedel Communications' Managed Technology Division delivered a turnkey technical infrast...

04/02/2026

Mountain West Announces New Media Rights Package in Collaboration with CBS Sports, FOX Sports, The CW Network, and Kiswe

The Mountain West Conference announces a new media rights package featuring CBS ...

04/02/2026

NFL 2026 International Games Announced in Madrid, Mexico, and Paris

Earlier this week, the NFL announced it would play regular season games in Madrid, Paris, and Mexico City in 2026 as part of a nine-game international schedule,...

04/02/2026

Super Bowl Halftime Show Preview: PA Speakers on Wheels Return for Football-Meets-Music Event

Custom-built carts carry music speakers for Apple Music Super Bowl LX Halftime S...

04/02/2026

Wireless Audio, Part 1: RF Does More With Less, Because It Has To

In an era of constrained spectrum, two tactics have emerged: work closely with regulatory bodies and utilize engineering chops The Federal Communications Commi...

04/02/2026

Wireless Audio, Part 2: RF Mics Have a Key Role in Sports Broadcasting

Three examples of how wireless microphones are deployed to bring fans in deep and up close Microphone manufacturers have myriad ways to put wireless to work fo...

04/02/2026

Sennheiser Is Moving to Music City

The mic manufacturer will join other suppliers in the new Rock Nashville production campus Sennheiser is relocating its U.S. headquarters from its long-time lo...

04/02/2026

Release Rundown: What to Watch in February, From Jimpa to Queen of Chess

Olivia Colman and John Lithgow appear in Jimpa by Sophie Hyde, an official selection of the 2025 Sundance Film Festival. (Courtesy of Sundance Institute | pho...

04/02/2026

Get More From Lyrics on Spotify With These 3 Upgrades

Lyrics are one of Spotify's most popular features, giving fans a richer way to experience the music and artists they love. They're viewed hundreds of mi...

04/02/2026

Queer Renegades: SBS Audio's new podcast reclaiming Australia's queer history

Queer Renegades: SBS Audio's new podcast reclaiming Australia's queer hi...

04/02/2026

NFVF ANNECY 2026 SOUTH AFRICAN ANIMATION SHOWCASE CALL

The National Film and Video Foundation (NFVF) invites final-year animation students to participate in an exclusive creative showcase at the Annecy International...

04/02/2026

Viper Shield Flight Tests Accelerate Delivery with New Digital Electronic Warfare Capability

Viper Shield's robust phase of flight-testing production representative hard...

04/02/2026

Nielsen launches co-viewing pilot program to further enhance TV measurement

Pilot To Launch with Super Bowl LX on February 8 and Continue with High Profile Live Events, Entertainment and Sports Nielsen to Use State of the Art Wearable...

04/02/2026

Vinten Extends VEGA Platform with VEGA Lite PTZ Control S...

New control solution applies broadcast robotics workflows to PTZ cameras with third-party integration and upgrade paths Vinten, a global leader in robotic cam...

04/02/2026

Vinten Launches Vega Lite PTZ Control System

Share Copy link Facebook X Linkedin Bluesky Email...

04/02/2026

Chyron to Provide Graphics, Virtual Sets for Winter Olympics Coverage

Share Copy link Facebook X Linkedin Bluesky Email...

04/02/2026

NBC Sports Taps Appear for 2026 Winter Olympics Production

Share Copy link Facebook X Linkedin Bluesky Email...

04/02/2026

Katie Vitolins Announced as Vice President of Alumni Products and Services

Katie Vitolins Announced as Vice President of Alumni Products and Services An alumna and former trustee, Vitolins will lead the relaunch of Berklee's alum...

04/02/2026

Full cast announced for Saturday Night Live UK, coming to Sky and NOW 21 March 2026

Wednesday 4 February 2026 Full cast announced for Saturday Night Live UK, comin...

04/02/2026

Fear of running out of mobile data (FORO) is a real issue for UK businesses that lose over 3,400 a year

Wednesday 4 February 2026 Fear of running out of mobile data (FORO) is a real i...

04/02/2026

Rohde & Schwarz powers next generation television in Brazil with DTV+ technology for Globo

Rohde & Schwarz powers next generation television in Brazil with DTV technology...

04/02/2026

Netflix Shares Teaser of the Third and Final Season of 'Knokke Off'

Back to All News Netflix Shares Teaser of the Third and Final Season of Knokke Off Entertainment 04 February 2026 GlobalNetherlands Link copied to clipboar...

04/02/2026

Netflix Premieres 'The TikTok Killer' on March 6

Back to All News Netflix Premieres The TikTok Killer on March 6 Entertainment 04 February 2026 GlobalSpain Link copied to clipboard Download the pictures ...

04/02/2026

Music Is Having a Golden Moment on Netflix as New and Nostalgic Songs Storm the Charts

Back to All News Music Is Having a Golden Moment on Netflix as New and Nostal...

04/02/2026

Fox Corporation Reports Second Quarter Fiscal 2026 Financial Results

Fox Corporation Reports Second Quarter Fiscal 2026 Financial Results NEW YORK, NY, February 4, 2026 - Fox Corporation (Nasdaq: FOXA, FOX; FOX or the Compan...

04/02/2026

Lucas P. Aragn Joins FOX Advertising as Senior Vice President, Creative

Lucas P. Arag n Joins FOX Advertising as Senior Vice President, Creative New York, NY - February 4, 2026 - Accomplished Creative Executive Lucas P. Arag n has...

04/02/2026

RT Statement on Home of the Year and The Great House Revival

Following the passing of our friend and colleague Hugh Wallace, and with the full support of his family, RT will proceed with the broadcast of the new series o...

04/02/2026

Nemotron Labs: How AI Agents Are Turning Documents Into Real-Time Business Intelligence

Editor's note: This post is part of the Nemotron Labs blog series, which exp...

03/02/2026

Tagboard's New Partner Development Kit Turns Complex Third-Party Integrations into Instant Graphics

Tagboard, a modern, interactive graphics system for news, sports, and entertainm...

03/02/2026

SNS Launches New S3-Compatible Cloud Storage Service

Studio Network Solutions (SNS) announces the launch of Trio, a new S3-compatible cloud storage service fully integrated with EVO for media backup, archival, and...

03/02/2026

NEP Group Running at Full-Scale This Month in Support of Major International Events

50 Production Trucks at center of 160 U.S.-based productions...

03/02/2026

Nielsen Launches Co-Viewing Pilot Program to Further Enhance TV Measurement

Nielsen, which specializes in audience measurement, data, and media intelligence, announces that it is piloting a new methodology enhancement to more accurately...