Sony Pixel Power calrec Sony

Unlocking Peak Generations: TensorRT Accelerates AI on RTX PCs and Workstations

27/03/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and which showcases new hardware, software, tools and accelerations for RTX PC users.

As generative AI advances and becomes widespread across industries, the importance of running generative AI applications on local PCs and workstations grows. Local inference gives consumers reduced latency, eliminates their dependency on the network and enables more control over their data.

NVIDIA GeForce and NVIDIA RTX GPUs feature Tensor Cores, dedicated AI hardware accelerators that provide the horsepower to run generative AI locally.

Stable Video Diffusion is now optimized for the NVIDIA TensorRT software development kit, which unlocks the highest-performance generative AI on the more than 100 million Windows PCs and workstations powered by RTX GPUs.

Now, the TensorRT extension for the popular Stable Diffusion WebUI by Automatic1111 is adding support for ControlNets, tools that give users more control to refine generative outputs by adding other images as guidance.

TensorRT acceleration can be put to the test in the new UL Procyon AI Image Generation benchmark, which internal tests have shown accurately replicates real-world performance. It delivered speedups of 50% on a GeForce RTX 4080 SUPER GPU compared with the fastest non-TensorRT implementation.

More Efficient and Precise AI TensorRT enables developers to access the hardware that provides fully optimized AI experiences. AI performance typically doubles compared with running the application on other frameworks.

It also accelerates the most popular generative AI models, like Stable Diffusion and SDXL. Stable Video Diffusion, Stability AI's image-to-video generative AI model, experiences a 40% speedup with TensorRT.

The optimized Stable Video Diffusion 1.1 Image-to-Video model can be downloaded on Hugging Face.

Plus, the TensorRT extension for Stable Diffusion WebUI boosts performance by up to 2x - significantly streamlining Stable Diffusion workflows.

With the extension's latest update, TensorRT optimizations extend to ControlNets - a set of AI models that help guide a diffusion model's output by adding extra conditions. With TensorRT, ControlNets are 40% faster.

TensorRT optimizations extend to ControlNets for improved customization. Users can guide aspects of the output to match an input image, which gives them more control over the final image. They can also use multiple ControlNets together for even greater control. A ControlNet can be a depth map, edge map, normal map or keypoint detection model, among others.

Download the TensorRT extension for Stable Diffusion Web UI on GitHub today.

Other Popular Apps Accelerated by TensorRT Blackmagic Design adopted NVIDIA TensorRT acceleration in update 18.6 of DaVinci Resolve. Its AI tools, like Magic Mask, Speed Warp and Super Scale, run more than 50% faster and up to 2.3x faster on RTX GPUs compared with Macs.

In addition, with TensorRT integration, Topaz Labs saw an up to 60% performance increase in its Photo AI and Video AI apps - such as photo denoising, sharpening, photo super resolution, video slow motion, video super resolution, video stabilization and more - all running on RTX.

Combining Tensor Cores with TensorRT software brings unmatched generative AI performance to local PCs and workstations. And by running locally, several advantages are unlocked:

Performance: Users experience lower latency, since latency becomes independent of network quality when the entire model runs locally. This can be important for real-time use cases such as gaming or video conferencing. NVIDIA RTX offers the fastest AI accelerators, scaling to more than 1,300 AI trillion operations per second, or TOPS.

Cost: Users don't have to pay for cloud services, cloud-hosted application programming interfaces or infrastructure costs for large language model inference.

Always on: Users can access LLM capabilities anywhere they go, without relying on high-bandwidth network connectivity.

Data privacy: Private and proprietary data can always stay on the user's device.

Optimized for LLMs What TensorRT brings to deep learning, NVIDIA TensorRT-LLM brings to the latest LLMs.

TensorRT-LLM, an open-source library that accelerates and optimizes LLM inference, includes out-of-the-box support for popular community models, including Phi-2, Llama2, Gemma, Mistral and Code Llama. Anyone - from developers and creators to enterprise employees and casual users - can experiment with TensorRT-LLM-optimized models in the NVIDIA AI Foundation models. Plus, with the NVIDIA ChatRTX tech demo, users can see the performance of various models running locally on a Windows PC. ChatRTX is built on TensorRT-LLM for optimized performance on RTX GPUs.

NVIDIA is collaborating with the open-source community to develop native TensorRT-LLM connectors to popular application frameworks, including LlamaIndex and LangChain.

These innovations make it easy for developers to use TensorRT-LLM with their applications and experience the best LLM performance with RTX.

Get weekly updates directly in your inbox by subscribing to the AI Decoded newsletter.
LINK: https://blogs.nvidia.com/blog/ai-decoded-tensorrt-stable-diffusion-aut...
See more stories from nvidia

Most recent headlines

10/12/2025

Sound-Alike Commercials Are Part of Sports' Soundtrack

Sound-Alike Commercials Are Part of Sports' Soundtrack Johnny Cash for Coca-Cola is the latest in a long litany of sonic approximationsBy Dan Daley, Audio ...

10/12/2025

Immersive Sound Is Logical Next Step for Sports Venues

Immersive Sound Is Logical Next Step for Sports VenuesSound-systems suppliers are sanguine, but the market has its challengesBy Dan Daley, Audio Editor Wednes...

10/12/2025

The Romans Built Arenas for Immersive Sound 2,000 Years Ago

The Romans Built Arenas for Immersive Sound 2,000 Years AgoThe historic Arena of Nimes in France is still in use todayBy Dan Daley, Audio Editor Wednesday, De...

10/12/2025

SVG Summit 2025 Preview: Audio Workshop Hits on Immersive, Virtualized, and Next-Gen Streaming Workflows

SVG Summit 2025 Preview: Audio Workshop Hits on Immersive, Virtualized, and Next...

10/12/2025

SVG Summit 2025 Technology Exhibits Preview: Audio Spotlight

SVG Summit 2025 Technology Exhibits Preview: Audio SpotlightBy SVG Staff Wednesday, December 10, 2025 - 8:21 am Print This Story | Subscribe Story Highlig...

10/12/2025

SVG Europe Audio: Listening to the Sounds of Powder and Ice at Milano Cortina with a Behind the Scenes Tour of OBS and NBC's Audio Set Ups

SVG Europe Audio: Listening to the sounds of powder and ice at Milano Cortina wi...

10/12/2025

Advancements in Audio Technology: Capturing the Atmosphere of Live Sports

Advancements in audio technology: Capturing the atmosphere of live sports By David Davies Tuesday, November 25, 2025 - 09:27 Print This Story Although wor...

10/12/2025

Everything Smelled of Popcorn: The Art of Bringing the Complex Sound of Esports to Fans With Sound Supervisor Matt Gilbert

Everything smelled of popcorn: The art of bringing the complex sound of esports ...

10/12/2025

2026 Sundance Film Festival Unveils 97 Projects Selected for the Feature Film and Episodic Program

Top L-R: Ha-Chan, Shake Your Booty!, Hanging by a Wire, Broken English, Buddy C...

10/12/2025

You're in Control: Spotify Lets You Steer the Algorithm

For the first time, Spotify is giving users the power to steer the algorithm. Gustav S derstr m, Spotify's Co-President, CPO, and CTO, shares the vision beh...

10/12/2025

L3Harris to Produce Additional Solid Rocket Motors for Precision-Guided Artillery System

L3Harris' new contract for Guided Multiple Launch Rocket System Insensitive ...

10/12/2025

US Space Force Expands Offensive Space Programs Through L3Harris Foreign Sales

L3Harris Meadowlands system has been designed with an open architecture software system that allows for more flexible and efficient software updates. This capab...

10/12/2025

Football Shifts TV Viewing Towards Ad Supported, Nielsen's Q3 2025 Ad Supported Gauge Finds

During this interval, streaming comprised the majority of ad supported TV (46.4%...

10/12/2025

Bitcentral Names Venture Capital Exec Rick Arnold to Board

NEWPORT BEACH, Calif. Bitcentral, a provider of production, asset management, playout and streaming workflow solutions, has named technology veteran Rick Arnold...

10/12/2025

TV Tech Announces Winners of 2025 Best in Market Awards for M&E Tech

TV Tech is delighted to reveal the winners of the 2025 Media & Entertainment: Best in Market Awards....

10/12/2025

AIMS, VSF, AMWA, EBU To Hold Inaugural IPMX Testing, Certification Event

BOTHELL, Wash. The Alliance for IP Media Solutions (AIMS), the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA) and the European Broad...

10/12/2025

DirecTV Launches Peacock Games

In a notable example of how pay TV operators are integrating streaming services into their lineup and using those services to retain or attract subscribers, Dir...

10/12/2025

Chaos Brings Real-Time Rendering to Maya and Houdini

Today, Chaos builds instant feedback into the viewport, connecting Maya and Houdini to Chaos Vantage's real-time path tracer. Artists can now assess 3D asse...

10/12/2025

Smeup doubles capacity with Cubbit under a new agreement...

Smeup, a key partner for companies engaged in digital transformation, today announced the expansion of its adoption of Cubbit, the first geo-distributed cloud s...

10/12/2025

Mediagenix Strengthens Its Security Posture with ISO 2700...

Mediagenix, a global leader in smart content solutions to profitably connect the right content to the right audience, today announced two significant milestones...

10/12/2025

HDR10+ Technologies Unveils HDR10+ ADVANCED Dynamic Metadata Technology

BEAVERTON, Ore. HDR10+ Technologies, LLC has announced that they will soon begin the licensing and certification of devices, content, and services that support ...

10/12/2025

SMPTE, EBU, ETC Publish Report on AI's Impact on the Media

SMPTE has joined forces with the European Broadcasting Union (EBU) and Entertainment Technology Center (ETC) to publish an updated report on AI and its impact o...

10/12/2025

Clear-Com Appoints Kris Koch as New Director of Sales - N...

Clear-Com is pleased to announce the appointment of Kris Koch as Director of Sales - North & South America. In this expanded leadership role, Kris will oversee...

10/12/2025

Mavis Camera Launches Film Kit Unlocking LUT Workflows an...

Mavis today announced the latest version of Mavis Camera (v7.4), a major update to its professional iOS camera app, headlined by the launch of Film Kit - an opt...

10/12/2025

Creamsource Taps Industry Heavyweight Markus Zeiler as Gr...

Creamsource, renowned for its Vortex series of cinematic lighting, is laying the groundwork for its next phase of growth with the addition of Markus Zeiler as G...

10/12/2025

Digital Alert Systems Introduces DAS3-DC-PS DASDEC-III DC...

Digital Alert Systems, a global leader in emergency communications solutions for media providers, today announced that the DAS3-DC-PS, a new DC power supply opt...

10/12/2025

Riedel and Racing Electronics Announce Strategic Partners...

Riedel Communications today announced it has formed a strategic partnership with Racing Electronics, a premier provider of motorsport communication equipment in...

10/12/2025

GALSNGEAR Announces 2026 Leadership Retreats on East and...

#GALSNGEAR is launching two signature leadership retreats in early 2026, designed to equip women in media, entertainment, and technology with the tools to lead...

10/12/2025

CVP Launches Global Price Guarantee for Seamless Internat...

Providing worldwide customers with total confidence through transparent, all-inclusive pricing CVP, one of Europe's leading suppliers of professional video...

10/12/2025

Securing the Future of Broadcast TV in the U.S.

With the Federal Communications Commission working on new rules for the deployment of NextGen TV, next year promises to be an important one for both the future ...

10/12/2025

Former Charter CEO Tom Rutledge to Receive Cable Centers Bresnan Award

DENVER Tom Rutledge, director emeritus and former president and CEO of Charter Communications, will be honored with the 2026 Bresnan Ethics in Business Award by...

10/12/2025

Cadent Acquires YouTube Measurement Firm VuePlanner

NEW YORK Novocap's Cadent has acquired VuePlanner, a YouTube video ad planning, optimization, and measurement company in a deal that will help Cadent expand...

10/12/2025

Avoid Playlist Conflicts: Scheduling Back-to-Back Special Playlists

In preparation for the madness of March, here are some important reminders for scheduling back-to-back Special Playlists. The first Special Playlist MUST end b...

10/12/2025

VEON's Rising Capital Markets Profile Strengthened by Inclusion in Key Global Indices

10 Dec 2025 VEON's Rising Capital Markets Profile Strengthened by Inclusion...

10/12/2025

VEON Recognized for JazzCash, Kyivstar and Jazz at the World Communication Awards 2025

10 Dec 2025 VEON Recognized for JazzCash, Kyivstar and Jazz at the World Commun...

10/12/2025

Tribeca Films to Release the Independent Documentary Film Beam Me Up, Sulu by Timour Gregory and Sasha Schneider

December 10th, 2025 TRIBECA FILMS TO RELEASE THE INDEPENDENT DOCUMENTARY FILM...

10/12/2025

Sky extends partnership with the Ladies European Tour for a landmark 30th year

Wednesday 10 December 2025 Sky extends partnership with the Ladies European Tour for a landmark 30th year Sky and the Ladies European Tour (LET) have announce...

10/12/2025

Walk-on if you love the darts: James Maddison, Luke Littler and Big John star as Club 180 opens before 2026 PDC World Darts Championship

Wednesday 10 December 2025 Walk-on if you love the darts: James Maddison, Luke ...

10/12/2025

Rohde & Schwarz presents world's first RF power sensor with 0.80 mm RF connector for gapless DC to 150 GHz coverage

Rohde & Schwarz presents world's first RF power sensor with 0.80 mm RF conne...

10/12/2025

2026 Starts With a Swoon: Kim Seon-ho and Go Youn-jung Lead Can This Love Be Translated?', Premiering January 16

Back to All News 2026 Starts With a Swoon: Kim Seon-ho and Go Youn-jung Lead C...

10/12/2025

'Berlin and the Lady with an Ermine' Arrives to Netflix on May 15

Back to All News Berlin and the Lady with an Ermine Arrives to Netflix on May 15 Entertainment 10 December 2025 GlobalSpain Link copied to clipboard THE N...

10/12/2025

From stand-up to Foxtrot: Comedian Michael Fry revealed as fourth contestant for Dancing with the Stars 2026

It's out of the frying pan and into the sequins for comedian and actor Micha...

10/12/2025

Documentary film on the extraordinary life of Patrick Lydon, pioneer of social inclusion and disability, to premiere on RT

Born That Way airs Thursday 18 December on RT One and RT Player Born That ...

09/12/2025

2025 Sports Broadcasting Hall of Fame: Pam Oliver, Sideline Icon Who Redefined the Role

2025 Sports Broadcasting Hall of Fame: Pam Oliver, Sideline Icon Who Redefined t...

09/12/2025

SVG Summit 2025 Technology Exhibits Preview, Part 2

SVG Summit 2025 Technology Exhibits Preview, Part 2By Jason Dachman, Editorial Director, U.S. Tuesday, December 9, 2025 - 7:17 am Print This Story | Subscr...

09/12/2025

SVG Summit 2025 Preview: Cloud Production Workshop Spotlights Live and Non-Live Workflows in the Cloud

SVG Summit 2025 Preview: Cloud Production Workshop Spotlights Live and Non-Live ...

09/12/2025

Next-Generation Content Protection: Multi-Technology Security is Integral to Combating New Threats

Next-generation content protection: Multi-technology security is integral to com...

09/12/2025

CBS Sports Provides One-of-a-Kind Production' for UEFA Champions League Crossover Event

CBS Sports Provides One-of-a-Kind Production' for UEFA Champions League Cro...

09/12/2025

Spanish Professional Basketball League Relies on NETGEAR AV, MAM Tech for Seamless Production

Spanish Professional Basketball League Relies on NETGEAR AV, MAM Tech for Seamle...