Sony Pixel Power calrec Sony

Unlocking Peak Generations: TensorRT Accelerates AI on RTX PCs and Workstations

27/03/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and which showcases new hardware, software, tools and accelerations for RTX PC users.

As generative AI advances and becomes widespread across industries, the importance of running generative AI applications on local PCs and workstations grows. Local inference gives consumers reduced latency, eliminates their dependency on the network and enables more control over their data.

NVIDIA GeForce and NVIDIA RTX GPUs feature Tensor Cores, dedicated AI hardware accelerators that provide the horsepower to run generative AI locally.

Stable Video Diffusion is now optimized for the NVIDIA TensorRT software development kit, which unlocks the highest-performance generative AI on the more than 100 million Windows PCs and workstations powered by RTX GPUs.

Now, the TensorRT extension for the popular Stable Diffusion WebUI by Automatic1111 is adding support for ControlNets, tools that give users more control to refine generative outputs by adding other images as guidance.

TensorRT acceleration can be put to the test in the new UL Procyon AI Image Generation benchmark, which internal tests have shown accurately replicates real-world performance. It delivered speedups of 50% on a GeForce RTX 4080 SUPER GPU compared with the fastest non-TensorRT implementation.

More Efficient and Precise AI TensorRT enables developers to access the hardware that provides fully optimized AI experiences. AI performance typically doubles compared with running the application on other frameworks.

It also accelerates the most popular generative AI models, like Stable Diffusion and SDXL. Stable Video Diffusion, Stability AI's image-to-video generative AI model, experiences a 40% speedup with TensorRT.

The optimized Stable Video Diffusion 1.1 Image-to-Video model can be downloaded on Hugging Face.

Plus, the TensorRT extension for Stable Diffusion WebUI boosts performance by up to 2x - significantly streamlining Stable Diffusion workflows.

With the extension's latest update, TensorRT optimizations extend to ControlNets - a set of AI models that help guide a diffusion model's output by adding extra conditions. With TensorRT, ControlNets are 40% faster.

TensorRT optimizations extend to ControlNets for improved customization. Users can guide aspects of the output to match an input image, which gives them more control over the final image. They can also use multiple ControlNets together for even greater control. A ControlNet can be a depth map, edge map, normal map or keypoint detection model, among others.

Download the TensorRT extension for Stable Diffusion Web UI on GitHub today.

Other Popular Apps Accelerated by TensorRT Blackmagic Design adopted NVIDIA TensorRT acceleration in update 18.6 of DaVinci Resolve. Its AI tools, like Magic Mask, Speed Warp and Super Scale, run more than 50% faster and up to 2.3x faster on RTX GPUs compared with Macs.

In addition, with TensorRT integration, Topaz Labs saw an up to 60% performance increase in its Photo AI and Video AI apps - such as photo denoising, sharpening, photo super resolution, video slow motion, video super resolution, video stabilization and more - all running on RTX.

Combining Tensor Cores with TensorRT software brings unmatched generative AI performance to local PCs and workstations. And by running locally, several advantages are unlocked:

Performance: Users experience lower latency, since latency becomes independent of network quality when the entire model runs locally. This can be important for real-time use cases such as gaming or video conferencing. NVIDIA RTX offers the fastest AI accelerators, scaling to more than 1,300 AI trillion operations per second, or TOPS.

Cost: Users don't have to pay for cloud services, cloud-hosted application programming interfaces or infrastructure costs for large language model inference.

Always on: Users can access LLM capabilities anywhere they go, without relying on high-bandwidth network connectivity.

Data privacy: Private and proprietary data can always stay on the user's device.

Optimized for LLMs What TensorRT brings to deep learning, NVIDIA TensorRT-LLM brings to the latest LLMs.

TensorRT-LLM, an open-source library that accelerates and optimizes LLM inference, includes out-of-the-box support for popular community models, including Phi-2, Llama2, Gemma, Mistral and Code Llama. Anyone - from developers and creators to enterprise employees and casual users - can experiment with TensorRT-LLM-optimized models in the NVIDIA AI Foundation models. Plus, with the NVIDIA ChatRTX tech demo, users can see the performance of various models running locally on a Windows PC. ChatRTX is built on TensorRT-LLM for optimized performance on RTX GPUs.

NVIDIA is collaborating with the open-source community to develop native TensorRT-LLM connectors to popular application frameworks, including LlamaIndex and LangChain.

These innovations make it easy for developers to use TensorRT-LLM with their applications and experience the best LLM performance with RTX.

Get weekly updates directly in your inbox by subscribing to the AI Decoded newsletter.
LINK: https://blogs.nvidia.com/blog/ai-decoded-tensorrt-stable-diffusion-aut...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

21/04/2026

Diversified Appoints Tyler Affolter Chief Revenue Officer

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

TV Azteca to Bring Dolby Atmos to Free-To-Air TV in Mexico

Share Copy link Facebook X Linkedin Bluesky Email...

21/04/2026

Maxon Announces Free Tools and Mobile Expansion of ZBrush...

Cinema 4D brings professional 3D workflows to iPad. The return of Autograph now free for individual users. ZBrush expands to Windows on Arm. See it all at NAB...

21/04/2026

Bitfocus improves availability, security and user managem...

Software version 1.6 extends enterprise functionality to place Buttons at the heart of media operations at any scale Bitfocus, the Norwegian software develope...

21/04/2026

Cobalt Digital Announces Launch of blueCORE at NAB Show 2...

Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows Compact, multi-function stan...

21/04/2026

Applications open for 2026 AISF and Screen Australia Writer/Director Virtual Sessions

Applications open for 2026 AISF and Screen Australia Writer/Director Virtual Ses...

20/04/2026

Live From NAB 2026: Sonys Hugo Gaggioni Highlights HDR Advances, Software-Defined Workflows

At the 2026 NAB Show, Sony is showcasing a broad slate of innovations across liv...

20/04/2026

Live From NAB 2026: Fujinons Stosh Durbacz on Expanding the 4K Broadcast Lens Lineup With New Portable Zooms, 94x Box Lens

Fujifilm is sharpening its focus on core broadcast production with a new wave of...

20/04/2026

Live From NAB 2026: Rock-It Sports' John Walberg on Powering Logistics, Shipping for the 2026 FIFA Men's World Cup

This upcoming summer in North America is going to be a busy one. The 2026 FIFA M...

20/04/2026

NAB 2026: Glookast outlines product updates including Media Producer UX, connectors and Premiere Pro panel

Glookast (Booth W1661) announced a series of product updates at NAB Show 2026, c...

20/04/2026

NAB 2026: Matrox Video and Amagi collaborate on cloud-based broadcast workflows using ORIGIN framework

Matrox Video and Amagi announced a collaboration to integrate the Matrox ORIGIN ...

20/04/2026

NAB 2026: Riedel SimplyLive supports expanded centralised VAR system for Argentina football league

Riedel Communications (Booth C4908) announced that the Asociaci n del F tbol Arg...

20/04/2026

NAB 2026: Ikegami introduces VFE-P07D OLED viewfinder with integrated LCD monitor

Ikegami (Booth C3819) announced the VFE-P07D monocular OLED viewfinder at NAB Sh...

20/04/2026

NAB 2026: IABM rebrands as IAMT and launches AI discovery platform and global alliance

International Association of MediaTech (IAMT), formerly known as IABM, announced...

20/04/2026

NAB 2026: Harmonic supports DIRECTV DTH platform upgrade with VOS Media Software

Harmonic (Booth W2831) announced that DIRECTV is updating its US direct-to-home (DTH) video platform using Harmonic's VOS Media Software. The deployment is...

20/04/2026

NAB 2026: Wasabi Technologies acquires Seagate Lyve Cloud business

Wasabi Technologies announced that it has acquired the Lyve Cloud business from Seagate Technology. As part of the agreement, Seagate received equity in Wasabi ...

20/04/2026

NAB 2026: EVS introduces Choreon robotics orchestration platform for unified production control

EVS (Booth N1841) has launched Choreon, a robotics controller for media producti...

20/04/2026

SportsTechBuzz at NAB 2026, Day 2: Live Reports From the Show Floor in Vegas

The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...

20/04/2026

NAB 2026: Skyline Communications launches DataMiner packages on Grass Valley AMPP App Store

Skyline Communications announced the availability of its DataMiner xOps platform...

20/04/2026

NAB 2026: SNS launches Outpost, Trio and AI Suite for connected post-production workflows

Studio Network Solutions (Booth N1129) introduced a set of new products at NAB S...

20/04/2026

NAB 2026: Dell Technologies and NVIDIA present AI data platform for media workflows

Dell Technologies is showcasing its Dell AI Data Platform with NVIDIA at NAB Sho...

20/04/2026

NAB 2026: Blackmagic Design Announces Fairlight Live Software Audio Mixer

Blackmagic Design has announced Fairlight Live, a software-based live audio mixer with SMPTE 2110 support and spatial audio mixing. A public beta is available n...

20/04/2026

Live From NAB 2026: Imagine Comms Jimbo Haneklau Talks Prismon, Hybrid IP/SDI Workflows, and Cloud Playout

At the 2026 NAB Show in Las Vegas, Imagine Communications VP of Sales, Sports an...

20/04/2026

Live From NAB 2026: LiveUs Phillip Broaddus on LU900Q Launch, Nexus Cloud Platform, and REMI Growth

At the 2026 NAB Show in Las Vegas, LiveU Senior Director of Sales, Sports Philli...

20/04/2026

3 New Ways to Dive Deeper Into the Music You Love

A song that perfectly captures a moment is magic. But when you uncover the story behind it, who made it, what inspired it, and the meaning woven into the lyrics...

20/04/2026

Deity Microphones announce the PR-4

Ultra-compact 32-bit recorder set for launch Deity Microphones will soon be launching a new 32-bit six-track recorder that's been designed with producti...

20/04/2026

Lectrosonics preview the S1

Uncoming lightweight shotgun mic announced Production-sound experts Lectrosonics have recently announced the upcoming launch of a new lightweight shotgun mi...

20/04/2026

The story of Focusrite ISA

New 20-minute documentary explores iconic preamp In 2025, Focusrite commissioned a new short-form documentary with filmmaker Chris Mayes-Wright - the direct...

20/04/2026

Sampleson release Boomcha

Turn quick sketches into real drum grooves Sampleson have been experimenting with assitive production tools recently, and their latest creation aims to make...

20/04/2026

Rohde & Schwarz rolls out its full ARDRONIS counter UAS suite in a demonstration van at Counter UAS Technology Europe 2026

Rohde & Schwarz rolls out its full ARDRONIS counter UAS suite in a demonstration...

20/04/2026

Protecting America's Shores: L3Harris Keeps the Coast Guard Mission-Ready

L3Harris delivers integrated communications, navigation and C4ISR capabilities that empower the U.S. Coast Guard to protect Americas maritime interests and resp...

20/04/2026

Google Cloud Embraces the Rise of Agentic Production

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Creators Go All in on AI, Niche Content

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

NBC Sports' Jon Miller: Broadcast Is Having a Moment'

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Beyond the Lift and Shift': Cloud Migration's New Mandate

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Virtual Production Finds Its Footing

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Corporate Creators: All Companies Are Media Companies Now

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

IABM Rebrands as the International Association of MediaTech

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

CBS Detroit Debuts New AR/VR Technology-Driven Studio

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

Fox Sports Taps Appear X Platform for Remote Production

Share Copy link Facebook X Linkedin Bluesky Email...

20/04/2026

CueScript and Lighting Design Group Expand Customer Oppor...

CueScript and Lighting Design Group Expand Customer Opportunities Through New Partnership Find both companies at 2026 NAB Show in CueScript Booth # C 4720 ...

20/04/2026

Layercake Deepens Bitmovin Integration to Power End-to-En...

[Sydney, NSW, 20 April 2026] - Layercake, the company behind the intelligent media orchestration platform Streamcake, today announced the formalisation of its i...

20/04/2026

FOX Sports selects Appear X Platform for next-generation...

Deployment spans FOX Sports' REMI infrastructure, IP production for a major global soccer event, and its Jewel Events production systems Appear, a global l...