
Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and which showcases new hardware, software, tools and accelerations for RTX PC users.
As generative AI advances and becomes widespread across industries, the importance of running generative AI applications on local PCs and workstations grows. Local inference gives consumers reduced latency, eliminates their dependency on the network and enables more control over their data.
NVIDIA GeForce and NVIDIA RTX GPUs feature Tensor Cores, dedicated AI hardware accelerators that provide the horsepower to run generative AI locally.
Stable Video Diffusion is now optimized for the NVIDIA TensorRT software development kit, which unlocks the highest-performance generative AI on the more than 100 million Windows PCs and workstations powered by RTX GPUs.
Now, the TensorRT extension for the popular Stable Diffusion WebUI by Automatic1111 is adding support for ControlNets, tools that give users more control to refine generative outputs by adding other images as guidance.
TensorRT acceleration can be put to the test in the new UL Procyon AI Image Generation benchmark, which internal tests have shown accurately replicates real-world performance. It delivered speedups of 50% on a GeForce RTX 4080 SUPER GPU compared with the fastest non-TensorRT implementation.
More Efficient and Precise AI
TensorRT enables developers to access the hardware that provides fully optimized AI experiences. AI performance typically doubles compared with running the application on other frameworks.
It also accelerates the most popular generative AI models, like Stable Diffusion and SDXL. Stable Video Diffusion, Stability AI's image-to-video generative AI model, experiences a 40% speedup with TensorRT.
The optimized Stable Video Diffusion 1.1 Image-to-Video model can be downloaded on Hugging Face.
Plus, the TensorRT extension for Stable Diffusion WebUI boosts performance by up to 2x - significantly streamlining Stable Diffusion workflows.
With the extension's latest update, TensorRT optimizations extend to ControlNets - a set of AI models that help guide a diffusion model's output by adding extra conditions. With TensorRT, ControlNets are 40% faster.
Users can guide aspects of the output to match an input image, which gives them more control over the final image. They can also use multiple ControlNets together for even greater control. A ControlNet can be conditioned on a depth map, edge map, normal map or keypoint detections, among others.
Download the TensorRT extension for Stable Diffusion WebUI on GitHub today.
Other Popular Apps Accelerated by TensorRT
Blackmagic Design adopted NVIDIA TensorRT acceleration in update 18.6 of DaVinci Resolve. Its AI tools, like Magic Mask, Speed Warp and Super Scale, run more than 50% faster and up to 2.3x faster on RTX GPUs compared with Macs.
In addition, with TensorRT integration, Topaz Labs saw a performance increase of up to 60% in its Photo AI and Video AI apps, which cover tasks such as photo denoising, sharpening, photo super resolution, video slow motion, video super resolution and video stabilization, all running on RTX.
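The speedups quoted above mix percentages (40%, 50%, 60%) with multipliers (2x, 2.3x). A minimal sketch of how the two relate, assuming "N% faster" means N% higher throughput (the post does not define it); the function names are illustrative and not part of any NVIDIA tool:

```python
def speedup_factor(percent_faster: float) -> float:
    """Convert an 'N% faster' figure into a throughput multiplier.

    Assumption: 'N% faster' means throughput rises by N percent.
    """
    return 1.0 + percent_faster / 100.0


def time_fraction(percent_faster: float) -> float:
    """Fraction of the baseline run time that remains after the speedup."""
    return 1.0 / speedup_factor(percent_faster)


if __name__ == "__main__":
    # Figures quoted in the post; the labels are shorthand for readability.
    quoted = [
        ("Stable Video Diffusion / ControlNets", 40.0),
        ("UL Procyon on RTX 4080 SUPER", 50.0),
        ("Topaz Labs apps (up to)", 60.0),
        ("Stable Diffusion WebUI extension (up to 2x)", 100.0),
    ]
    for label, pct in quoted:
        print(f"{label}: {speedup_factor(pct):.2f}x throughput, "
              f"{time_fraction(pct):.0%} of baseline time")
```

Under this reading, a 2x speedup halves the run time, while a 40% speedup leaves roughly 71% of the original run time.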
Combining Tensor Cores with TensorRT software brings unmatched generative AI performance to local PCs and workstations. Running locally also unlocks several advantages:
Performance: Users experience lower latency, since latency becomes independent of network quality when the entire model runs locally. This can be important for real-time use cases such as gaming or video conferencing. NVIDIA RTX offers the fastest AI accelerators, scaling to more than 1,300 trillion AI operations per second, or TOPS.
Cost: Users don't have to pay for cloud services, cloud-hosted application programming interfaces or infrastructure costs for large language model inference.
Always on: Users can access LLM capabilities anywhere they go, without relying on high-bandwidth network connectivity.
Data privacy: Private and proprietary data can always stay on the user's device.
Optimized for LLMs
What TensorRT brings to deep learning, NVIDIA TensorRT-LLM brings to the latest LLMs.
TensorRT-LLM, an open-source library that accelerates and optimizes LLM inference, includes out-of-the-box support for popular community models, including Phi-2, Llama 2, Gemma, Mistral and Code Llama. Anyone - from developers and creators to enterprise employees and casual users - can experiment with TensorRT-LLM-optimized models in the NVIDIA AI Foundation models. Plus, with the NVIDIA ChatRTX tech demo, users can see the performance of various models running locally on a Windows PC. ChatRTX is built on TensorRT-LLM for optimized performance on RTX GPUs.
NVIDIA is collaborating with the open-source community to develop native TensorRT-LLM connectors to popular application frameworks, including LlamaIndex and LangChain.
These innovations make it easy for developers to use TensorRT-LLM with their applications and experience the best LLM performance with RTX.
Get weekly updates directly in your inbox by subscribing to the AI Decoded newsletter.