Sony Pixel Power calrec Sony

NVIDIA TensorRT Boosts Stable Diffusion 3.5 Performance on NVIDIA GeForce RTX and RTX PRO GPUs

12/06/2025

Generative AI has reshaped how people create, imagine and interact with digital content.

As AI models continue to grow in capability and complexity, they require more VRAM, or video random access memory. The base Stable Diffusion 3.5 Large model, for example, uses over 18GB of VRAM - limiting the number of systems that can run it well.

By applying quantization to the model, noncritical layers can be removed or run with lower precision. NVIDIA GeForce RTX 40 Series and the Ada Lovelace generation of NVIDIA RTX PRO GPUs support FP8 quantization to help run these quantized models, and the latest-generation NVIDIA Blackwell GPUs also add support for FP4.

NVIDIA collaborated with Stability AI to quantize its latest model, Stable Diffusion (SD) 3.5 Large, to FP8 - reducing VRAM consumption by 40%. Further optimizations to SD3.5 Large and Medium with the NVIDIA TensorRT software development kit (SDK) double performance.

In addition, TensorRT has been reimagined for RTX AI PCs, combining its industry-leading performance with just-in-time (JIT), on-device engine building and an 8x smaller package size for seamless AI deployment to more than 100 million RTX AI PCs. TensorRT for RTX is now available as a standalone SDK for developers.

RTX-Accelerated AI NVIDIA and Stability AI are boosting the performance and reducing the VRAM requirements of Stable Diffusion 3.5, one of the world's most popular AI image models. With NVIDIA TensorRT acceleration and quantization, users can now generate and edit images faster and more efficiently on NVIDIA RTX GPUs.

Stable Diffusion 3.5 quantized FP8 (right) generates images in half the time with similar quality as FP16 (left). Prompt: A serene mountain lake at sunrise, crystal clear water reflecting snow-capped peaks, lush pine trees along the shore, soft morning mist, photorealistic, vibrant colors, high resolution. To address the VRAM limitations of SD3.5 Large, the model was quantized with TensorRT to FP8, reducing the VRAM requirement by 40% to 11GB. This means five GeForce RTX 50 Series GPUs can run the model from memory instead of just one.

SD3.5 Large and Medium models were also optimized with TensorRT, an AI backend for taking full advantage of Tensor Cores. TensorRT optimizes a model's weights and graph - the instructions on how to run a model - specifically for RTX GPUs.

FP8 TensorRT boosts SD3.5 Large performance by 2.3x vs. BF16 PyTorch, with 40% less memory use. For SD3.5 Medium, BF16 TensorRT delivers a 1.7x speedup. Combined, FP8 TensorRT delivers a 2.3x performance boost on SD3.5 Large compared with running the original models in BF16 PyTorch, while using 40% less memory. And in SD3.5 Medium, BF16 TensorRT provides a 1.7x performance increase compared with BF16 PyTorch.

The optimized models are now available on Stability AI's Hugging Face page.

NVIDIA and Stability AI are also collaborating to release SD3.5 as an NVIDIA NIM microservice, making it easier for creators and developers to access and deploy the model for a wide range of applications. The NIM microservice is expected to be released in July.

TensorRT for RTX SDK Released Announced at Microsoft Build - and already available as part of the new Windows ML framework in preview - TensorRT for RTX is now available as a standalone SDK for developers.

Previously, developers needed to pre-generate and package TensorRT engines for each class of GPU - a process that would yield GPU-specific optimizations but required significant time.

With the new version of TensorRT, developers can create a generic TensorRT engine that's optimized on device in seconds. This JIT compilation approach can be done in the background during installation or when they first use the feature.

The easy-to-integrate SDK is now 8x smaller and can be invoked through Windows ML - Microsoft's new AI inference backend in Windows. Developers can download the new standalone SDK from the NVIDIA Developer page or test it in the Windows ML preview.

For more details, read this NVIDIA technical blog and this Microsoft Build recap.

Join NVIDIA at GTC Paris At NVIDIA GTC Paris at VivaTech - Europe's biggest startup and tech event - NVIDIA founder and CEO Jensen Huang yesterday delivered a keynote address on the latest breakthroughs in cloud AI infrastructure, agentic AI and physical AI. Watch a replay.

GTC Paris runs through Thursday, June 12, with hands-on demos and sessions led by industry leaders. Whether attending in person or joining online, there's still plenty to explore at the event.

Each week, the RTX AI Garage blog series features community-driven AI innovations and content for those looking to learn more about NVIDIA NIM microservices and AI Blueprints, as well as building AI agents, creative workflows, digital humans, productivity apps and more on AI PCs and workstations.

Plug in to NVIDIA AI PC on Facebook, Instagram, TikTok and X - and stay informed by subscribing to the RTX AI PC newsletter.

Follow NVIDIA Workstation on LinkedIn and X.

See notice regarding software product information.
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-gtc-paris-tensorrt-rtx-nim...
See more stories from nvidia

Most recent headlines

06/10/2025

France Tlvisions Wins Prestigious 2025 EBU Technology & Innovation Award in Groundbreaking Collaboration with Dalet

France T l visions, France's leading broadcaster, has received the 2025 EBU ...

04/09/2025

Monumental Sports & Entertainment and Dalet Win Prestigious 2025 NAB Show Project of the Year Award

Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...

07/08/2025

Tata Motors & Dolby Bring Dolby Atmos to Harrier.ev, Redefining In-Car Entertainment Experience

July 8 2025, 22:30 (PDT) Tata Motors & Dolby Bring Dolby Atmos to Harrier.ev, R...

30/07/2025

Inside the Archives: Building Community through the Creative Process at the 2009 Directors Lab

The 2009 Directors Lab fellows share a moment together outside at the Sundance R...

30/07/2025

4 Must-Try Features That Make Spotify the Best Place to Be a Fan

At Spotify, we're always working to enhance your listening experience and connect you more deeply with the artists and music you love. That means helping yo...

30/07/2025

Engagement Superiority: How L3Harris' Iver AUVs are Establishing the Hybrid Undersea Fleet

L3Harris is redefining the undersea battlespace with its Iver4 autonomous underw...

30/07/2025

SMPTE Announces 2025 Honorees

WHITE PLAINS, N.Y. SMPTE, the home for media professionals, technologists, and engineers, will honor some of the industry's top minds and organizations at i...

30/07/2025

Lawo Showcases Dynamic Media Agility at IBC2025

At this year's IBC show in Amsterdam, Sept. 12-15, Lawo will showcase a forward-thinking portfolio of innovations that enables broadcasters to build agile, ...

30/07/2025

Shotoku to Feature Latest Robotic Control Enhancements at IBC 2025

SUNBURY-ON-THAMES, U.K. Shotoku Broadcast Systems will showcase its SmartPed fully robotic pedestal working in concert with an enhanced version of SoftRail, the...

30/07/2025

MARSHALL ELECTRONICS ANNOUNCES CV355-27X-ND3 CAMERA WITH...

Marshall Electronics introduces the CV355-27X-ND3 Optical Zoom NDI (NDI HX2, NDI HX3) Camera at IBC 2025 (Booth 11.C28). This camera utilizes a professional-gr...

30/07/2025

Pixel Power PRISMON Increases European Footprint with Key...

At the end of 2024 Pixel Power (A Rohde & Schwarz Company) announced that our company had become the sole sales and support destination for PRISMON, the innovat...

30/07/2025

MASV Express Breaks File Transfer Barriers with Speed Sim...

MASV (massive.io), the fastest and most reliable large file transfer platform for media professionals and an IDC Innovator 2025 for Media & Entertainment*, toda...

30/07/2025

NAKIVO Reports 40 Percent Revenue Growth in the Americas...

NAKIVO Inc., a fast-growing software company specialising in protecting virtual, physical, cloud, and SaaS environments, announced strong results for Q2 2025. T...

30/07/2025

Zee Entertainment UK Extends Partnership With Cerberus Te...

Cerberus Tech, a leader in cloud-native IP video contribution and distribution, today announced the continued expansion of its partnership with Zee Entertainmen...

30/07/2025

Black Box at IBC2025 Powering Smarter Control Rooms With...

At IBC2025, Black Box will demonstrate the continued evolution of its acclaimed Emerald IP-Based Control Solution, with powerful new features designed to stre...

30/07/2025

Leader to present full lineup of advanced test and measur...

est & measurement innovator, Leader Electronics of Europe, has announced that it will highlight its ability to provide suitable T&M equipment regardless of form...

30/07/2025

Chaos expands Cosmos with 30000 new assets and AI-powered...

Today, Chaos announces a major expansion to its Chaos Cosmos library, adding nearly 30,000 high-quality 3D assets to become the largest curated library of asset...

30/07/2025

Globecast Media Platform To Debut at IBC2025

Globecast, the leading provider of broadcast, media and entertainment managed services, will showcase its latest innovations in hybrid cloud solutions at IBC202...

30/07/2025

Mediaproxy to showcase new remote monitoring workflows at...

Mediaproxy, the global standard for IP compliance monitoring and multiviewing solutions, will be at this year s IBC on Stand 5.D76 in Amsterdam s RAI Convention...

30/07/2025

UEFA WOMENS EURO 2025 DRIVES SURGE IN AD-FUNDED STREAMING...

Yospace, the global leader in Dynamic Ad Insertion (DAI), reveals that it stitched 6 billion one-to-one addressable advertisements across the duration of the UE...

30/07/2025

World Archery Brings Production In-House with Appears X P...

New IP-based workflow provides full control, improved quality and 18-month ROI, reflecting broader industry shift from satellite to SRT Appear, the global lead...

30/07/2025

Nielsen: Ad-Supported TV Grew to a 73.6% Share of All Viewing in Q2

NEW YORK Nielsen reports that viewing of ad-supported content got more popular in Q2, 2025, gaining 1.2 share points of overall TV viewing to capture a 73.6% sh...

30/07/2025

20th Annual Independent Show Aims to Reach New Heights'

WASHINGTON and OVERLAND PARK, Kan. The National Content & Technology Cooperative (NCTC) and ACA Connects have released new details about the 20th annual Indepe...

30/07/2025

Fubo Shrinks Losses, Increases Subscriber Numbers

Fubo issued guidance on its second quarter today, reporting strong numbers for the period, shrinking its net loss and turning a positive operating profit for th...

30/07/2025

Former Vizrt Execs Launch AI-Driven Media Production Company

NEW YORK Former executives from Vizrt and Disguise announced today the launch of Emergent, an AI- and data-driven technology and services company that will offe...

30/07/2025

FOR-A America Partners With TecNec for U.S. Distribution

CYPRESS, Calif. FOR-A America said it has signed a distribution deal with TecNec, its first-U.S.-based distributor....

30/07/2025

MultiDyne To Spotlight VersaBrix Series Updates At IBC2025

KINGS PARK, N.Y. MultiDyne Video & Fiber Optic Systems will feature the latest updates to its rugged VersaBrix (VB) Series modular fiber optic transport platfor...

30/07/2025

NHL Strikes Deal With DAZN to Distribute NHL.TV to Nearly 200 Countries

NHL Strikes Deal With DAZN to Distribute NHL.TV to Nearly 200 Countries NHL.TV on DAZN will include distribution of the Stanley Cup Playoffs and Stanley Cup Fin...

30/07/2025

SVG Content Management Forum 2025: All Sessions Now Available to Watch on SVG PLAY

SVG Content Management Forum 2025: All Sessions Now Available to Watch on SVG PL...

30/07/2025

Cricket kids v experts: Skys stars put in a spin by young reporters

Sky Sports' cricket team were put through their paces by the next generation of reporters with some mischievous help from current stars of The Hundred.Wedne...

30/07/2025

First look at Sky Original film Nuremberg, starring Russell Crowe, Rami Malek and Michael Shannon

Ahead of the 80th anniversary of the Nuremberg trials, Sky will release James Va...

30/07/2025

Rethinking Broadcast Partnerships: Why the GV Alliance Matters

The days of building broadcast infrastructure around a single vendor are long gone. Broadcasters want flexibility, and they want tools that work together, witho...

30/07/2025

2025-07-30

Leagues Cup, the first in-season club tournament in North America across all men's professional sports, begins today, July 29, and MLS Season Pass on Apple ...

29/07/2025

UEFA Womens EURO 2025 drives surge in ad-funded streaming as Yospace powers 6 billion one-to-one advertisements

Staines-upon-Thames, UK, 29 July, 2025 Yospace, the global leader in Dynamic Ad ...

29/07/2025

2025 Sundance Institute Trans Possibilities Intensive Announced

Six Fellows Selected for Program Supporting Projects From Transgender Storytellers of Color Today the nonprofit Sundance Institute announced the six artists p...

29/07/2025

Give Me the Backstory: Get to Know Michael Shanks, the Writer-Director of Together

By Jessica Herndon One of the most exciting things about the Sundance Film Fest...

29/07/2025

Spotify Reports Second Quarter 2025 Earnings

Today, we announced our second quarter 2025 earnings, fueled by standout subscriber and MAU growth. In the first half of 2025, subscriber net additions grew mor...

29/07/2025

Spotify rapporterar intkter fr andra kvartalet 2025

Idag rapporterar vi resultatet f r andra kvartalet 2025, med stark tillv xt av antalet prenumeranter och m natliga aktiva anv ndare. Under f rsta halv ret kade...

29/07/2025

Viewing to Content with Ads Gained Share to 73.6% of Overall TV Viewing in Q2, Nielsen's Q2 2025 Ad Supported Gauge Finds

Streaming Holds Steady in a Lighter Summer Viewership Season NEW YORK - July 29...

29/07/2025

Viewing of Ad-Supported Services Grew to 73.6% of TV Viewing in Q2

NEW YORK Nielsen is reporting that viewing of content with ads became more popular in Q2, 2025, gaining 1.2 share points of overall TV viewing to capture 73.6% ...

29/07/2025

QuickLink Launches New StudioEdge Products for IBC2025

SAN ANTONIO QuickLink has launched two new versions of its StudioEdge line of products: StudioEdge-1 and StudioEdge-2 provide one-channel and two-channels of br...

29/07/2025

SBE Honors Irwin, Bialik for Engineering Achievements

The Society of Broadcast Engineers has announced the recipients of the 2025 SBE National Awards, which recognize outstanding achievements by individual members,...

29/07/2025

Cobalt Digital to Feature Plug-and-Play ST 2110 Solutions at IBC 2025

CHAMPAIGN, Ill. Cobalt Digital is heading to IBC 2025 with an expanded lineup of IPMX-compliant products and solutions that highlight its simple plug-and-play a...

29/07/2025

G&D Outlines Plans for Live KVM and Video Processing Demos at IBC2025

AMSTERDAM German manufacturer Guntermann & Drunck GmbH (G&D) has announced that it will present a wide range of KVM and video processing solutions for broadcast...

29/07/2025

Guntermann and Drunck at IBC 2025 Live demos of innovativ...

At IBC 2025 in Amsterdam (September 12 15), German manufacturer Guntermann & Drunck GmbH (G&D) will present a range of intelligent solutions designed to meet th...

29/07/2025

FourCastNet 3 Enables Fast and Accurate Large Ensemble Weather Forecasting With Scalable Geometric ML

FourCastNet3 (FCN3) is the latest AI global weather forecasting system from NVID...

29/07/2025

Space42, Microsoft, and Esri Sign Memorandum of Understanding to Enhance Mapping Capabilities Across Africa

MoU will support the Map Africa Initiative, a program designed to create a con...

29/07/2025

X-Rite Launches CT2100 Spectrophotometer for Fast, Affordable Retail Paint Color Matching

X-Rite Launches CT2100 Spectrophotometer for Fast, Affordable Retail Paint Color...

29/07/2025

Adrian Edmondson and Lesley Sharp join cast for second series of Bergerac

Filming is now underway with Damien Molony and wider cast returning to Jersey for Bergerac, written by Toby Whithouse alongside Ashley Sanders, Emilie Robson an...

29/07/2025

NBA Summer League Tests Out, Refines Audio Workflows

NBA Summer League Tests Out, Refines Audio Workflows New mic arrays and ways of mixing them are a focus By Dan Daley, Audio Editor Tuesday, July 29, 2025 - 7...