Sony Pixel Power calrec Sony

Striking Performance: Large Language Models up to 4x Faster on RTX With TensorRT-LLM for Windows

17/10/2023

Generative AI is one of the most important trends in the history of personal computing, bringing advancements to gaming, creativity, video, productivity, development and more.

And GeForce RTX and NVIDIA RTX GPUs, which are packed with dedicated AI processors called Tensor Cores, are bringing the power of generative AI natively to more than 100 million Windows PCs and workstations.

Today, generative AI on PC is getting up to 4x faster via TensorRT-LLM for Windows, an open-source library that accelerates inference performance for the latest AI large language models, like Llama 2 and Code Llama. This follows the announcement of TensorRT-LLM for data centers last month.

NVIDIA has also released tools to help developers accelerate their LLMs, including scripts that optimize custom models with TensorRT-LLM, TensorRT-optimized open-source models and a developer reference project that showcases both the speed and quality of LLM responses.

TensorRT acceleration is now available for Stable Diffusion in the popular Web UI by Automatic1111 distribution. It speeds up the generative AI diffusion model by up to 2x over the previous fastest implementation.

Plus, RTX Video Super Resolution (VSR) version 1.5 is available as part of today's Game Ready Driver release - and will be available in the next NVIDIA Studio Driver, releasing early next month.

Supercharging LLMs With TensorRT LLMs are fueling productivity - engaging in chat, summarizing documents and web content, drafting emails and blogs - and are at the core of new pipelines of AI and other software that can automatically analyze data and generate a vast array of content.

TensorRT-LLM, a library for accelerating LLM inference, gives developers and end users the benefit of LLMs that can now operate up to 4x faster on RTX-powered Windows PCs.

At higher batch sizes, this acceleration significantly improves the experience for more sophisticated LLM use - like writing and coding assistants that output multiple, unique auto-complete results at once. The result is accelerated performance and improved quality that lets users select the best of the bunch.

TensorRT-LLM acceleration is also beneficial when integrating LLM capabilities with other technology, such as in retrieval-augmented generation (RAG), where an LLM is paired with a vector library or vector database. RAG enables the LLM to deliver responses based on a specific dataset, like user emails or articles on a website, to provide more targeted answers.

To show this in practical terms, when the question How does NVIDIA ACE generate emotional responses? was asked of the LLaMa 2 base model, it returned an unhelpful response.

Better responses, faster. Conversely, using RAG with recent GeForce news articles loaded into a vector library and connected to the same Llama 2 model not only returned the correct answer - using NeMo SteerLM - but did so much quicker with TensorRT-LLM acceleration. This combination of speed and proficiency gives users smarter solutions.

TensorRT-LLM will soon be available to download from the NVIDIA Developer website. TensorRT-optimized open source models and the RAG demo with GeForce news as a sample project are available at ngc.nvidia.com and GitHub.com/NVIDIA.

Automatic Acceleration Diffusion models, like Stable Diffusion, are used to imagine and create stunning, novel works of art. Image generation is an iterative process that can take hundreds of cycles to achieve the perfect output. When done on an underpowered computer, this iteration can add up to hours of wait time.

TensorRT is designed to accelerate AI models through layer fusion, precision calibration, kernel auto-tuning and other capabilities that significantly boost inference efficiency and speed. This makes it indispensable for real-time applications and resource-intensive tasks.

And now, TensorRT doubles the speed of Stable Diffusion.

Compatible with the most popular distribution, WebUI from Automatic1111, Stable Diffusion with TensorRT acceleration helps users iterate faster and spend less time waiting on the computer, delivering a final image sooner. On a GeForce RTX 4090, it runs 7x faster than the top implementation on Macs with an Apple M2 Ultra. The extension is available for download today.

The TensorRT demo of a Stable Diffusion pipeline provides developers with a reference implementation on how to prepare diffusion models and accelerate them using TensorRT. This is the starting point for developers interested in turbocharging a diffusion pipeline and bringing lightning-fast inferencing to applications.

Video That's Super AI is improving everyday PC experiences for all users. Streaming video - from nearly any source, like YouTube, Twitch, Prime Video, Disney+ and countless others - is among the most popular activities on a PC. Thanks to AI and RTX, it's getting another update in image quality.

RTX VSR is a breakthrough in AI pixel processing that improves the quality of streamed video content by reducing or eliminating artifacts caused by video compression. It also sharpens edges and details.

Available now, RTX VSR version 1.5 further improves visual quality with updated models, de-artifacts content played in its native resolution and adds support for RTX GPUs based on the NVIDIA Turing architecture - both professional RTX and GeForce RTX 20 Series GPUs.

Retraining the VSR AI model helped it learn to accurately identify the difference between subtle details and compression artifacts. As a result, AI-enhanced images more accurately preserve details during the upscaling process. Finer details are more visible, and the overall image looks sharper and crisper.

RTX Video Super Resolution v1.5 improves detail and sharpness. New with version 1.5 is the ability to de-artifact video played at the display's native resolution. The original release only enhanced video when it was
LINK: https://blogs.nvidia.com/blog/2023/10/17/tensorrt-llm-windows-stable-d...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

15/01/2026

Telycam to Showcase New Mix One Video Switcher at ISE 2026

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

15/01/2026

Pliant Names Adam Grede as Regional Sales Manager

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

15/01/2026

NFL Wild Card Games Score With Viewers

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

15/01/2026

Survive the Quarantine Zone and More With Devolver Digital Games on GeForce NOW

NVIDIA kicked off the year at CES, where the crowd buzzed about the latest gaming announcements - including the native GeForce NOW app for Linux and Amazon Fire...

14/01/2026

ITV selects Yospace for Advanced Ad Measurement and Monetisation on Freely

Staines-upon-Thames, UK, 13th January, 2026 ITV, one of the UKs leading broadcasters, has selected Yospace, the global leader in Dynamic Ad Insertion (DAI), to ...

14/01/2026

Tech Focus: Audio Consoles, Part 2 - New Options for Virtual Mixing

Tech Focus: Audio Consoles, Part 2 - New Options for Virtual MixingA variety of solutions offer both technical and economic benefitsBy Dan Daley, Audio Editor ...

14/01/2026

Tech Focus: Audio Consoles, Part 1 - Key Component Evolves Toward the Totally Virtual

Tech Focus: Audio Consoles, Part 1 - Key Component Evolves Toward the Totally Vi...

14/01/2026

SVG Summit 2025: Audio from Monday Workshops Now Available

SVG Summit 2025: Audio from Monday Workshops Now AvailableListen to sessions from Live Production Innovation, AI Production Tools, Cloud Production, Content Wor...

14/01/2026

US Navy and Marines Select L3Harris T7 Robots to Enhance Ordnance Disposal Capabilities

The L3Harris large T7 robotic systems will provide U.S. Navy and U.S. Marines wi...

14/01/2026

Steiger Media reimagines broadcast workflows with Calrec

Steiger Media's adoption of Calrec's compact Argo M console not only makes its innovative new hybrid truck faster, more efficient, and agile, but also e...

14/01/2026

NBC Sports to Deploy viztrick AiDi for Live Sports Production

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

14/01/2026

Sinclair Accepting Applications for 2026 Scholarship Program

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

14/01/2026

Neal Shapiro to Retire as President and CEO of The WNET Group

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

14/01/2026

Tribeca Announces Best New York Short Award for 25th Anniversary Festival

January 14th, 2026 TRIBECA ANNOUNCES BEST NEW YORK SHORT AWARD FOR 25TH ANNIVERSARY FESTIVAL In Celebration of Its 25th Anniversary, Tribeca Introduces a N...

14/01/2026

Sky News announces Cathy Newman to lead flagship new political programme

Wednesday 14 January 2026 Sky News announces Cathy Newman to lead flagship new political programme Sky News today announces that award-winning journalist and ...

14/01/2026

'State of Fear', The First Spin-Off of a Netflix Brazil Production, Premieres February 11

Back to All News State of Fear, The First Spin-Off of a Netflix Brazil Producti...

14/01/2026

Special stamp celebrates 100 Years of Broadcasting in Ireland

The first stamp of An Post's 2026 Stamp Programme, marking 100 Years of Broadcasting, was unveiled at the GPO by Patrick O'Donovan TD, Minister for Cult...

14/01/2026

It's Official! Beverley Callard joins Fair City

It's official! Beverley Callard has landed in Carrigstown. The beloved actor, known for her unforgettable roles and iconic screen presence, is joining the c...

13/01/2026

AGILE Against the Odds: Backing Innovative Income Streams for Independent Media

Independent media in Brazil and Colombia is facing an urgent crisis of traditional business models alongside a deteriorating security environment, according to ...

13/01/2026

NHL Situation Room 2.0: How Sony Hawk-Eye Powers Centralized Officiating, Player Safety, the League's Next Chapter

NHL Situation Room 2.0: How Sony Hawk-Eye Powers Centralized Officiating, Player...

13/01/2026

NBC Sports Ices the Audio for the 2026 Prevagen U.S. Figure Skating Championships

NBC Sports Ices the Audio for the 2026 Prevagen U.S. Figure Skating Championship...

13/01/2026

DMF and MXL in Practice: Which Vendors are Adopting it, and How Fast is the Ecosystem Maturing?

DMF and MXL in practice: Which vendors are adopting it, and how fast is the ecos...

13/01/2026

CES 2026: Five Important Sports-Tech Buzzwords

CES 2026: Five Important Sports-Tech BuzzwordsThe terms highlight innovations for sports production at the showBy Daniel Frankel, SVG Contributor Tuesday, Jan...

13/01/2026

For TGL Season 2, Unity 6 Boosts Virtual-Graphic Quality; COSM 360 Cameras Improve Hitting-Box Coverage

For TGL Season 2, Unity 6 Boosts Virtual-Graphic Quality; COSM 360 Cameras Impro...

13/01/2026

Resetting Expectations? The State of the Sports Industry with Devoncroft's Josh Stinehour

Resetting Expectations? The State of the Sports Industry with Devoncroft's J...

13/01/2026

2026 Sundance Film Festival Unveils Jury Members

Top Row L-R: Ana Katz, Natalia Almada, Bao Nguyen, Tatiana Maslany, A.V. Rockwell, Dr. Heather Berlin Second Row L-R: Sophie Barthes, Azazel Jacobs, Janicza Br...

13/01/2026

L3Harris Accelerates Arsenal of Freedom' with Creation of a New Missile Solutions Company

DoW to invest $1B in planned independently traded Missile Solutions business...

13/01/2026

L3Harris Chairman and CEO Joins Under Secretary of War in Interview on FOX Business

L3Harris Chairman and CEO Christopher Kubasik and Under Secretary of War for Acq...

13/01/2026

First Gulf Expands into U.S. Market with Launch of First Westlake Logistics Park

April 10, 2025 First Gulf has taken a significant step in its U.S. expansion with the launch of its first industrial development in the country. First Westla...

13/01/2026

SoftMoc Leases 145,600 Sq. Ft. at 901 Hopkins in Whitby

April 11, 2025 Canadian footwear retailer SoftMoc has signed a lease for 145,600 square feet at 901 Hopkins Street in Whitby, where the space will serve as a w...

13/01/2026

25 Ontario Reaches Key Milestone with Occupancy Permit

April 14, 2025 First Gulf is proud to announce that 25 Ontario has officially received its occupancy permit, marking the transition from an active construction...

13/01/2026

Sherwin-Williams Selects First Gulf for New 350,000 Sq. Ft. Facility in Barrie

April 28, 2025 First Gulf has been awarded a design-build lease for a new 350,000 square foot office and warehouse facility for Sherwin-Williams. This project ...

13/01/2026

First Gulf Expands U.S. Industrial Footprint with First Savannah Logistics Center

August 13, 2025 First Gulf Expands U.S. Industrial Footprint with First Savanna...

13/01/2026

First Gulf Secures Construction Management Services Contract for Toromonts New Corporate Campus in Vaughan

August 13, 2025 First Gulf is proud to partner with Toromont Industries Ltd. to...

13/01/2026

Fully Leased! 901 Hopkins Street in Whitby is Now 100% Occupied

October 10, 2025 First Gulf is pleased to announce that PPFD, a leading third-party logistics company, has leased 146,536 square feet at 901 Hopkins Street in ...

13/01/2026

Nielsen appoints Matty Lin as APAC regional sales leader

Singapore - January 13, 2026 - Nielsen today announced the appointment of Matty Lin to its Commercial Organization as APAC regional sales leader. Based in Sing...

13/01/2026

Techex Taps Tim Jackson for Senior U.S. Sales Role

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

13/01/2026

Madhavi Tadikonda Joins Scripps as Senior Director, Network Sales

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

13/01/2026

The COT Launches Live at Bay 7 at American Tobacco Campus

Nine-week performance series brings music, dance, theatre, and storytelling to downtown Durham, January - March 2026 (Durham, NC) The Chamber Orchestra of the T...

13/01/2026

Berklee Launches AIMS, an Artist-Centered Summit on Music and AI

Berklee Launches AIMS, an Artist-Centered Summit on Music and AI Hosted by the Berklee Emerging Artistic Technology Lab (BEATL), the event will focus on the i...

13/01/2026

Big Blue Marble Partners With HISPlayer For Immersive XR

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

13/01/2026

Netflix Drives Global Growth in Ad-Supported Streaming

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

13/01/2026

HBO Max Closes in on Complete European Rollout

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

13/01/2026

CES 2026 Attendance Hits 148,000

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

13/01/2026

Altafiber Asks FCC to Reconsider Nexstar Retrans Ruling

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

13/01/2026

Kiloview to Showcase Integrated AV-over-IP Ecosystem at ISE 2026

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...