LM Studio Accelerates LLM Performance With NVIDIA GeForce RTX GPUs and CUDA 12.8

08/05/2025

As AI use cases continue to expand - from document summarization to custom software agents - developers and enthusiasts are seeking faster, more flexible ways to run large language models (LLMs).

Running models locally on PCs with NVIDIA GeForce RTX GPUs enables high-performance inference, enhanced data privacy and full control over AI deployment and integration. Tools like LM Studio - free to try - make this possible, giving users an easy way to explore and build with LLMs on their own hardware.

LM Studio has become one of the most widely adopted tools for local LLM inference. Built on the high-performance llama.cpp runtime, the app allows models to run entirely offline and can also serve as OpenAI-compatible application programming interface (API) endpoints for integration into custom workflows.

The release of LM Studio 0.3.15 brings improved performance for RTX GPUs thanks to CUDA 12.8, significantly improving model load and response times. The update also introduces new developer-focused features, including enhanced tool use via the tool_choice parameter and a redesigned system prompt editor.

The latest improvements to LM Studio improve its performance and usability - delivering the highest throughput yet on RTX AI PCs. This means faster responses, snappier interactions and better tools for building and integrating AI locally.

Where Everyday Apps Meet AI Acceleration LM Studio is built for flexibility - suited for both casual experimentation or full integration into custom workflows. Users can interact with models through a desktop chat interface or enable developer mode to serve OpenAI-compatible API endpoints. This makes it easy to connect local LLMs to workflows in apps like VS Code or bespoke desktop agents.

For example, LM Studio can be integrated with Obsidian, a popular markdown-based knowledge management app. Using community-developed plug-ins like Text Generator and Smart Connections, users can generate content, summarize research and query their own notes - all powered by local LLMs running through LM Studio. These plug-ins connect directly to LM Studio's local server, enabling fast, private AI interactions without relying on the cloud.

Example of using LM Studio to generate notes accelerated by RTX. The 0.3.15 update adds new developer capabilities, including more granular control over tool use via the tool_choice parameter and an upgraded system prompt editor for handling longer or more complex prompts.

The tool_choice parameter lets developers control how models engage with external tools - whether by forcing a tool call, disabling it entirely or allowing the model to decide dynamically. This added flexibility is especially valuable for building structured interactions, retrieval-augmented generation (RAG) workflows or agent pipelines. Together, these updates enhance both experimentation and production use cases for developers building with LLMs.

LM Studio supports a broad range of open models - including Gemma, Llama 3, Mistral and Orca - and a variety of quantization formats, from 4-bit to full precision.

Common use cases span RAG, multi-turn chat with long context windows, document-based Q&A and local agent pipelines. And by using local inference servers powered by the NVIDIA RTX-accelerated llama.cpp software library, users on RTX AI PCs can integrate local LLMs with ease.

Whether optimizing for efficiency on a compact RTX-powered system or maximizing throughput on a high-performance desktop, LM Studio delivers full control, speed and privacy - all on RTX.

Experience Maximum Throughput on RTX GPUs At the core of LM Studio's acceleration is llama.cpp - an open-source runtime designed for efficient inference on consumer hardware. NVIDIA partnered with the LM Studio and llama.cpp communities to integrate several enhancements to maximize RTX GPU performance.

Key optimizations include:

CUDA graph enablement: Groups multiple GPU operations into a single CPU call, reducing CPU overhead and improving model throughput by up to 35%.

Flash attention CUDA kernels: Boosts throughput by up to 15% by improving how LLMs process attention - a critical operation in transformer models. This optimization enables longer context windows without increasing memory or compute requirements.

Support for the latest RTX architectures: LM Studio's update to CUDA 12.8 ensures compatibility with the full range of RTX AI PCs - from GeForce RTX 20 Series to NVIDIA Blackwell-class GPUs, giving users the flexibility to scale their local AI workflows from laptops to high-end desktops.

Data measured using different versions of LM Studio and CUDA backends on GeForce RTX 5080 on DeepSeek-R1-Distill-Llama-8B model. All configurations measured using Q4_K_M GGUF (Int4) quantization at BS=1, ISL=4000, OSL=200, with Flash Attention ON. Graph showcases 27% speedup with the latest version of LM Studio due to NVIDIA contributions to the llama.cpp inference backend. With a compatible driver, LM Studio automatically upgrades to the CUDA 12.8 runtime, enabling significantly faster model load times and higher overall performance.

These enhancements deliver smoother inference and faster response times across the full range of RTX AI PCs - from thin, light laptops to high-performance desktops and workstations.

Get Started With LM Studio LM Studio is free to download and runs on Windows, macOS and Linux. With the latest 0.3.15 release and ongoing optimizations, users can expect continued improvements in performance, customization and usability - making local AI faster, more flexible and more accessible.

Users can load a model through the desktop chat interface or enable developer mode to expose an OpenAI-compatible API.

To quickly get started, download the latest version of LM Studio and open up the application.

Click the magnifying glass icon on the left panel to open up the Discover

LINK:	https://blogs.nvidia.com/blog/rtx-ai-garage-lmstudio-llamacpp-blackwel...
	See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

07/10/2026

Dalet Flex LTS Delivers Smarter Media Operations from Ingest to Distribution

Dalet, a leading technology and service provider for media-rich organizations, today announced the latest Long-Term Supported (LTS) release of Dalet Flex. Build...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

01/08/2026

Harmonic Piano from The Crow Hill Company

Latest free Vaults instrument released The Crow Hill Company's Vaults collection offers a continually rotating selection of virtual instruments and samp...

01/08/2026

American Television Alliance Slams Deltavision Blackout on Verizon

Share Copy link Facebook X Linkedin Bluesky Email...

01/08/2026

FreeWheel Debuts TV Series-Level Reporting for CTV Buyers

Share Copy link Facebook X Linkedin Bluesky Email...

01/08/2026

Minnesota Timberwolves, DAZN to Launch New Streaming Offering

Share Copy link Facebook X Linkedin Bluesky Email...

01/08/2026

Early Registration Opens for 2026 SMPTE Media Technology Summit

Share Copy link Facebook X Linkedin Bluesky Email...

01/08/2026

NBCUniversal and YouTube Ink Landmark Distribution Deal

Share Copy link Facebook X Linkedin Bluesky Email...

01/08/2026

COW Jobs: Lead Generation Representative - Animation Services

COW Jobs: Lead Generation Representative - Animation Services Brie Clayton July 31, 2026 0 Comments Lead Generation Representative - Animation Service...

01/08/2026

Emilie Lowe Shoots Award Winning Short Film Where Dead Things Grow with Blackmagic Design

Emilie Lowe Shoots Award Winning Short Film Where Dead Things Grow with Blackmag...

01/08/2026

Early Registration Opens for the 2026 SMPTE Media Technology Summit

Early Registration Opens for the 2026 SMPTE Media Technology Summit Brie Clayton July 31, 2026 0 Comments Initial Program Announced Alongside New Medi...

31/07/2026

SVG Sit-Down: NHL's Steve Mayer on Building a Centralized Production Operation From Scratch

With an emphasis on local, the league is looking beyond the initial four teams t...

31/07/2026

Reflecting on the 2026 Summer Season of Our Sundance Institute Labs

The 2026 Sundance Institute Directors Lab fellows in Estes Park, Colorado (Photo by Gabe Rovick) Dear Friends, This has been an extraordinary year for our ar...

31/07/2026

Audiocube launch Audiocube Space

Offers 360-degree sound placement in any DAW Audiocube have just announced the launch of a new plug-in that embeds a complete 3D acoustic environment inside...

31/07/2026

Sonarworks expand SoundID VoiceAI library

10 new Chinese C-Pop voices introduced While Sonarworks are best known to many for their room-correction software, their product range also includes an inno...

31/07/2026

PSPaudioware release PSP BBDelay

Reimagines the vintage BBD sound character PSPaudioware have now officially launched the new delay plug-in that they were previewing at GearExpo UK. Said to...

31/07/2026

Savannah Bananas Hits a Broadcast Home Run with Calrec

The Savannah Bananas has built its reputation on refusing to play baseball by traditional rules, and the team's production arm Banana Ball TV (BTV) has draf...

31/07/2026

How Valuable Is TV to Smaller, Independent Cable Operators?

Share Copy link Facebook X Linkedin Bluesky Email...

31/07/2026

Early Registration Opens for the 2026 SMPTE Media Techno...

SMPTE, the home for media professionals, technologists and engineers, today opened early registration for the 2026 Media Technology Summit (MTS) and announced i...

31/07/2026

Central Christian Church Connects Remote Production Teams...

Clear-Com announced that Central Christian Church has transformed communication across its live production operations by implementing Clear-Com's LQ Seri...

31/07/2026

Haivision To Feature Makito ONE, Falkon X4 At IBC 2026

Share Copy link Facebook X Linkedin Bluesky Email...

31/07/2026

ABC Stations Tout Community Support for License Renewals

Share Copy link Facebook X Linkedin Bluesky Email...

31/07/2026

Finalists Announced for IBC2026 Innovation Awards

Share Copy link Facebook X Linkedin Bluesky Email...

31/07/2026

IBC2026 Innovation Awards finalists unveiled honouring co...

IBC has unveiled the finalists for the IBC2026 Innovation Awards, recognising collaborative projects from around the world that solve real-world challenges and ...

31/07/2026

From focus puller to colourist

From focus puller to colourist Caroline Shawley July 30, 2026 0 Comments Ana Mar a Ormaza shares her story of instinct, storytelling and finding a cre...

31/07/2026

Amazon Prime Documentary Andata e Ritorno Relies on Blackmagic Design

Amazon Prime Documentary Andata e Ritorno Relies on Blackmagic Design Brie Clayton July 30, 2026 0 Comments DaVinci Resolve Studio and Blackmagic Clou...

31/07/2026

Krotos Launches Video to Sound Plugin for DaVinci Resolve

Krotos Launches Video to Sound Plugin for DaVinci Resolve Brie Clayton July 30, 2026 0 Comments AI-assisted workflow helps editors add synchronized, p...

31/07/2026

Think summer of sport is over? Think again. More live sport. More big moments. More to come on RT

Athletics, Camogie, the Dublin Horse Show, the FAI Cup, Golf, Hockey, the UEFA C...

31/07/2026

Half-Year Report on SES's Liquidity Contract

Luxembourg, July 31, 2026 - Pursuant to the liquidity contract entered into by SES with BNP Paribas as of 7 April 2026, please see the below update on the progr...

31/07/2026

SES Publishes FY2025 EU Taxonomy Restatement

Luxembourg, July 31, 2026 - SES today published a restatement of its FY2025 EU Taxonomy (Article 8) disclosure. The restatement updates the FY2025 EU Taxonomy d...

31/07/2026

Mercedes-Benz Ireland returning as the official car partner of The Traitors Ireland

RT Commercial has today announced Mercedes-Benz Ireland will return as the offi...

31/07/2026

RT Supporting the Arts is supporting 15 cultural events nationwide this August

RT Supporting the Arts is delighted to spotlight a diverse range of arts, culture and heritage events taking place across Ireland this August. From major festi...

30/07/2026

CP Communications Provides Broadcast Support for 2026 MLB All-Star Week in Philadelphia

CP Communications provided RF audio, RF video, communications, RF coordination, ...

30/07/2026

OpenDrives Named to 2026 CRN Storage 100 List

OpenDrives has been included in CRN's 2026 Storage 100 list in the Software-Defined Storage category. The annual list, selected by the CRN editorial team, r...

30/07/2026

SES Selected by LATAM Airlines for Multi-Orbit Satellite Connectivity

LATAM Airlines has selected SES to provide multi-orbit inflight connectivity to its fleet of Airbus and Embraer aircraft. More than 60 aircraft - including Airb...

30/07/2026

Tagboard Launches Producer API with Elgato Stream Deck Integration

Tagboard has launched the Producer API, an open interface that allows key commands in Tagboard's live production environment to be triggered from external d...

30/07/2026

FOX Sports Adds Del Mar to Horse Racing Coverage in Multi-Year Deal

FOX Sports and the New York Racing Association (NYRA) have announced a multi-year agreement with Del Mar Thoroughbred Club that will make FOX Sports the exclusi...

30/07/2026

ABC Commercial Launches Four FAST Channels on LG Smart TVs

ABC Commercial has launched four free ad-supported streaming television (FAST) channels on LG Smart TVs across North America, Great Britain, and select countrie...

30/07/2026

Six Kings Slam Returns to Netflix in October with Sinner, Alcaraz, Djokovic

The Six Kings Slam exhibition tennis tournament will return to Riyadh on October 21, 22, and 24, streaming live on Netflix at no additional cost to subscribers....

30/07/2026

Central Christian Church Uses Clear-Coms Gen-IC to Connect Remote Production Teams

Central Christian Church in Mt. Vernon, Illinois has deployed Clear-Com's LQ...

30/07/2026

Blackmagic Design Releases UltraStudio Express 3G Capture and Playback Devices

Blackmagic Design has released the UltraStudio Express 3G family, a pair of USB4 capture and playback devices compatible with Mac, Windows, and Linux computers ...

30/07/2026

ESPN Secures Media Rights to Womens Pro Baseball League

ESPN has reached a media rights agreement with the Women's Pro Baseball League (WPBL), making ESPN the national streaming home of the league's 2026 seas...

30/07/2026

Most Valuable Promotions and Professional Fighters League Merge

Most Valuable Promotions (MVP) and the Professional Fighters League (PFL) have announced a merger that will operate under the MVP banner. PFL CEO John Martin wi...

30/07/2026

Columbus Blue Jackets To Simulcast TV and Radio Broadcasts Beginning 2026-27

The Columbus Blue Jackets will simulcast all game broadcasts across television and radio beginning with the 2026-27 NHL season. Under the new format, the televi...

30/07/2026

TMRW Sports Selects Populous To Design Professional Flag Football Stadium

TMRW Sports has selected Populous as architect for a purpose-built stadium for its professional flag football league, being developed in partnership with the NF...

30/07/2026

SiriusXM Launches Sports Pass Subscription at $5 per Month

SiriusXM has announced SiriusXM Sports Pass, a new subscription plan launching September 1 that bundles the company's sports audio programming into a single...

30/07/2026

SVG Cloud & Content Workflows Summit Focuses on Live Production, Content Management, AI

Leagues, broadcasters, and technologists gather to tackle cloud economics, distr...

30/07/2026

AWSs Jason Dvorkin on the Power of AWS Elemental Inference, Agentic AI in Live Sports Production

A major player in the cloud-based solutions, Amazon Web Services (AWS) has conti...

View most recent headlines