Sony Pixel Power calrec Sony

Run LLMs on AnythingLLM Faster With NVIDIA RTX AI PCs

29/05/2025

Large language models (LLMs), trained on datasets with billions of tokens, can generate high-quality content. They're the backbone for many of the most popular AI applications, including chatbots, assistants, code generators and much more.

One of today's most accessible ways to work with LLMs is with AnythingLLM, a desktop app built for enthusiasts who want an all-in-one, privacy-focused AI assistant directly on their PC.

With new support for NVIDIA NIM microservices on NVIDIA GeForce RTX and NVIDIA RTX PRO GPUs, AnythingLLM users can now get even faster performance for more responsive local AI workflows.

What Is AnythingLLM? AnythingLLM is an all-in-one AI application that lets users run local LLMs, retrieval-augmented generation (RAG) systems and agentic tools.

It acts as a bridge between a user's preferred LLMs and their data, and enables access to tools (called skills), making it easier and more efficient to use LLMs for specific tasks like:

Question answering: Getting answers to questions from top LLMs - like Llama and DeepSeek R1 - without incurring costs.

Personal data queries: Use RAG to query content privately, including PDFs, Word files, codebases and more.

Document summarization: Generating summaries of lengthy documents, like research papers.

Data analysis: Extracting data insights by loading files and querying it with LLMs.

Agentic actions: Dynamically researching content using local or remote resources, running generative tools and actions based on user prompts.

AnythingLLM can connect to a wide variety of open-source local LLMs, as well as larger LLMs in the cloud, including those provided by OpenAI, Microsoft and Anthropic. In addition, the application provides access to skills for extending its agentic AI capabilities via its community hub.

With a one-click install and the ability to launch as a standalone app or browser extension - wrapped in an intuitive experience with no complicated setup required - AnythingLLM is a great option for AI enthusiasts, especially those with GeForce RTX and NVIDIA RTX PRO GPU-equipped systems.

RTX Powers AnythingLLM Acceleration GeForce RTX and NVIDIA RTX PRO GPUs offer significant performance gains for running LLMs and agents in AnythingLLM - speeding up inference with Tensor Cores designed to accelerate AI.

AnythingLLM runs LLMs with Ollama for on-device execution accelerated through Llama.cpp and ggml tensor libraries for machine learning.

Ollama, Llama.cpp and GGML are optimized for NVIDIA RTX GPUs and the fifth-generation Tensor Cores. Performance on GeForce RTX 5090 is 2.4X compared to an Apple M3 Ultra.

GeForce RTX 5090 delivers 2.4x faster LLM inference in AnythingLLM than Apple M3 Ultra on both Llama 3.1 8B and DeepSeek R1 8B. As NVIDIA adds new NIM microservices and reference workflows - like its growing library of AI Blueprints - tools like AnythingLLM will unlock even more multimodal AI use cases.

AnythingLLM - Now With NVIDIA NIM AnythingLLM recently added support for NVIDIA NIM microservices - performance-optimized, prepackaged generative AI models that make it easy to get started with AI workflows on RTX AI PCs with a streamlined API.

NVIDIA NIMs are great for developers looking for a quick way to test a Generative AI model in a workflow. Instead of having to find the right model, download all the files and figure out how to connect everything, they provide a single container that has everything you need. And they can run both on Cloud and PC, making it easy to prototype locally and then deploy on the cloud.

By offering them within AnythingLLM's user-friendly UI, users have a quick way to test them and experiment with them. And then they can either connect them to their workflows with AnythingLLM, or leverage NVIDIA AI Blueprints and NIM documentation and sample code to plug them directly to their apps or projects.

Explore the wide variety of NIM microservices available to elevate AI-powered workflows, including language and image generation, computer vision and speech processing.

Each week, the RTX AI Garage blog series features community-driven AI innovations and content for those looking to learn more about NIM microservices and AI Blueprints, as well as building AI agents, creative workflows, digital humans, productivity apps and more on AI PCs and workstations.

Plug in to NVIDIA AI PC on Facebook, Instagram, TikTok and X - and stay informed by subscribing to the RTX AI PC newsletter.

Follow NVIDIA Workstation on LinkedIn and X. See notice regarding software product information.
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-anythingllm-nim/...
See more stories from nvidia

North America Stories

01/06/2025

IPMX in a Variety of Form Factors Cobalt Digital Has it...

Cobalt Digital, the leading designer and manufacturer of award-winning signal processing products, and a founding partner in the openGear initiative will highl...

01/06/2025

Richard Meredith Selects DPA for Challenging Location Sou...

The Piano, which just wrapped its third season, is an acclaimed UK reality competition that invites undiscovered, amateur pianists to perform emotionally charg...

01/06/2025

Optoma Launches Photon Go - Mobile Triple-Laser Projector...

Optoma, a provider of professional visual solutions, will launch its new Photon Go projector on May 28 2025. The compact device combines RGB triple-laser tech...

01/06/2025

CINEFLARES PRO Reaches Milestone of 100 Lens Sets Profile...

CINEFLARES LENS LAB, the unique interactive lens flare library from Markus F rderer ASC, BVK, has completed profiling 100 of the industry's top lens familie...

01/06/2025

Hippotizer MX Series Packs a Punch for WrestleMania

Green Hippo's new Hippotizer MX Series Media Servers were out in force for this year's WWE WrestleMania spectacular, pumping out 4K live action and pre-...

01/06/2025

Glensound Introduces Expanded Range of Audio Solutions at...

Glensound, a leader in high-quality audio systems, will highlight new additions to its range of intercom, commentary, and eSports audio solutions at InfoComm 20...

01/06/2025

Texas A and M Elevates Athletics Production with Grass Va...

Grass Valley, the media and entertainment industry's leading technology innovator, today announced Texas A&M University has selected its IP-enabled live pro...

01/06/2025

Kiloview Wins Asia-Pacific Broadcasting Award 2025 for Em...

Kiloview has been honored with the Live Event Streaming China award at the 2025 Asia-Pacific Broadcasting+ Awards for its outstanding contribution to the 71st...

01/06/2025

Veset Enables Telekom Malaysias Full Scale Cloud Migratio...

Cloud playout solutions provider, Veset, has announced its partnership with Telekom Malaysia (TM) to successfully migrate ten of its regional TV channels to the...

01/06/2025

MRMC Gears Up for Back-to-Back Industry Showcases at Cine...

Mark Roberts Motion Control (MRMC), a Nikon company, is set for a busy June, showcasing its cutting-edge camera robotics and motion control solutions at two sig...

01/06/2025

Dubformer and OOONA Announce Strategic Partnership to Str...

Dubformer, a global leader in AI dubbing, and OOONA, the media localization sector's language technology platform of choice, are pleased to announce a strat...

01/06/2025

NHRA Selects Zixi for Ultra-Low Latency Live Video Delive...

Zixi, the industry leader in enabling live broadcast-quality video over any IP network, today announced that the National Hot Rod Association (NHRA), the premie...

31/05/2025

ENCO to Show BYOD Translation, Captioning for Live Events at InfoComm

NOVI, Mich. ENCO said it will showcase new AI innovations, including two automated captioning and real-time translation solutions at InfoComm in June....

31/05/2025

Fubo Launches Programmatic Pause Ads on CTV, a First

NEW YORK Fubo has launched programmatic pause ads, a development that makes it the first connected TV (CTV) platform to offer this ad format in a programmatic b...

31/05/2025

BEAM Digital TV Network Expands to 9 Philippine Regions

RAYMOND, Maine Dielectric and Philippines partner 90 Degrees North said they have been working to extend the digital television service for BEAM (Broadcast Ente...

31/05/2025

EvertzAV Debuts MMA25G IPMX-Ready Gateways at InfoComm 2025

BURLINGTON, Canada EvertzAV, a division of Evertz Microsystems, has announced that it will unveil a new family of MMA25G IPMX-Ready Gateways at InfoComm 2025, s...

31/05/2025

Migrate Sound Uses DaVinci Resolve Studio for Audio Post

Migrate Sound Uses DaVinci Resolve Studio for Audio Post Brie Clayton May 30, 2025 0 Comments Hit FOX TV show promos created with DaVinci Resolve Stud...

31/05/2025

COW Job Post: After Effects Pro for Simple Rig, Remote

COW Job Post: After Effects Pro for Simple Rig, Remote Brie Clayton May 30, 2025 0 Comments After Effects Pro for Simple Rig May 16, 2025COW Jobs: H...

30/05/2025

American Underground Returns to American Tobacco Campus

American Underground Returns to American Tobacco Campus A full-circle homecoming for Durham's leading coworking community Durham, N.C. (May 29, 2025) - In ...

30/05/2025

Texas A&M Selects Grass Valley for IP-Based Athletics Production

COLLEGE STATION, Texas Texas A&M University here has selected Grass Valley's IP-enabled live production solutions to upgrade its athletics coverage workflow...

30/05/2025

Trinity Capital Invests in Atmosphere TV

PHOENIX Trinity Capital has announced a commitment to provide $62.7 million in capital to Atmosphere TV, a streaming platform that offers more than 30 original ...

30/05/2025

How Musicians Can (and Should) Use AI-According to Berklee Experts

How Musicians Can (and Should) Use AI-According to Berklee Experts At Berklee's first AI and Music Innovation event, artists and educators explored how ar...

30/05/2025

The Most Underrated Hip-Hop Producers, According to J. Rawls

The Most Underrated Hip-Hop Producers, According to J. Rawls The producer whos worked with Mos Def, Talib Kweli, the Roots, and Common, joined a panel on camp...

30/05/2025

Unlocking the Power of Integrated Ad Tech in a Converged Media World

By Brian Thoman, WideOrbit Chief Technology Officer...

30/05/2025

Tribeca Festival 2025 Announces Creator Economy Programming Lineup

May 30th, 2025 Tribeca Festival 2025 Announces Creator Economy Programming Lineup Tribeca Festival Elevates Digital Creators with Red Carpet, Big-Screen Debut...

30/05/2025

17th-Annual SVG College Summit Unites Video-Production Community as Industry Undergoes Seismic Change

17th-Annual SVG College Summit Unites Video-Production Community as Industry Und...

30/05/2025

UEFA Champions League Final 2025: CBS Sports To Produce Pitchside Studio Show, Beckham & Friends' Altcast in Munich

UEFA Champions League Final 2025: CBS Sports To Produce Pitchside Studio Show, ...

30/05/2025

University of Wyoming's Dennis Trapani on a Busy Year of Live Event Production

University of Wyoming's Dennis Trapani on a Busy Year of Live Event Producti...

30/05/2025

Women's College World Series: ESPN Enhances Softball Coverage With Expanded Camera Plan, New POV and Aerial Tech

Women's College World Series: ESPN Enhances Softball Coverage With Expanded ...

30/05/2025

Netflix and Embratur Announce Cooperation Agreement to Boost Audiovisual Tourism in Brazil

Back to All News Netflix and the Brazilian Tourism Board Announce Cooperation A...

30/05/2025

May 29, 2025

AI pinpoints new anti-aging drug candidates More than 70% of the drugs identified by artificial intelligence extended the lifespan of C. elegans worms. May 29,...

29/05/2025

From Fellow to Advisor: Erica Tremblay at Sundance Institute's Native Lab

Director Erica Tremblay on the set of Little Chief' during the 2018 Native Filmmakers Lab. Photo by Tytianna Harris for Sundance Institute...

29/05/2025

We're Listening!

The media and entertainment landscape is changing rapidly. Emerging technologies like AI and cloud workflows are transforming post production, while economic sh...

29/05/2025

Don't Sell Out the Power of Sound

Don't Sell Out the Power of Sound George Lucas famously said that sound and music are 50 percent of the entertainment in a movie and while the exact figure...

29/05/2025

L3Harris Receives Contract to Develop Next-Generation Security Processor for US Government

L3Harris receives a contract to develop a next-generation security processor to ...

29/05/2025

FOX Television Stations and Nielsen Extend Multi-Year Measurement Agreement, Including Advanced Audiences

Renewal Spans all 18 Local Fox Markets and Includes Streaming Measurement of Loc...

29/05/2025

Amazon Prime Video, Disney+ and Netflix are now home to 92% of sports programs on top SVOD services

Leading streamers expanded TV, movie and sports titles by 5% in Q2; Netflix led ...

29/05/2025

Gomez Criticizes FCC Delay in Implementing Multilingual WEA

CARSON, Calif. Rep. Nanette Barrag n (D-Calif.) joined Federal Communications Commission member Anna Gomez and Carson City Mayor Lula Davis-Holmes on Tuesday to...

29/05/2025

Fox Television Stations, Nielsen Renew Measurement Pact

NEW YORK Nielsen said it struck a multiyear renewal agreement with Fox Television Stations for measurement across 18 of Fox's owned-and-operated local stati...

29/05/2025

Telemundo Launches Caso Cerrado' FAST Channel

MIAMI NBCUniversal Telemundo Enterprises has launched the Caso Cerrado FAST channel, a 24/7 streaming destination featuring more than 800 hours of the iconic ...

29/05/2025

Media Prima moves to a new broadcast centre with Pebble r...

Pebble, the leading automation, content management and integrated channel specialist, has completed a large installation in Kuala Lumpur, Malaysia, as Media Pri...

29/05/2025

Tintri VMstore T7080 Named in DCIG 2025-26 TOP 5 Cybersec...

Tintri , a DDN subsidiary, and leading provider of the world's only workload-aware, AI-powered data management solutions, today announced that its high-per...

29/05/2025

Tedial, Moments Lab Partner to Add AI Indexing to EVO MAM

MALAGA, Spain Media integration and asset management solutions provider Tedial has formed a strategic partnership with Moments Lab, a provider of AI-based video...

29/05/2025

Sony Introduces FX2 Compact Camera

Sony announced the latest addition to its Cinema Line family, the FX2. The camera, designed as an entry-level product in the broader Cinema Line range, will be ...

29/05/2025

Yahoo DSP Integrates Comscore's ID-Free Audience Targeting Solution

RESTON, Va. Comscore has announced that its AI-powered ID-free audiences offering has been integrated into Yahoo DSP, adding to the existing suite of targeting ...

29/05/2025

Atomos Bows StudioSonic Shotgun Mic

SINGAPORE Atomos introduced its StudioSonic shotgun mic for filmmakers, journalists, YouTubers and production professionals at Broadcast Asia here, May 27-29 at...

29/05/2025

UEFA Champions League Final 2025: DAZN on the Goals For its Unilateral Platinum Production for PSG vs Inter

UEFA Champions League Final 2025: DAZN on the goals for its unilateral platinum ...

29/05/2025

UEFA Champions League Final 2025: Host Broadcaster DAZN Reveals the Plan for This Hotly Contested Visual Feast

UEFA Champions League Final 2025: Host broadcaster DAZN reveals the plan for thi...

29/05/2025

Alex de la Iglesia, Carmen Maura and Blanca Suarez Together in a New Project for Netflix

Back to All News Alex de la Iglesia, Carmen Maura and Blanca Suarez Together in...

29/05/2025

The Supercomputer Designed to Accelerate Nobel-Worthy Science

Ready for a front-row seat to the next scientific revolution? That's the idea behind Doudna - a groundbreaking supercomputer announced today at Lawrence Be...