Sony Pixel Power calrec Sony

Run LLMs on AnythingLLM Faster With NVIDIA RTX AI PCs

29/05/2025

Large language models (LLMs), trained on datasets with billions of tokens, can generate high-quality content. They're the backbone for many of the most popular AI applications, including chatbots, assistants, code generators and much more.

One of today's most accessible ways to work with LLMs is with AnythingLLM, a desktop app built for enthusiasts who want an all-in-one, privacy-focused AI assistant directly on their PC.

With new support for NVIDIA NIM microservices on NVIDIA GeForce RTX and NVIDIA RTX PRO GPUs, AnythingLLM users can now get even faster performance for more responsive local AI workflows.

What Is AnythingLLM? AnythingLLM is an all-in-one AI application that lets users run local LLMs, retrieval-augmented generation (RAG) systems and agentic tools.

It acts as a bridge between a user's preferred LLMs and their data, and enables access to tools (called skills), making it easier and more efficient to use LLMs for specific tasks like:

Question answering: Getting answers to questions from top LLMs - like Llama and DeepSeek R1 - without incurring costs.

Personal data queries: Use RAG to query content privately, including PDFs, Word files, codebases and more.

Document summarization: Generating summaries of lengthy documents, like research papers.

Data analysis: Extracting data insights by loading files and querying it with LLMs.

Agentic actions: Dynamically researching content using local or remote resources, running generative tools and actions based on user prompts.

AnythingLLM can connect to a wide variety of open-source local LLMs, as well as larger LLMs in the cloud, including those provided by OpenAI, Microsoft and Anthropic. In addition, the application provides access to skills for extending its agentic AI capabilities via its community hub.

With a one-click install and the ability to launch as a standalone app or browser extension - wrapped in an intuitive experience with no complicated setup required - AnythingLLM is a great option for AI enthusiasts, especially those with GeForce RTX and NVIDIA RTX PRO GPU-equipped systems.

RTX Powers AnythingLLM Acceleration GeForce RTX and NVIDIA RTX PRO GPUs offer significant performance gains for running LLMs and agents in AnythingLLM - speeding up inference with Tensor Cores designed to accelerate AI.

AnythingLLM runs LLMs with Ollama for on-device execution accelerated through Llama.cpp and ggml tensor libraries for machine learning.

Ollama, Llama.cpp and GGML are optimized for NVIDIA RTX GPUs and the fifth-generation Tensor Cores. Performance on GeForce RTX 5090 is 2.4X compared to an Apple M3 Ultra.

GeForce RTX 5090 delivers 2.4x faster LLM inference in AnythingLLM than Apple M3 Ultra on both Llama 3.1 8B and DeepSeek R1 8B. As NVIDIA adds new NIM microservices and reference workflows - like its growing library of AI Blueprints - tools like AnythingLLM will unlock even more multimodal AI use cases.

AnythingLLM - Now With NVIDIA NIM AnythingLLM recently added support for NVIDIA NIM microservices - performance-optimized, prepackaged generative AI models that make it easy to get started with AI workflows on RTX AI PCs with a streamlined API.

NVIDIA NIMs are great for developers looking for a quick way to test a Generative AI model in a workflow. Instead of having to find the right model, download all the files and figure out how to connect everything, they provide a single container that has everything you need. And they can run both on Cloud and PC, making it easy to prototype locally and then deploy on the cloud.

By offering them within AnythingLLM's user-friendly UI, users have a quick way to test them and experiment with them. And then they can either connect them to their workflows with AnythingLLM, or leverage NVIDIA AI Blueprints and NIM documentation and sample code to plug them directly to their apps or projects.

Explore the wide variety of NIM microservices available to elevate AI-powered workflows, including language and image generation, computer vision and speech processing.

Each week, the RTX AI Garage blog series features community-driven AI innovations and content for those looking to learn more about NIM microservices and AI Blueprints, as well as building AI agents, creative workflows, digital humans, productivity apps and more on AI PCs and workstations.

Plug in to NVIDIA AI PC on Facebook, Instagram, TikTok and X - and stay informed by subscribing to the RTX AI PC newsletter.

Follow NVIDIA Workstation on LinkedIn and X. See notice regarding software product information.
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-anythingllm-nim/...
See more stories from nvidia

More from Nvidia

29/05/2025

The Supercomputer Designed to Accelerate Nobel-Worthy Science

Ready for a front-row seat to the next scientific revolution? That's the idea behind Doudna - a groundbreaking supercomputer announced today at Lawrence Be...

29/05/2025

Run LLMs on AnythingLLM Faster With NVIDIA RTX AI PCs

Large language models (LLMs), trained on datasets with billions of tokens, can generate high-quality content. They're the backbone for many of the most popu...

29/05/2025

RTX on Deck: The GeForce NOW Native App for Steam Deck Is Here

GeForce NOW is supercharging Valve's Steam Deck with a new native app - delivering the high-quality GeForce RTX-powered gameplay members are used to on a po...

28/05/2025

NVIDIA's Bartley Richardson on How Teams of AI Agents Provide Next-Level Automation

Building effective agentic AI systems requires rethinking how technology interac...

27/05/2025

How Dell Technologies Is Building the Engines of AI Factories With NVIDIA Blackwell

Over a century ago, Henry Ford pioneered the mass production of cars and engines...

27/05/2025

NVIDIA and Google Partnership Gains Momentum With the Latest Blackwell and Gemini Announcements

NVIDIA and Google share a long-standing relationship rooted in advancing AI inno...

22/05/2025

Sale Into Summer With 40% Off GeForce NOW Six-Month Performance Memberships

GeForce NOW is turning up the heat this summer with a hot new deal. For a limited time, save 40% on six-month Performance memberships and enjoy premium GeForce ...

21/05/2025

NVIDIA and SAP Bring AI Agents to the Physical World

As robots increasingly make their way to the largest enterprises' manufacturing plants and warehouses, the need for access to critical business and operatio...

20/05/2025

Siemens Makes Factory Floors Smarter With Industrial AI

Industrial AI is transforming how factories operate, innovate and scale. The convergence of AI, simulation and digital twins is poised to unlock new levels of ...

19/05/2025

NVIDIA and Microsoft Accelerate Agentic AI Innovation, From Cloud to PC

Agentic AI is redefining scientific discovery and unlocking research breakthroughs and innovations across industries. Through deepened collaboration, NVIDIA and...

19/05/2025

NVIDIA Research Breakthroughs Put Advanced Robots in Motion

Across robot training and development, NVIDIA Research is uncovering breakthroughs in areas such as multimodal generative AI and synthetic data generation. The...

19/05/2025

NVIDIA and Microsoft Advance Development on RTX AI PCs

Generative AI is transforming PC software into breakthrough experiences - from digital humans to writing assistants, intelligent agents and creative tools. NVI...

18/05/2025

NVIDIA CEO Envisions AI Infrastructure Industry Worth Trillions of Dollars'

Electricity. The Internet. Now it's time for another major technology, AI, to sweep the globe. NVIDIA founder and CEO Jensen Huang took the stage at a pack...

18/05/2025

NVIDIA Expands Omniverse Blueprint for AI Factory Digital Twins With New Ecosystem Integrations, Development Tools

Empowering engineering teams with more tools for building AI factories, NVIDIA t...

18/05/2025

AI Blueprint for Video Search and Summarization Now Available to Deploy Video Analytics AI Agents Across Industries

The age of video analytics AI agents is here. Video is one of the defining feat...

18/05/2025

Semiconductor Industry Accelerates Design Manufacturing With NVIDIA Blackwell and CUDA-X

TSMC, Cadence, KLA, Siemens and Synopsys are advancing semiconductor manufacturi...

18/05/2025

NVIDIA Grace CPU C1 Gains Broad Support in Edge, Telco and Storage

NVIDIA is highlighting significant momentum for its new Grace CPU C1 this week at the COMPUTEX trade show in Taipei, with a strong showing of support from key o...

18/05/2025

NVIDIA-Powered Supercomputer to Enable Quantum Leap for Taiwan Research

Researchers across Taiwan are tackling complex challenges in AI development, climate science and quantum computing. Their work will soon be boosted by a new sup...

18/05/2025

That's One Smart Hospital! Taiwan Medical Centers Deploy Life-Saving Innovations With NVIDIA System-Builder Partners

Leading healthcare organizations across the globe are using agentic AI, robotics...

18/05/2025

NVIDIA Grows Quantum Computing Ecosystem With Taiwan Manufacturers and Supercomputing

Quantum computing promises to shorten the path to solving some of the world'...

15/05/2025

Into the Omniverse: Computational Fluid Dynamics Simulation Finds Smoothest Flow With AI-Driven Digital Twins

Editor's note: This post is part of Into the Omniverse, a series focused on ...

15/05/2025

Exploring the Revenue-Generating Potential of AI Factories

AI is creating value for everyone - from researchers in drug discovery to quantitative analysts navigating financial market changes. The faster an AI system ca...

15/05/2025

Time to Slay: DOOM: The Dark Ages' Looms on GeForce NOW

Steel clashes and war drums thunder as a new age of battle dawns - one that will test even the mightiest Slayer. This GFN Thursday, DOOM: The Dark Ages - the b...

14/05/2025

Visa Makes Payments Personalized and Secure With AI

Think tap to pay - but smarter and safer. Visa is tapping into AI to enhance services for its global network of customers, focused on fraud prevention, personal...

14/05/2025

Press Play on Don Diablo's Music Video - Created With NVIDIA RTX-Powered Generative AI

Electronic music icon Don Diablo is known for pushing the boundaries of music, v...

13/05/2025

NVIDIA Partners Showcase Cutting-Edge Robotic and Industrial AI Solutions at Automate 2025

As the manufacturing industry faces challenges - such as labor shortages, reshor...

13/05/2025

How Reasoning AI Agents Transform High-Stakes Decision Making

Editor's note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copi...

12/05/2025

NVIDIA Scores COMPUTEX Best Choice Awards

NVIDIA today received multiple accolades at COMPUTEX's Best Choice Awards, in recognition of innovation across the company. The NVIDIA GeForce RTX 5090 GPU...

08/05/2025

Wildfire Prevention: AI Startups Support Prescribed Burns, Early Alerts

Artificial intelligence is helping identify and treat diseases faster with better results for humankind. Natural disasters like wildfires are next. Fires in th...

08/05/2025

Join the Family: GeForce NOW Welcomes 2K's Acclaimed Mafia' Franchise to the Cloud

Calling all wiseguys - 2K's acclaimed Mafia franchise is available to stream...

08/05/2025

LM Studio Accelerates LLM Performance With NVIDIA GeForce RTX GPUs and CUDA 12.8

As AI use cases continue to expand - from document summarization to custom software agents - developers and enthusiasts are seeking faster, more flexible ways t...

07/05/2025

Cadence Taps NVIDIA Blackwell to Accelerate AI-Driven Engineering Design and Scientific Simulation

A new supercomputer offered by Cadence, a leading provider of technology for ele...

07/05/2025

NVIDIA's Rama Akkiraju on How AI Platform Architects Help Bridge Business Vision and Technical Execution

Enterprises across industries are exploring AI to rethink problem-solving and re...

02/05/2025

NVIDIA Experts Share Top 5 Tips for Standing Out in the AI Job Market

With graduation season approaching, a new cohort of students is embarking on next steps, aiming to use their passions and skills to make a real, tangible impact...

01/05/2025

May the Cloud Be With You: GeForce NOW Unveils 21 New Games This Month

May brings more than just rainbows and sunshine - it's also time for fresh adventures and epic battles. This GFN Thursday spotlights 20 can't-miss games...

01/05/2025

Wandercraft Begins Clinical Trials for Physical AI-Powered Personal Exoskeleton

For Nicolas Simon, advancing the field of robotics is a personal mission that could change his siblings' lives. Two-thirds of Simon's family members us...

30/04/2025

From Kitchen to Drive-Thru: How Yum! Brands Is Accelerating Restaurant Innovation With AI

The quick-service restaurant (QSR) industry is being reinvented by AI. For exam...

30/04/2025

Control the Composition of AI-Generated Images With the NVIDIA AI Blueprint for 3D-Guided Generative AI

AI-powered image generation has progressed at a remarkable pace - from early exa...

28/04/2025

NVIDIA Brings Cybersecurity to Every AI Factory

As enterprises increasingly adopt AI, securing AI factories - where complex, agentic workflows are executed - has never been more critical. NVIDIA is bringing ...

28/04/2025

How Agentic AI Enables the Next Leap in Cybersecurity

Agentic AI is redefining the cybersecurity landscape - introducing new opportunities that demand rethinking how to secure AI while offering the keys to addressi...

28/04/2025

Oracle Cloud Infrastructure Deploys Thousands of NVIDIA Blackwell GPUs for Agentic AI and Reasoning Models

Oracle has stood up and optimized its first wave of liquid-cooled NVIDIA GB200 N...

24/04/2025

NVIDIA Research at ICLR - Pioneering the Next Wave of Multimodal Generative AI

Advancing AI requires a full-stack approach, with a powerful foundation of computing infrastructure - including accelerated processors and networking technologi...

24/04/2025

All Roads Lead Back to Oblivion: Bethesda's The Elder Scrolls IV: Oblivion Remastered' Arrives on GeForce NOW

Get the controllers ready and clear the calendar - it's a jam-packed GFN Thu...

23/04/2025

How the Economics of Inference Can Maximize AI Value

As AI models evolve and adoption grows, enterprises must perform a delicate balancing act to achieve maximum value. That's because inference - the process ...

23/04/2025

Capital One Banks on AI for Financial Services

Financial services has long been at the forefront of adopting technological innovations. Today, generative AI and agentic systems are redefining the industry, f...

23/04/2025

Project G-Assist Plug-In Builder Lets Anyone Customize AI on GeForce RTX AI PCs

AI is rapidly reshaping what's possible on a PC - whether for real-time image generation or voice-controlled workflows. As AI capabilities grow, so does the...

23/04/2025

Enterprises Onboard AI Teammates Faster With NVIDIA NeMo Tools to Scale Employee Productivity

An AI agent is only as accurate, relevant and timely as the data that powers it....

22/04/2025

Keeping AI on the Planet: NVIDIA Technologies Make Every Day About Earth Day

Whether at sea, land or in the sky - even outer space - NVIDIA technology is helping research scientists and developers alike explore and understand oceans, wil...

22/04/2025

Chill Factor: NVIDIA Blackwell Platform Boosts Water Efficiency by Over 300x

Traditionally, data centers have relied on air cooling - where mechanical chillers circulate chilled air to absorb heat from servers, helping them maintain opti...

22/04/2025

Making Brain Waves: AI Startup Speeds Disease Research With Lab in the Loop

About 15% of the world's population - over a billion people - are affected by neurological disorders, from commonly known diseases like Alzheimer's and ...