
Editor's note: This post is part of our AI Decoded series, which aims to demystify AI by making the technology more accessible, while showcasing new hardware, software, tools and accelerations for RTX PC and workstation users.
If AI is having its iPhone moment, then chatbots are one of its first popular apps.
They're made possible thanks to large language models, deep learning algorithms pretrained on massive datasets - as expansive as the internet itself - that can recognize, summarize, translate, predict and generate text and other forms of content. They can run locally on PCs and workstations powered by NVIDIA GeForce and RTX GPUs.
LLMs excel at summarizing large volumes of text, classifying and mining data for insights, and generating new text in a user-specified style, tone or format. They can facilitate communication in any language, even beyond ones spoken by humans, such as computer code or protein and genetic sequences.
While the first LLMs dealt solely with text, later iterations were trained on other types of data. These multimodal LLMs can recognize and generate images, audio, videos and other content forms.
Chatbots like ChatGPT were among the first to bring LLMs to a consumer audience, with a familiar interface built to converse with and respond to natural-language prompts. LLMs have since been used to help developers write code and scientists to drive drug discovery and vaccine development.
But the AI models that power those functions are computationally intensive. Combining advanced optimization techniques and algorithms like quantization with RTX GPUs, which are purpose-built for AI, helps make LLMs compact enough and PCs powerful enough to run locally - no internet connection required. And a new breed of lightweight LLMs like Mistral - one of the LLMs powering Chat with RTX - sets the stage for state-of-the-art performance with lower power and storage demands.
Why Do LLMs Matter? LLMs can be adapted for a wide range of use cases, industries and workflows. This versatility, combined with their high-speed performance, offers performance and efficiency gains across virtually all language-based tasks.
DeepL, running on NVIDIA GPUs in the cloud, uses advanced AI to provide accurate text translations. LLMs are widely used in language translation apps such as DeepL, which uses AI and machine learning to provide accurate outputs.
Medical researchers are training LLMs on textbooks and other medical data to enhance patient care. Retailers are leveraging LLM-powered chatbots to deliver stellar customer support experiences. Financial analysts are tapping LLMs to transcribe and summarize earning calls and other important meetings. And that's just the tip of the iceberg.
Chatbots - like Chat with RTX - and writing assistants built atop LLMs are making their mark on every facet of knowledge work, from content marketing and copywriting to legal operations. Coding assistants were among the first LLM-powered applications to point toward the AI-assisted future of software development. Now, projects like ChatDev are combining LLMs with AI agents - smart bots that act autonomously to help answer questions or perform digital tasks - to spin up an on-demand, virtual software company. Just tell the system what kind of app is needed and watch it get to work.
Learn more about LLM agents on the NVIDIA developer blog.
Easy as Striking Up a Conversation Many people's first encounter with generative AI came by way of a chatbot such as ChatGPT, which simplifies the use of LLMs through natural language, making user action as simple as telling the model what to do.
LLM-powered chatbots can help generate a draft of marketing copy, offer ideas for a vacation, craft an email to customer service and even spin up original poetry.
Advances in image generation and multimodal LLMs have extended the chatbot's realm to include analyzing and generating imagery - all while maintaining the wonderfully simple user experience. Just describe an image to the bot or upload a photo and ask the system to analyze it. It's chatting, but now with visual aids.
For more on how these bots are designed, check out the on-demand webinar on Building Intelligent AI Chatbots Using RAG.
Future advancements will help LLMs expand their capacity for logic, reasoning, math and more, giving them the ability to break complex requests into smaller subtasks.
Progress is also being made on AI agents, applications capable of taking a complex prompt, breaking it into smaller ones, and engaging autonomously with LLMs and other AI systems to complete them. ChatDev is an example of an AI agent framework, but agents aren't limited to technical tasks.
For example, users could ask a personal AI travel agent to book a family vacation abroad. The agent would break that task into subtasks - itinerary planning, booking travel and lodging, creating packing lists, finding a dog walker - and independently execute them in order.
Unlock Personal Data With RAG As powerful as LLMs and chatbots are for general use, they can become even more helpful when combined with an individual user's data. By doing so, they can help analyze email inboxes to uncover trends, comb through dense user manuals to find the answer to a technical question about some hardware, or summarize years of bank and credit card statements.
Retrieval-augmented generation, or RAG, is one of the easiest and most effective ways to hone LLMs for a particular dataset.
An example of RAG on a PC. RAG enhances the accuracy and reliability of generative AI models with facts fetched from external sources. By connecting an LLM with practically any external resource, RAG lets users chat with data repositories while also giving the LLM the ability to cite its sources. The user experience is as simple as pointing the chatbot toward a file or directory.
For
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
06/04/2026
Michigan legends bring a new voice to the broadcast as TNT Sports and CBS Sports...
06/04/2026
From high school sports all the way up to the major leagues, building high-quali...
06/04/2026
Quickplay, an AI company for the media and entertainment industry, has been accepted into the Advanced tier of the TwelveLabs Ecosystem Partner Program. Quickpl...
06/04/2026
Grass Valley has announced the Future Playmakers Program, a global initiative to...
06/04/2026
El l der de operaciones impulsa la producci n en estudio mientras encuentra insp...
06/04/2026
The ops leader helps lead the charge in studio for the Spanish-language broadcas...
06/04/2026
Behind The Mic provides a roundup of recent news regarding on-air talent, includ...
06/04/2026
The National Hockey League (NHL), in partnership with Verizon and the New Jersey Devils, today announced the opening of the NHL Innovation Lab powered by Verizo...
06/04/2026
Rock League, a new professional curling league, has announced that ESPN+ will stream its inaugural 2026 season for fans in the United States. The first Rock Lea...
06/04/2026
Advanced Systems Group has announced the appointment of Andrea (Andy) Cummis as Vice President of Systems Design and Engineering. In this role, she will lead de...
06/04/2026
Backed by Bolt Ventures, the venture brings Bryson DeChambeau, Grant Horvat, and...
06/04/2026
With this environment we can start that collaboration even earlier because we ca...
06/04/2026
Like the immortal lives of vampires, some stories never really end. That's t...
06/04/2026
As podcasting continues to evolve, growth increasingly means building beyond aud...
06/04/2026
Multiband dynamics plug-in enhanced
California-based developer FSK Audio have released a significant update for their innovative multiband dynamics processo...
06/04/2026
Share official & user-created full-rig presets
IK Multimedia's latest TONEX update makes it possible for users of the popular amp and effects modelling ...
06/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
06/04/2026
Dalet Showcases Dalia Agentic AI and End-to-End Media Workflows at NAB Show 2026
Brie Clayton April 6, 2026
0 Comments
Dalet, a leading technology and...
06/04/2026
OpenDrives Shows Off Sports Expertise in Sports Business Hub located in NAB Show...
06/04/2026
Proton to Demonstrate 3D Application at NAB 2026
Brie Clayton April 6, 2026
0 Comments
Yet further creative potential unleashed through innovation in ...
06/04/2026
Autoscript Highlights Voice-Driven Prompting and PTZ Solutions at NAB 2026
Brie Clayton April 6, 2026
0 Comments
Experience Autoscript Voice, PTZ prom...
06/04/2026
Mediaproxy Highlights Significant Enhancements to its LogServer suite at NAB Sho...
06/04/2026
Wayne, N.J., April 6th, 2026 Phantom High-Speed announces the release of PCC 4...
06/04/2026
April 6th, 2026
TRIBECA STUDIOS AND LILLY ANNOUNCE WINNERS OF INAUGURAL VITAL...
06/04/2026
Back to All News
Netflix Expands Kids Entertainment Lineup With Playground App ...
05/04/2026
Tackles all reported bugs!
SoundBridge have just announced the launch of a new update that introduces a couple of minor changes to their remote collaboratio...
04/04/2026
The University of Arizona's Men's Basketball team has only loss twice th...
04/04/2026
1080p HDR arrives, a new generation of storytelling tools takes center stage, an...
04/04/2026
Michigan legends bring a new voice to the broadcast as TNT Sports and CBS Sports...
04/04/2026
Faster, cleaner and more intuitive than ever
The control software for Flock Audio's digitally controlled patchbay systems has just been treated to an up...
04/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
04/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
04/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
04/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
04/04/2026
DHD Introduces AI-Based Audio Noise Reduction to XD3 IP Core
Brie Clayton April 3, 2026
0 Comments
The accompanying image shows the rear panel of the ...
04/04/2026
Macnica Redefines ST 2110 Flexibility with Two Speeds on One Card
Brie Clayton April 3, 2026
0 Comments
New for NAB Show 2026, MEP100 SmartNIC now sup...
04/04/2026
Unified Media Workflows for Story-Centric Production
Brie Clayton April 3, 2026
0 Comments
Framelight X unifies field capture, editing and publishing ...
03/04/2026
Michigan's Fab Five will reunite for an alternate presentation of the Mich...
03/04/2026
Avid will exhibit at NAB Show 2026 (April 18-22, Booth N2226, Las Vegas Convention Center), demonstrating its Content Core platform and new AI-driven workflow c...
03/04/2026
Mark Roberts Motion Control (MRMC) has announced the appointment of Nick Barthee as Chief Operating Officer.
The announcement follows MRMC's transition fro...
03/04/2026
Interra Systems has announced that Elite Media Technologies has selected its BATON file-based QC solution for media workflows. Elite Media Technologies speciali...
03/04/2026
Ateme has announced that Moldtelecom has deployed Ateme technologies across its streaming workflow, covering encoding, delivery, operations, and analytics.
Mol...
03/04/2026
Grass Valley will demonstrate Framelight X, its content management platform, at NAB Show 2026. The platform connects capture, ingest, editing, and publishing in...
03/04/2026
Encompass Digital Media and Techex have announced a cloud-native Master Control ...