Sony Pixel Power calrec Sony

AI Decoded: Demystifying Large Language Models, the Brains Behind Chatbots

13/03/2024

Editor's note: This post is part of our AI Decoded series, which aims to demystify AI by making the technology more accessible, while showcasing new hardware, software, tools and accelerations for RTX PC and workstation users.

If AI is having its iPhone moment, then chatbots are one of its first popular apps.

They're made possible thanks to large language models, deep learning algorithms pretrained on massive datasets - as expansive as the internet itself - that can recognize, summarize, translate, predict and generate text and other forms of content. They can run locally on PCs and workstations powered by NVIDIA GeForce and RTX GPUs.

LLMs excel at summarizing large volumes of text, classifying and mining data for insights, and generating new text in a user-specified style, tone or format. They can facilitate communication in any language, even beyond ones spoken by humans, such as computer code or protein and genetic sequences.

While the first LLMs dealt solely with text, later iterations were trained on other types of data. These multimodal LLMs can recognize and generate images, audio, videos and other content forms.

Chatbots like ChatGPT were among the first to bring LLMs to a consumer audience, with a familiar interface built to converse with and respond to natural-language prompts. LLMs have since been used to help developers write code and scientists to drive drug discovery and vaccine development.

But the AI models that power those functions are computationally intensive. Combining advanced optimization techniques and algorithms like quantization with RTX GPUs, which are purpose-built for AI, helps make LLMs compact enough and PCs powerful enough to run locally - no internet connection required. And a new breed of lightweight LLMs like Mistral - one of the LLMs powering Chat with RTX - sets the stage for state-of-the-art performance with lower power and storage demands.

Why Do LLMs Matter? LLMs can be adapted for a wide range of use cases, industries and workflows. This versatility, combined with their high-speed performance, offers performance and efficiency gains across virtually all language-based tasks.

DeepL, running on NVIDIA GPUs in the cloud, uses advanced AI to provide accurate text translations. LLMs are widely used in language translation apps such as DeepL, which uses AI and machine learning to provide accurate outputs.

Medical researchers are training LLMs on textbooks and other medical data to enhance patient care. Retailers are leveraging LLM-powered chatbots to deliver stellar customer support experiences. Financial analysts are tapping LLMs to transcribe and summarize earning calls and other important meetings. And that's just the tip of the iceberg.

Chatbots - like Chat with RTX - and writing assistants built atop LLMs are making their mark on every facet of knowledge work, from content marketing and copywriting to legal operations. Coding assistants were among the first LLM-powered applications to point toward the AI-assisted future of software development. Now, projects like ChatDev are combining LLMs with AI agents - smart bots that act autonomously to help answer questions or perform digital tasks - to spin up an on-demand, virtual software company. Just tell the system what kind of app is needed and watch it get to work.

Learn more about LLM agents on the NVIDIA developer blog.

Easy as Striking Up a Conversation Many people's first encounter with generative AI came by way of a chatbot such as ChatGPT, which simplifies the use of LLMs through natural language, making user action as simple as telling the model what to do.

LLM-powered chatbots can help generate a draft of marketing copy, offer ideas for a vacation, craft an email to customer service and even spin up original poetry.

Advances in image generation and multimodal LLMs have extended the chatbot's realm to include analyzing and generating imagery - all while maintaining the wonderfully simple user experience. Just describe an image to the bot or upload a photo and ask the system to analyze it. It's chatting, but now with visual aids.

For more on how these bots are designed, check out the on-demand webinar on Building Intelligent AI Chatbots Using RAG.

Future advancements will help LLMs expand their capacity for logic, reasoning, math and more, giving them the ability to break complex requests into smaller subtasks.

Progress is also being made on AI agents, applications capable of taking a complex prompt, breaking it into smaller ones, and engaging autonomously with LLMs and other AI systems to complete them. ChatDev is an example of an AI agent framework, but agents aren't limited to technical tasks.

For example, users could ask a personal AI travel agent to book a family vacation abroad. The agent would break that task into subtasks - itinerary planning, booking travel and lodging, creating packing lists, finding a dog walker - and independently execute them in order.

Unlock Personal Data With RAG As powerful as LLMs and chatbots are for general use, they can become even more helpful when combined with an individual user's data. By doing so, they can help analyze email inboxes to uncover trends, comb through dense user manuals to find the answer to a technical question about some hardware, or summarize years of bank and credit card statements.

Retrieval-augmented generation, or RAG, is one of the easiest and most effective ways to hone LLMs for a particular dataset.

An example of RAG on a PC. RAG enhances the accuracy and reliability of generative AI models with facts fetched from external sources. By connecting an LLM with practically any external resource, RAG lets users chat with data repositories while also giving the LLM the ability to cite its sources. The user experience is as simple as pointing the chatbot toward a file or directory.

For
LINK: https://blogs.nvidia.com/blog/ai-decoded-rtx-pc-llms-chatbots/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

12/02/2026

Spectrum News Acquires New England Cable News

Share Copy link Facebook X Linkedin Bluesky Email...

12/02/2026

Vizrt Unveils Campus Stadium Production Bundles

Share Copy link Facebook X Linkedin Bluesky Email...

12/02/2026

Hulu + Live TV Adds Fubo Sports Network to Channel Line-up

Share Copy link Facebook X Linkedin Bluesky Email...

12/02/2026

FCC To Hold Open Commission Meeting on Feb. 18

Share Copy link Facebook X Linkedin Bluesky Email...

12/02/2026

Ralph M. Oakley to Receive NAB's Chuck Sherman TV Leadership Award

Share Copy link Facebook X Linkedin Bluesky Email...

12/02/2026

February 11, 2026

TIME100 Health list features Scripps Research Professor Darrell Irvine Irvine is recognized for his work in empowering the immune system to fight disease, which...

11/02/2026

FYI: Phone Support Maintenance

FYI: Phone Support Maintenance One thing we pride ourselves on here at Utah Scientific is our 24-hour support included with our signature 10-year hardware warra...

11/02/2026

Bitmovin Appoints Ian Baglow as Co-Chief Executive Officer

Leading provider of video streaming solutions, Bitmovin, has appointed Ian Baglow as Co-CEO alongside existing CEO and Co-Founder Stefan Lederer. Under this str...

11/02/2026

Paramount and CBS Partner to Air UFC 326

Paramount and the CBS Television Network will partner to air UFC 326: HOLLOWAY vs. OLIVEIRA 2 live on Saturday, March 7, from T-Mobile Arena in Las Vegas, mar...

11/02/2026

MLB.TV Launches on ESPN Beginning February 10

Beginning February 10, fans can buy MLB.TV on ESPN, a new milestone in one of sports media's longest-standing partnerships. ESPN becomes the new streaming h...

11/02/2026

Fubo Sports Network Launches on Hulu + Live TV

Fubo Sports Network is available to Hulu's Live TV subscribers in the core $89.99 a month subscription plan, which also includes full access to the entire H...

11/02/2026

Rai Selects Imagine Selenio Network Processor for IP Migration

Following a competitive public tender process, Rai (Radiotelevisione Italiana), the national public broadcasting company of Italy, has awarded Imagine Communica...

11/02/2026

MLB Makes In-Market Streaming Subscriptions for 20 Clubs Available to Fans

Major League Baseball is making in-market streaming subscriptions for 20 Clubs available today for fans. Subscriptions for the following Clubs are available vi...

11/02/2026

5G Broadcast Trials Return to the Olympic Stage at Milano Cortina 2026

Building on successful demonstrations during the Paris Olympics 2024, Italian public service broadcaster Rai and the European Broadcasting Union (EBU) are condu...

11/02/2026

ESPN and Disney Launch We're Going, the First Marketing Campaign for ESPN's Inaugural Super Bowl

Following Sunday's Super Bowl LX, ESPN and Disney unveiled We're Going,...

11/02/2026

Stats Perform: 2026 Super Bowl Latency Report

Delayed streams are a growing source of frustration for sports fans. During the 2026 Super Bowl, some streams lagged up to 62 seconds behind the action on the f...

11/02/2026

NASCAR Channel and FloSports to Simulcast 16 Races Live

NASCAR and FloSports announces an expanded slate of racing events that will bring FloRacing coverage live throughout the 2026 season to the NASCAR Channel, furt...

11/02/2026

Manifold Expands Sales Presence in Europe

Manifold technologies GmbH announces the appointment of Nick Tucker as Sales Manager for Europe, reinforcing the company's continued growth across broadcast...

11/02/2026

Genies and MLB Players Inc. Team Up to Create AI Characters of MLB Players

Genies, the AI avatar technology company powering the next era of interactive digital identity, entered into a landmark collaboration with MLB Players, Inc., th...

11/02/2026

ICC, Google Partner for the First-Ever AI-Powered ICC Men's T20 World Cup fuelled by Gemini & Pixel

The International Cricket Council (ICC) and Google have joined forces for an AI-...

11/02/2026

Dolby Highlights From First Ever Super Bowl LX Innovation Summit

Dolby's CEO Kevin Yeaman and Giles Baker, SVP of Dolby Cloud Solutions, shared how the brand's latest innovations - Dolby Vision, Dolby Atmos, and Dolby...

11/02/2026

Detroit Tigers and Detroit Red Wings Enter Broadcast Partnership with MLB

Ilitch Sports + Entertainment has entered a first of its kind partnership with Major League Baseball, which will provide broadcast support to both the Detroit T...

11/02/2026

A Changing Landscape: MLB Local Media Brings Detroit Tigers, Los Angeles Angels Into the Fold; ESPN Begins Distribution of MLB.TV

Broadcasts of the NHL's Detroit Red Wings will also be produced by the leagu...

11/02/2026

DAM Los Angeles

Video moves fast can your DAM keep up? Join Blue Lucy in LA for the West Coast's leading Digital Asset Management event as we explore, celebrate, and acc...

11/02/2026

Super Bowl LX Delivers 124.9 Million Viewers

NEW YORK - February 10, 2026 - An estimated 124.9 million viewers watched Super Bowl LX on Sunday, February 8, according to Nielsen's Big Data Panel measu...

11/02/2026

Scripps Selling Court TV to Jellysmack

Share Copy link Facebook X Linkedin Bluesky Email...

11/02/2026

Schulze-Brakel Introduces Wireless Clip-On Branding For Mics

Share Copy link Facebook X Linkedin Bluesky Email...

11/02/2026

LiveU To Showcase Expanded IP-Video EcoSystem at 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

11/02/2026

Clear-Com Powers TEDNext 2025 with Gen-IC Virtual Interco...

Clear-Com provided an advanced, IP-based communications infrastructure for TEDNext 2025, supporting production, media, and editorial teams with a highly flexib...

11/02/2026

Astera Releases QuikBeam - Versatile 200W Equivalent LED...

Astera introduces QuikBeam, the newest addition to its acclaimed Quik family of focusing LED Fresnels. This ultra-compact spotlight combines the equivalent powe...

11/02/2026

Rai Selects Imagine Selenio Network Processor for IP Migr...

Following a competitive public tender process, Rai (Radiotelevisione Italiana), the national public broadcasting company of Italy, has awarded Imagine Communica...

11/02/2026

DoPchoice to Intro SNAP RABBIT Octa 5 SNAPBAG

With Convertible Mount for NL Bowens & Aputure A Mounts See it at BSC Expo Stand #133 LCA DoPchoice continues to refine light shaping tools for professional LE...

11/02/2026

ZEISS Aatma -Contemporary Full Frame Primes with a Soulfu...

World Premiere at BSC Expo, Booth #319 Oberkochen/Germany, 10 February 2026 ZEISS introduces the new Aatma, set of nine high-end full frame T1.5 cinema primes ...

11/02/2026

NUGEN Audio Plug Ins Help Nick Fry Navigate the Demands o...

As Re-recording Mixer and Head of Sound at The Farm, one of UK's leading post-production facilities, Nick Fry has built his career on making stories sound a...

11/02/2026

iSpot Introduces Agentic AI Platform iSpot SAGE

Share Copy link Facebook X Linkedin Bluesky Email...

11/02/2026

NBC Sports Regional Sports Networks Taps Sportradar for NBA Coverage

Share Copy link Facebook X Linkedin Bluesky Email...

11/02/2026

MLB.TV Launches on ESPN

Share Copy link Facebook X Linkedin Bluesky Email...

11/02/2026

Super Bowl LX Attracts Nearly 125 Million U.S. Viewers

Share Copy link Facebook X Linkedin Bluesky Email...

11/02/2026

Graduate Spotlight: Gabrielle Rodriguez

Graduate Spotlight: Gabrielle Rodriguez The educator, who grew up in the Philippines, shares how shes bringing what she learned at Berklee back home. Februar...

11/02/2026

Sky brings together Netflix, Disney+, HBO Max and Hayu into one single subscription, exclusively on Sky

Wednesday 11 February 2026 Sky brings together Netflix, Disney , HBO Max and Ha...

11/02/2026

Netflix Confirms Production of Love O'Clock' From the Writers of Business Proposal' and True Beauty' Director

Back to All News Netflix Confirms Production of Love O'Clock' From the...

11/02/2026

Investing in Belgian Stories: A Commitment to Culture and Choice

Back to All News Investing in Belgian Stories: A Commitment to Culture and Choice From left to right: Undercover, Ang le, Rough Diamonds, Into the Night, John...

11/02/2026

Lisbon 2026

At the end of January, ICG headed off to the Portuguese capital, Lisbon, for our annual conference. An early flight gave us plenty of time to start exploring s...

11/02/2026

ABS Strengthens Ku-Band Capacity and Expands Regional Reach Through Strategic Partnership with Horizon Teleports

ABS Strengthens Ku-Band Capacity and Expands Regional Reach Through Strategic Pa...