AI Decoded: Demystifying Large Language Models, the Brains Behind Chatbots

13/03/2024

Editor's note: This post is part of our AI Decoded series, which aims to demystify AI by making the technology more accessible, while showcasing new hardware, software, tools and accelerations for RTX PC and workstation users.

If AI is having its iPhone moment, then chatbots are one of its first popular apps.

They're made possible thanks to large language models, deep learning algorithms pretrained on massive datasets - as expansive as the internet itself - that can recognize, summarize, translate, predict and generate text and other forms of content. They can run locally on PCs and workstations powered by NVIDIA GeForce and RTX GPUs.

LLMs excel at summarizing large volumes of text, classifying and mining data for insights, and generating new text in a user-specified style, tone or format. They can facilitate communication in any language, even beyond ones spoken by humans, such as computer code or protein and genetic sequences.

While the first LLMs dealt solely with text, later iterations were trained on other types of data. These multimodal LLMs can recognize and generate images, audio, videos and other content forms.

Chatbots like ChatGPT were among the first to bring LLMs to a consumer audience, with a familiar interface built to converse with and respond to natural-language prompts. LLMs have since been used to help developers write code and to help scientists drive drug discovery and vaccine development.

But the AI models that power those functions are computationally intensive. Combining advanced optimization techniques and algorithms like quantization with RTX GPUs, which are purpose-built for AI, helps make LLMs compact enough and PCs powerful enough to run locally - no internet connection required. And a new breed of lightweight LLMs like Mistral - one of the LLMs powering Chat with RTX - sets the stage for state-of-the-art performance with lower power and storage demands.
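To make the idea of quantization concrete, here is a minimal sketch of one simple scheme, symmetric 8-bit quantization, in plain Python. It is an illustration only: the function names and sample weights are made up for this example, and the article does not say which quantization method Chat with RTX or any specific model uses.

```python
def quantize_int8(weights):
    """Map float weights to integers in [-127, 127] plus one shared scale factor."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float weights from the stored integers."""
    return [q * scale for q in quantized]


# Hypothetical weights, chosen only for illustration.
weights = [0.42, -1.27, 0.05, 0.9]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

print(q)         # small integers that fit in one byte each
print(restored)  # close to the original weights, not identical
```

Each weight is stored in one byte instead of four, which is where the memory saving comes from; the cost is a small rounding error of at most half the scale factor per weight.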

Why Do LLMs Matter?

LLMs can be adapted for a wide range of use cases, industries and workflows. This versatility, combined with their speed, offers performance and efficiency gains across virtually all language-based tasks.

LLMs are widely used in language translation apps such as DeepL, which runs on NVIDIA GPUs in the cloud and uses AI and machine learning to provide accurate text translations.

Medical researchers are training LLMs on textbooks and other medical data to enhance patient care. Retailers are leveraging LLM-powered chatbots to deliver stellar customer support experiences. Financial analysts are tapping LLMs to transcribe and summarize earnings calls and other important meetings. And that's just the tip of the iceberg.

Chatbots - like Chat with RTX - and writing assistants built atop LLMs are making their mark on every facet of knowledge work, from content marketing and copywriting to legal operations. Coding assistants were among the first LLM-powered applications to point toward the AI-assisted future of software development. Now, projects like ChatDev are combining LLMs with AI agents - smart bots that act autonomously to help answer questions or perform digital tasks - to spin up an on-demand, virtual software company. Just tell the system what kind of app is needed and watch it get to work.

Learn more about LLM agents on the NVIDIA developer blog.

Easy as Striking Up a Conversation

Many people's first encounter with generative AI came by way of a chatbot such as ChatGPT, which simplifies the use of LLMs through natural language, making user action as simple as telling the model what to do.

LLM-powered chatbots can help generate a draft of marketing copy, offer ideas for a vacation, craft an email to customer service and even spin up original poetry.

Advances in image generation and multimodal LLMs have extended the chatbot's realm to include analyzing and generating imagery - all while maintaining the wonderfully simple user experience. Just describe an image to the bot or upload a photo and ask the system to analyze it. It's chatting, but now with visual aids.

For more on how these bots are designed, check out the on-demand webinar on Building Intelligent AI Chatbots Using RAG.

Future advancements will help LLMs expand their capacity for logic, reasoning, math and more, giving them the ability to break complex requests into smaller subtasks.

Progress is also being made on AI agents, applications capable of taking a complex prompt, breaking it into smaller ones, and engaging autonomously with LLMs and other AI systems to complete them. ChatDev is an example of an AI agent framework, but agents aren't limited to technical tasks.

For example, users could ask a personal AI travel agent to book a family vacation abroad. The agent would break that task into subtasks - itinerary planning, booking travel and lodging, creating packing lists, finding a dog walker - and independently execute them in order.
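The travel-agent scenario can be sketched as a small plan-then-execute loop. This is a toy illustration under stated assumptions: `plan` is a hard-coded stand-in for the LLM call that would produce the subtasks, and the `tools` entries are hypothetical placeholders, not a real agent framework or ChatDev's API.

```python
from typing import Callable, Dict, List


def plan(request: str) -> List[str]:
    """Hypothetical planner: a real agent would ask an LLM to produce this list."""
    return [
        "plan itinerary",
        "book travel and lodging",
        "create packing list",
        "find dog walker",
    ]


def run_agent(request: str, tools: Dict[str, Callable[[str], str]]) -> List[str]:
    """Run each subtask in order, dispatching to a matching tool if one exists."""
    log = []
    for subtask in plan(request):
        tool = tools.get(subtask)
        outcome = tool(request) if tool else "no tool available"
        log.append(f"{subtask}: {outcome}")
    return log


# Hypothetical tools; each would wrap a real service or another model.
tools = {
    "plan itinerary": lambda req: "drafted a 7-day itinerary",
    "book travel and lodging": lambda req: "held flights and a hotel",
    "create packing list": lambda req: "listed 12 items",
}

log = run_agent("Book a family vacation abroad", tools)
for line in log:
    print(line)
```

Keeping planning separate from execution means a subtask with no available tool, such as the dog walker here, is reported rather than silently skipped.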

Unlock Personal Data With RAG

As powerful as LLMs and chatbots are for general use, they can become even more helpful when combined with an individual user's data. By doing so, they can help analyze email inboxes to uncover trends, comb through dense user manuals to find the answer to a technical question about some hardware, or summarize years of bank and credit card statements.

Retrieval-augmented generation, or RAG, is one of the easiest and most effective ways to hone LLMs for a particular dataset.

An example of RAG on a PC.

RAG enhances the accuracy and reliability of generative AI models with facts fetched from external sources. By connecting an LLM with practically any external resource, RAG lets users chat with data repositories while also giving the LLM the ability to cite its sources. The user experience is as simple as pointing the chatbot toward a file or directory.
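The retrieval half of RAG can be sketched in a few lines. The example below is a simplified illustration, not how Chat with RTX is implemented: it scores documents with word-count cosine similarity where a production system would typically use learned embeddings, the file names and contents are invented, and the final prompt is only built, not sent to a model.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, documents, k=1):
    """Return the k (name, text) pairs most similar to the query."""
    query_vec = Counter(tokenize(query))
    ranked = sorted(
        documents.items(),
        key=lambda item: cosine(query_vec, Counter(tokenize(item[1]))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, hits):
    """Prepend the retrieved passages, tagged by source name, to the question."""
    context = "\n".join(f"[{name}] {text}" for name, text in hits)
    return (
        "Answer using only the sources below and cite them by name.\n"
        f"{context}\nQuestion: {query}"
    )


# Hypothetical local files, invented for this example.
documents = {
    "router_manual.txt": "To reset the router, hold the reset button for ten seconds.",
    "bank_2023.txt": "Total card spending in 2023 was highest in December.",
}

query = "How do I reset the router?"
hits = retrieve(query, documents)
prompt = build_prompt(query, hits)
print(prompt)
```

Because each passage carries its source name into the prompt, the model has what it needs to cite where an answer came from.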

For
LINK: https://blogs.nvidia.com/blog/ai-decoded-rtx-pc-llms-chatbots/...