Sony Pixel Power calrec Sony

AI Decoded: Demystifying Large Language Models, the Brains Behind Chatbots

13/03/2024

Editor's note: This post is part of our AI Decoded series, which aims to demystify AI by making the technology more accessible, while showcasing new hardware, software, tools and accelerations for RTX PC and workstation users.

If AI is having its iPhone moment, then chatbots are one of its first popular apps.

They're made possible thanks to large language models, deep learning algorithms pretrained on massive datasets - as expansive as the internet itself - that can recognize, summarize, translate, predict and generate text and other forms of content. They can run locally on PCs and workstations powered by NVIDIA GeForce and RTX GPUs.

LLMs excel at summarizing large volumes of text, classifying and mining data for insights, and generating new text in a user-specified style, tone or format. They can facilitate communication in any language, even beyond ones spoken by humans, such as computer code or protein and genetic sequences.

While the first LLMs dealt solely with text, later iterations were trained on other types of data. These multimodal LLMs can recognize and generate images, audio, videos and other content forms.

Chatbots like ChatGPT were among the first to bring LLMs to a consumer audience, with a familiar interface built to converse with and respond to natural-language prompts. LLMs have since been used to help developers write code and scientists to drive drug discovery and vaccine development.

But the AI models that power those functions are computationally intensive. Combining advanced optimization techniques and algorithms like quantization with RTX GPUs, which are purpose-built for AI, helps make LLMs compact enough and PCs powerful enough to run locally - no internet connection required. And a new breed of lightweight LLMs like Mistral - one of the LLMs powering Chat with RTX - sets the stage for state-of-the-art performance with lower power and storage demands.

Why Do LLMs Matter? LLMs can be adapted for a wide range of use cases, industries and workflows. This versatility, combined with their high-speed performance, offers performance and efficiency gains across virtually all language-based tasks.

DeepL, running on NVIDIA GPUs in the cloud, uses advanced AI to provide accurate text translations. LLMs are widely used in language translation apps such as DeepL, which uses AI and machine learning to provide accurate outputs.

Medical researchers are training LLMs on textbooks and other medical data to enhance patient care. Retailers are leveraging LLM-powered chatbots to deliver stellar customer support experiences. Financial analysts are tapping LLMs to transcribe and summarize earning calls and other important meetings. And that's just the tip of the iceberg.

Chatbots - like Chat with RTX - and writing assistants built atop LLMs are making their mark on every facet of knowledge work, from content marketing and copywriting to legal operations. Coding assistants were among the first LLM-powered applications to point toward the AI-assisted future of software development. Now, projects like ChatDev are combining LLMs with AI agents - smart bots that act autonomously to help answer questions or perform digital tasks - to spin up an on-demand, virtual software company. Just tell the system what kind of app is needed and watch it get to work.

Learn more about LLM agents on the NVIDIA developer blog.

Easy as Striking Up a Conversation Many people's first encounter with generative AI came by way of a chatbot such as ChatGPT, which simplifies the use of LLMs through natural language, making user action as simple as telling the model what to do.

LLM-powered chatbots can help generate a draft of marketing copy, offer ideas for a vacation, craft an email to customer service and even spin up original poetry.

Advances in image generation and multimodal LLMs have extended the chatbot's realm to include analyzing and generating imagery - all while maintaining the wonderfully simple user experience. Just describe an image to the bot or upload a photo and ask the system to analyze it. It's chatting, but now with visual aids.

For more on how these bots are designed, check out the on-demand webinar on Building Intelligent AI Chatbots Using RAG.

Future advancements will help LLMs expand their capacity for logic, reasoning, math and more, giving them the ability to break complex requests into smaller subtasks.

Progress is also being made on AI agents, applications capable of taking a complex prompt, breaking it into smaller ones, and engaging autonomously with LLMs and other AI systems to complete them. ChatDev is an example of an AI agent framework, but agents aren't limited to technical tasks.

For example, users could ask a personal AI travel agent to book a family vacation abroad. The agent would break that task into subtasks - itinerary planning, booking travel and lodging, creating packing lists, finding a dog walker - and independently execute them in order.

Unlock Personal Data With RAG As powerful as LLMs and chatbots are for general use, they can become even more helpful when combined with an individual user's data. By doing so, they can help analyze email inboxes to uncover trends, comb through dense user manuals to find the answer to a technical question about some hardware, or summarize years of bank and credit card statements.

Retrieval-augmented generation, or RAG, is one of the easiest and most effective ways to hone LLMs for a particular dataset.

An example of RAG on a PC. RAG enhances the accuracy and reliability of generative AI models with facts fetched from external sources. By connecting an LLM with practically any external resource, RAG lets users chat with data repositories while also giving the LLM the ability to cite its sources. The user experience is as simple as pointing the chatbot toward a file or directory.

For
LINK: https://blogs.nvidia.com/blog/ai-decoded-rtx-pc-llms-chatbots/...
See more stories from nvidia

Most recent headlines

04/08/2024

Dalet Appoints Santiago Solanas as CEO to Lead Next Era of Growth and Innovation

Dalet, a leading technology and service provider for media-rich organizations, is excited to announce Santiago Solanas as its new Chief Executive Officer (CEO)....

03/06/2024

Dalet and Veritone Reach Agreement to Distribute, Transact and Monetize Media Archives

Dalet, a leading technology and service provider for media-rich organizations, a...

21/05/2024

Capturing Cityscapes With LEE Filters

Part of being a photographer is adapting to situations, says photographer Verity Milligan. In this video, Milligan takes her camera to the Gas Street Basin nei...

21/05/2024

I Am L3Harris: Eldon

Today, I am a proud first-generation college graduate with a master's degree in electrical engineering. Never give up, keep learning, keep working hard and ...

21/05/2024

Arabsat powers its new state-of-the-art cloud-playout services through Grass Valley's AMPP Platform

Montreal, Canada- May 21, 2024 - Grass Valley, a pioneer in live production solu...

21/05/2024

Comcast Unveils $15 Price Tag for StreamSaver Bundle

PHILADELPHIA Comcast has released pricing and launch details of its recently announced StreamSaver bundle of Apple TV+, Netflix and Peacock services with the ne...

21/05/2024

TiVo Launches TiVo One Cross-Screen Ad Platform

SAN JOSE, Calif. Xperi's TiVo subsidiary continues to expand the capabilities and reach of its independent media platform with the debut of the TiVo One cr...

21/05/2024

COW Job Listing: Expert Viral Editor Viral content

COW Job Listing: Expert Viral Editor Viral content Brie Clayton May 21, 2024 0 Comments Expert Viral Editor Viral content March 6, 2024COW Job Listi...

21/05/2024

Avid partners with Post Super on post production training course

The programme aims to provide those working in post with a comprehensive understanding of their role within the broader post production ecosystem By Jenny Prie...

21/05/2024

Stephen van Rooyen appointed VodafoneZiggo CEO

He joins VodafoneZiggo following the retirement of Jeroen Hoencamp and will be based at the companys offices in Utrecht By Jenny Priestley Published: May 21,...

21/05/2024

TVBEurope May/June 2024 issue out now

Our latest issue hears from France T l visions about their plans for Paris 2024, we also celebrate the winners of our NAB 2024 Best of Show Awards and explore c...

21/05/2024

AMG Chooses Brightline For Frame' Ad Format

LOS ANGELES Allen Media Group (AMG) has launched an instream Frame ad for advertisers on its The Weather Channel, Local Now and HBCU Go connected TV (CTV) app...

21/05/2024

Wisconsin TV Stations Change Call Signs, Channel Numbers

MILWAUKEE Low power TV (LPTV) station operator Roseland Broadcasting has changed the call sign and the operating channel number of its two Wisconsin stations in...

21/05/2024

U.S. FAST Channels Hit Record Numbers

NEW YORK Despite rapid growth in recent years and worries that the FAST channel market may be reaching saturation, a new report from FASTMaster shows that the t...

21/05/2024

Cineverse to Offer Remastered HD, 4K Episodes of the Bob Rosss 'Joy of Painting

LOS ANGELES Hoping to build on the success of The Bob Ross Channel, Cineverse ha...

21/05/2024

DLT Launches 80's Sitcom Flashback Fast Channel

NEW YORK DLT Entertainment has announced the launch of 80's Sitcom Flashback, a FAST Channel exclusively curated to celebrate the comedies that dominated pr...

21/05/2024

SPL Media House Selects Grass Valley's AMPP

MONTREAL Grass Valley has announced that SPL Media House (SPL MH) in Riyadh, Saudi Arabia has deployed Grass Valley's AMPP SaaS Platform deployment in the r...

21/05/2024

Christ Church Deploys Pliant Technologies Intercoms at West Monroe Location

WEST MONROE, La. Pliant Technologies has announced that Christ Church, which has campuses in West Monroe, Ruston, and Sterlington, has deployed its CrewCom Wire...

21/05/2024

Hollyland Announces Pyro S, a New Wireless 4K Video Monitoring System for Filmmakers

Hollyland Announces Pyro S, a New Wireless 4K Video Monitoring System for Filmma...

21/05/2024

Avid Unveils Revolutionary Post-Production Micro-Certifications with New Learning Partner, Post Super

Avid Unveils Revolutionary Post-Production Micro-Certifications with New Learnin...

21/05/2024

MIX's Carpool Casanova Announces Spirit Award Finalists for Season 3

MIX 101.5's Carpool Casanova will wrap season three this Friday, May 24, 2024, but his final location has yet to be determined. The ultimate Spirit Award W...

21/05/2024

Dr. Ray Seol Awarded Grant for Creative Individuals from Mass Cultural Council

Dr. Ray Seol Awarded Grant for Creative Individuals from Mass Cultural Council The grant will expand his Seu Aprendiz project designed to help individuals bui...

21/05/2024

Korean Unscripted Series Agents of Mystery' by the Producer of The Devil's Plan' Premieres June 18

Back to All News Korean Unscripted Series Agents of Mystery' by the Produc...

21/05/2024

Crime Thriller The Victims' Game' Returns With A Brand New Season on June 21

Back to All News Crime Thriller The Victims' Game' Returns With A Bran...

21/05/2024

This Spring's Biggest Crime Mystery is Solved - Netflix Unveils Who Will Take on the Role of Jo Nesb's Harry Hole

Back to All News This Springs Biggest Crime Mystery is Solved - Netflix Unveils...

21/05/2024

Top 10 Week of May 13: Bridgerton' Season 3 Crowned #1

Back to All News Top 10 Week of May 13: Bridgerton' Season 3 Crowned #1 Entertainment 21 May 2024 Global Link copied to clipboard Spring has sprung, ...

21/05/2024

2024-05-21

PARIS Apple and le-de-France Mobilit s today introduced an easy, secure, and private way for customers to add a new Navigo card to Apple Wallet and purchase pa...

21/05/2024

Skeem Saam: Monday's episode, 20 May 2024 [video]

Skeem Saam: Monday's episode, 20 May 2024 [video]Missed an episode of Skeem Saam? No problem! Watch the latest episode of your favourite South African soapi...

21/05/2024

Popular children's TV shows you may have forgotten

Popular children's TV shows you may have forgottenSouth African television channels have entertained us over the years. Here is looking back at some popular...

21/05/2024

Actor Dumisani Dlamini discusses his role in Isitha: The Enemy'

Actor Dumisani Dlamini discusses his role in Isitha: The Enemy'Legendary actor Dumisani Dlamini has landed the character of Nsimbi in e.tv's popular te...

21/05/2024

Tonight on Smoke and Mirrors: General's romantic gesture to Lulu meets with hesitation

Tonight on Smoke and Mirrors: General's romantic gesture to Lulu meets with ...

21/05/2024

New Performance Optimizations Supercharge NVIDIA RTX AI PCs for Gamers, Creators and Developers

NVIDIA today announced at Microsoft Build new AI performance optimizations and i...

21/05/2024

NVIDIA Expands Collaboration With Microsoft to Help Developers Build, Deploy AI Applications Faster

If optimized AI workflows are like a perfectly tuned orchestra - where each comp...

21/05/2024

RT 2FM announces Drive It with The 2 Johnnies' is to come to an end

RT 2FM today announced that the Drive It with The 2 Johnnies is to come to an end on 31 May. Head of RT 2Fm, Dan Healy said: After two very successful years...

21/05/2024

RT launches a series of Animated Shorts on the theme of Home

HOME IS WHERE THE STORY BEGINS RT launches a series of Animated Shorts on the theme of Home Watch: rte.ie/player/kids Among these short animations is Envelo...

21/05/2024

A Superbloom of Updates in the May Studio Driver Gives Fresh Life to Content Creation

Editor's note: This post is part of our In the NVIDIA Studio series, which c...

21/05/2024

May 20, 2024

New method to reveal what drives brain diseases Scripps Research scientists develop CRISPR screen technology to determine disease mechanism from tissues with ac...

20/05/2024

Masters of Reinvention commissioned for Yesterday & UKTV Play

22nd May 2024 UKTV has commissioned Masters of Reinvention (6x60) for its leading factual channel Yesterday and free streaming service UKTV Play, to be produced...

20/05/2024

The Tuba Thieves Asks What It Means to Listen

PARK CITY, UTAH - JANUARY 22: The cast and crew of The Tuba Thieves pose during the 2023 Sundance Film Festival The Tuba Thieves premiere at Prospector Squa...

20/05/2024

Spotify CLASSICS: The 100 Greatest Hip-Hop Songs of the Streaming Era

Reviews by Carl Chery, Kemet High, and Adrian Covert In February, we launched Spotify CLASSICS-our first-ever program to celebrate catalog music. Our inaugur...

20/05/2024

Hamish Blake to host Alone Australia Season 2: The Reunion on 29 May

Hamish Blake to host Alone Australia Season 2: The Reunion on 29 May 20 May, 2024 Media releases As the season nears its end, SBS's hit series cements ...

20/05/2024

Poolside Gossip

Created by showrunner Abe Sylvia, the Apple TV+ comedic drama Palm Royale navigates the tale of one womans ambitious journey to make it amongst the upper crust,...

20/05/2024

SPL Media House selects Grass Valley AMPP SaaS Platform to Distribute Live Saudi Pro League Football Matches Globally

AMPP's cloud enabled rapid deployment of live production workflows with dist...

20/05/2024

EditShare celebrates anniversaries at CABSAT with innovations, integrations and implementations

EditShare Celebrates Anniversaries at CABSAT with Innovations, Integrations and ...

20/05/2024

Sony-Apollo's Bid for Paramount Gets Serious

A bid from Sony Pictures Entertainment and Apollo Global Management to acquire Paramount gained steam late last week, according to a report in the New York Time...

20/05/2024

Vizrt Unveils Viz Libro 8.3 With New Social Media Cropping Function

BERGEN, Norway Vizrt has released Viz Libero 8.3 with the announcement that its latest software version enables users to crop video to various aspect ratios fro...

20/05/2024

The Vampire Next Door Created with Pocket Cinema Camera 6K and DaVinci Resolve

The Vampire Next Door Created with Pocket Cinema Camera 6K and DaVinci Resolve Brie Clayton May 20, 2024 0 Comments Blackmagic Design today announced ...

20/05/2024

Tom Wills To Retire After Nearly 50 Years at WJXT Jacksonville

Tom Wills, long-running anchor at WJXT Jacksonville, Florida, has announced his retirement. He started at WJXT in 1975 and has anchored early-evening news for o...

20/05/2024

AMC Networks Promotes Emily Gotto to Senior VP at Shudder

AMC Networks said it promoted Emily Gotto to senior VP of acquisition and production for Shudder, the company's horror-themed streaming service....