The Building Blocks of AI: Decoding the Role and Significance of Foundation Models

10/04/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and which showcases new hardware, software, tools and accelerations for RTX PC users.

Skyscrapers start with strong foundations. The same goes for apps powered by AI.

A foundation model is an AI neural network trained on immense amounts of raw data, generally with unsupervised learning.

It's a type of artificial intelligence model trained to understand and generate human-like language. Imagine giving a computer a huge library of books to read and learn from, so it can understand the context and meaning behind words and sentences, just like a human does.

A foundation model's deep knowledge base and ability to communicate in natural language make it useful for a broad range of applications, including text generation and summarization, copilot production and computer code analysis, image and video creation, and audio transcription and speech synthesis.

ChatGPT, one of the most notable generative AI applications, is a chatbot built with OpenAI's GPT foundation model. Now in its fourth version, GPT-4 is a large multimodal model that can ingest text or images and generate text or image responses.

Online apps built on foundation models typically access the models from a data center. But many of these models, and the applications they power, can now run locally on PCs and workstations with NVIDIA GeForce and NVIDIA RTX GPUs.

Foundation Model Uses

Foundation models can perform a variety of functions, including:

Language processing: understanding and generating text

Code generation: analyzing and debugging computer code in many programming languages

Visual processing: analyzing and generating images

Speech: generating text to speech and transcribing speech to text

They can be used as is or with further refinement. Rather than training an entirely new AI model for each generative AI application - a costly and time-consuming endeavor - users commonly fine-tune foundation models for specialized use cases.

Pretrained foundation models are remarkably capable, thanks to prompts and data-retrieval techniques like retrieval-augmented generation, or RAG. Foundation models also excel at transfer learning, which means they can be trained to perform a second task related to their original purpose.

For example, a general-purpose large language model (LLM) designed to converse with humans can be further trained to act as a customer service chatbot capable of answering inquiries using a corporate knowledge base.

Enterprises across industries are fine-tuning foundation models to get the best performance from their AI applications.

Types of Foundation Models

More than 100 foundation models are in use - a number that continues to grow. LLMs and image generators are the two most popular types of foundation models. And many of them are free for anyone to try - on any hardware - in the NVIDIA API Catalog.

LLMs are models that understand natural language and can respond to queries. Google's Gemma is one example; it excels at text comprehension, transformation and code generation. When asked about the astronomer Cornelius Gemma, it shared that his contributions to celestial navigation and astronomy significantly impacted scientific progress. It also provided information on his key achievements, legacy and other facts.

Extending the collaboration of the Gemma models, accelerated with the NVIDIA TensorRT-LLM on RTX GPUs, Google's CodeGemma brings powerful yet lightweight coding capabilities to the community. CodeGemma models are available as 7B and 2B pretrained variants that specialize in code completion and code generation tasks.

MistralAI's Mistral LLM can follow instructions, complete requests and generate creative text. In fact, it helped brainstorm the headline for this blog, including the requirement that it use a variation of the series' name AI Decoded, and it assisted in writing the definition of a foundation model.

Meta's Llama 2 is a cutting-edge LLM that generates text and code in response to prompts.

Mistral and Llama 2 are available in the NVIDIA ChatRTX tech demo, running on RTX PCs and workstations. ChatRTX lets users personalize these foundation models by connecting them to personal content - such as documents, doctors' notes and other data - through RAG. It's accelerated by TensorRT-LLM for quick, contextually relevant answers. And because it runs locally, results are fast and secure.
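To make the RAG idea concrete, here is a minimal Python sketch of the pattern: find the personal document most relevant to a question, then place it in the prompt that would be sent to a model. This is not how ChatRTX is implemented. The documents, file names and function names are hypothetical, simple word overlap stands in for the embedding-based search a real system would likely use, and the call to the model itself is left out.

```python
import re
from collections import Counter

# Hypothetical personal documents; a real app would load files from disk.
DOCUMENTS = {
    "trip.txt": "Flight to Lisbon departs on June 3 and the hotel is booked for four nights.",
    "gpu.txt": "The workstation has an RTX GPU with 16 GB of memory for local inference.",
    "recipe.txt": "The bread recipe needs flour, water, salt and a slow overnight rise.",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query, document):
    """Count query word occurrences that also appear in the document."""
    query_counts = Counter(tokenize(query))
    doc_counts = Counter(tokenize(document))
    return sum(min(n, doc_counts[word]) for word, n in query_counts.items())


def retrieve(query, documents, top_k=1):
    """Return the names of the top_k documents that best match the query."""
    ranked = sorted(
        documents,
        key=lambda name: score(query, documents[name]),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(query, documents, top_k=1):
    """Assemble a prompt that grounds the question in retrieved text."""
    names = retrieve(query, documents, top_k)
    context = "\n".join(f"[{name}] {documents[name]}" for name in names)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


if __name__ == "__main__":
    # The resulting prompt would be passed to a locally running LLM.
    print(build_prompt("How much memory does the GPU have?", DOCUMENTS))
```

Because only the retrieved passage is added to the prompt, the model can answer from content it was never trained on, and in a local setup that content never has to leave the PC.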

Image generators like StabilityAI's Stable Diffusion XL and SDXL Turbo let users generate images and stunning, realistic visuals. StabilityAI's video generator, Stable Video Diffusion, uses a generative diffusion model to synthesize video sequences with a single image as a conditioning frame.

Multimodal foundation models can simultaneously process more than one type of data - such as text and images - to generate more sophisticated outputs.

A multimodal model that works with both text and images could let users upload an image and ask questions about it. These types of models are quickly working their way into real-world applications like customer service, where they can serve as faster, more user-friendly versions of traditional manuals.

Kosmos 2 is Microsoft's groundbreaking multimodal model designed to understand and reason about visual elements in images.

Think Globally, Run AI Models Locally

GeForce RTX and NVIDIA RTX GPUs can run foundation models locally.

The results are fast and secure. Rather than relying on cloud-based services, users can harness apps like ChatRTX to process sensitive data on their local PC without sharing the data with a third party or needing an internet connection.
LINK: https://blogs.nvidia.com/blog/ai-decoded-foundation-models/...