What Are Foundation Models?

13/03/2023

The mics were live and tape was rolling in the studio where the Miles Davis Quintet was recording dozens of tunes in 1956 for Prestige Records.

When an engineer asked for the next song's title, Davis shot back, I'll play it, and tell you what it is later.

Like the prolific jazz trumpeter and composer, researchers have been generating AI models at a feverish pace, exploring new architectures and use cases. Focused on plowing new ground, they sometimes leave to others the job of categorizing their work.

A team of more than a hundred Stanford researchers collaborated to do just that in a 214-page paper released in the summer of 2021.

In a 2021 paper, researchers reported that foundation models are finding a wide array of uses. They said transformer models, large language models (LLMs) and other neural networks still being built are part of an important new category they dubbed foundation models.

Foundation Models Defined A foundation model is an AI neural network - trained on mountains of raw data, generally with unsupervised learning - that can be adapted to accomplish a broad range of tasks, the paper said.

The sheer scale and scope of foundation models from the last few years have stretched our imagination of what's possible, they wrote.

Two important concepts help define this umbrella category: Data gathering is easier, and opportunities are as wide as the horizon.

No Labels, Lots of Opportunity Foundation models generally learn from unlabeled datasets, saving the time and expense of manually describing each item in massive collections.

Earlier neural networks were narrowly tuned for specific tasks. With a little fine-tuning, foundation models can handle jobs from translating text to analyzing medical images.

Foundation models are demonstrating impressive behavior, and they're being deployed at scale, the group said on the website of its research center formed to study them. So far, they've posted more than 50 papers on foundation models from in-house researchers alone.

I think we've uncovered a very small fraction of the capabilities of existing foundation models, let alone future ones, said Percy Liang, the center's director, in the opening talk of the first workshop on foundation models.

AI's Emergence and Homogenization In that talk, Liang coined two terms to describe foundation models:

Emergence refers to AI features still being discovered, such as the many nascent skills in foundation models. He calls the blending of AI algorithms and model architectures homogenization, a trend that helped form foundation models. (See chart below.)

The field continues to move fast.

A year after the group defined foundation models, other tech watchers coined a related term - generative AI. It's an umbrella term for transformers, large language models, diffusion models and other neural networks capturing people's imaginations because they can create text, images, music, software and more.

Generative AI has the potential to yield trillions of dollars of economic value, said executives from the venture firm Sequoia Capital who shared their views in a recent AI Podcast.

A Brief History of Foundation Models We are in a time where simple methods like neural networks are giving us an explosion of new capabilities, said Ashish Vaswani, an entrepreneur and former senior staff research scientist at Google Brain who led work on the seminal 2017 paper on transformers.

That work inspired researchers who created BERT and other large language models, making 2018 a watershed moment for natural language processing, a report on AI said at the end of that year.

Google released BERT as open-source software, spawning a family of follow-ons and setting off a race to build ever larger, more powerful LLMs. Then it applied the technology to its search engine so users could ask questions in simple sentences.

In 2020, researchers at OpenAI announced another landmark transformer, GPT-3. Within weeks, people were using it to create poems, programs, songs, websites and more.

Language models have a wide range of beneficial applications for society, the researchers wrote.

Their work also showed how large and compute-intensive these models can be. GPT-3 was trained on a dataset with nearly a trillion words, and it sports a whopping 175 billion parameters, a key measure of the power and complexity of neural networks.

The growth in compute demands for foundation models. (Source: GPT-3 paper) I just remember being kind of blown away by the things that it could do, said Liang, speaking of GPT-3 in a podcast.

The latest iteration, ChatGPT - trained on 10,000 NVIDIA GPUs - is even more engaging, attracting over 100 million users in just two months. Its release has been called the iPhone moment for AI because it helped so many people see how they could use the technology.

One timeline describes the path from early AI research to ChatGPT. (Source: blog.bytebytego.com) From Text to Images About the same time ChatGPT debuted, another class of neural networks, called diffusion models, made a splash. Their ability to turn text descriptions into artistic images attracted casual users to create amazing images that went viral on social media.

The first paper to describe a diffusion model arrived with little fanfare in 2015. But like transformers, the new technique soon caught fire.

Researchers posted more than 200 papers on diffusion models last year, according to a list maintained by James Thornton, an AI researcher at the University of Oxford.

In a tweet, Midjourney CEO David Holz revealed that his diffusion-based, text-to-image service has more than 4.4 million users. Serving them requires more than 10,000 NVIDIA GPUs mainly for AI inference, he said in an interview (subscription required).

Dozens of Models in Use Hundreds of foundation models are now available

LINK:	https://blogs.nvidia.com/blog/2023/03/13/what-are-foundation-models/...
	See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

07/10/2026

Dalet Flex LTS Delivers Smarter Media Operations from Ingest to Distribution

Dalet, a leading technology and service provider for media-rich organizations, today announced the latest Long-Term Supported (LTS) release of Dalet Flex. Build...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

21/07/2026

Cinegy directly addresses the real issues of software-def...

Cinegy GmbH, the premier provider of software-defined television technology, will use its presence at IBC2026 (stand 7.A01, Amsterdam RAI, 11 14 September) to...

21/07/2026

Calrec brings leading audio solutions and long term busin...

Calrec will be located in Hall 8, on Stand C47 Beyond bigger For years, broadcast facilities were built around one assumption: provision for the biggest pro...

21/07/2026

Pebble takes the next step towards the future of media de...

Pebble, the leading automation, content management and integrated channel specialist, will discuss its future-facing developments for the new generation of medi...

21/07/2026

nxtedition Brings Production, AI and Automation Together...

nxtedition returns to IBC2026 with its consolidated production platform, showing how scripting, editing, graphics, AI-assisted tools and automation can work wit...

21/07/2026

Big Blue Marble brings C2PA Content Credentials to Cloud...

C2PA content signing helps broadcasters, public institutions, and publishers give audiences a verifiable record of where their video content came from and how i...

21/07/2026

HDHomeRun Enables Operation During Internet Outages

Share Copy link Facebook X Linkedin Bluesky Email...

21/07/2026

Calif. Federal Judge Pauses Paramount-WBD Merger

Share Copy link Facebook X Linkedin Bluesky Email...

21/07/2026

AWARN Rebuts Weigel Claims of 3.0 EAS Problems

Share Copy link Facebook X Linkedin Bluesky Email...

21/07/2026

FCC Announces Tentative Agenda for August Open Meeting

Share Copy link Facebook X Linkedin Bluesky Email...

21/07/2026

VIDA Introduces QC Manager for Collaborative Quality Cont...

New capability centralizes collaboration, feedback, security and status tracking within VIDA, providing a structured, auditable approach to master quality contr...

21/07/2026

Lightcraft Launches Exclusive Spark Story Beta for Filmmakers and Creators at SIGGRAPH 2026

Lightcraft Launches Exclusive Spark Story Beta for Filmmakers and Creators at SI...

21/07/2026

Indie Road Movie Where in the Hell Shot with Pocket Cinema Camera 4K

Indie Road Movie Where in the Hell Shot with Pocket Cinema Camera 4K Brie Clayton July 20, 2026 0 Comments Colorist blends vintage film looks to shape...

21/07/2026

Which USS Defiant Pulse Phaser effect is better?

Which USS Defiant Pulse Phaser effect is better? Graham Quince July 20, 2026 0 Comments Aargh, ever since @DarkRavenProductions posted a comment ask...

21/07/2026

RT announces biggest ever year for new drama

160 hours of Irish storytelling for 2026 KIN and The Walsh Sisters return RT has today unveiled its most ambitious drama slate ever, with a record-breakin...

20/07/2026

SVG All-Stars: Rob Coons, Senior Director, StudentU, Big Ten Network

The former Northwestern broadcast-operations leader is helping train the next generation of live-sports-production talent across the Big Ten The sports-product...

20/07/2026

Give Me the Backstory: Get to Know Stacey Lee, the Filmmaker Behind Murder 101

By Lucy Spicer One of the most exciting things about the Sundance Film Festival is having a front-row seat for the bright future of independent filmmaking. Whi...

20/07/2026

Graph Tech Guitar Labs open UK Online Store

Product range now readily available in the UK Graph Tech Guitar Labs have just announced the launch of a new UK Online Store that makes it quicker and easie...

20/07/2026

Audeze launch the Maxwell 2 ANC

Popular headset gains active noise cancellation The Maxwell headet was Audeze's first foray into the gaming world, and thanks to its Dolby Atmos compati...

20/07/2026

The National Film and Video Foundation (NFVF) Call for Public Screening funding applications for Cycle 1, 2026/27 financial year is Open

The NFVF, an agency of the Department of Sport, Arts and Culture, has released t...

20/07/2026

Telemundo Inks Another Major U.S. Spanish-Language Soccer Deal

Share Copy link Facebook X Linkedin Bluesky Email...

20/07/2026

HDHomeRun Enables Operation During An Internet Outage

Share Copy link Facebook X Linkedin Bluesky Email...

20/07/2026

Starfish highlights flexible, scalable transport stream p...

Starfish Technologies will use IBC2026 to showcase the flexibility of its transport stream processing software, including the latest versions of TS Splicer (Win...

20/07/2026

Bitfocus makes the connections at IBC2026

Bitfocus, the specialist in media control and monitoring, will show at IBC2026 (Elgato stand 8.D31, Amsterdam RAI, 11 14 September) how its Buttons control la...

20/07/2026

Mediagenix Introduces Trusted Agentic AI Operating Model...

Mediagenix, a global leader in smart content solutions to profitably connect the right content to the right audience, today announced new AI capabilities that e...

20/07/2026

Big Blue Marble Cloud DRM nominated for Streaming Media R...

Big Blue Marble's Cloud DRM has been nominated in the DRM/Content Protection category of the 2026 Streaming Media Readers' Choice Awards. Only four pro...

20/07/2026

Sky brings free global roaming to millions of loyal customers when they choose Sky Mobile

Monday 20 July 2026 Sky brings free global roaming to millions of loyal custome...

20/07/2026

Red Seat Ventures Announces Introduction of Premium Creator and Podcast Communities to Amazon DSP

Red Seat Ventures Announces Introduction of Premium Creator and Podcast Communit...

20/07/2026

RT secures exclusive free-to-air Irish rights to the 2030 FIFA World Cup

FIFA World Cup Final sets new RT Player record as the most-streamed single event in the platform's history Over 1 million viewers watched live on RT 2 as ...

20/07/2026

At SIGGRAPH, NVIDIA Advances Graphics and Simulation With Agentic and Physical AI

At this year's SIGGRAPH conference, running through Thursday, July 23, in Lo...

20/07/2026

Bristol Myers Squibb Building Life Science Industry's Most Advanced AI Factory on NVIDIA Vera Rubin

Erin Davis calls it the SuperDuperPOD. That's two things in one name: phar...

19/07/2026

Halftime Show at FIFA World Cup Final Joins a Litany of Firsts for the Quadrennial Event

Justin Bieber, Madonna, Shakira, BTS make for a diverse lineup, and the venue ad...

19/07/2026

More Than Just a Game: FIFA World Cup's Lance Brass Breaks Down Stadium Production and Entertainment

The stage is set: three-time champion Argentina will defend its World Cup title ...

19/07/2026

Acustica reveal Mystic 2

Channel strip plug-in gets upgraded Acustica Audio's vintage-inspired channel strip plug-in has just been treated to an update that expands its tonal ra...

18/07/2026

More Than Just a Game: FIFA World Cups Lance Brass Breaks Down Stadium Production & Entertainment

Topics include pre-match ceremonies, live performances, the tournament's fir...

18/07/2026

As the Final Approaches, FIFA and HBS Take Stock of a World Cup That Rewrote the Production Playbook

When FIFA and HBS set out to produce the 2026 FIFA World Cup, the numbers alone ...

18/07/2026

IK Multimedia add Brown Panel Signature Collection to TONEX

Captures nine sought-after Fender amps IK Multimedia's latest TONEX expansion captures a selection of nine rare Brown Panel' Fender amps that were ...

18/07/2026

Frap Tools update the Magnolia

Latest batch ships alongside firmware update Since being unveiled at Superbooth 2025, Frap Tools' debut polysynth has been met with widespread praise, a...

18/07/2026

Netflix Viewing Hit Record 97 Billion Hours in First Half of 2026

Share Copy link Facebook X Linkedin Bluesky Email...

18/07/2026

YouTube's Creative Ecosystem Contributed $60 Billion to U.S. GDP

Share Copy link Facebook X Linkedin Bluesky Email...

17/07/2026

SVG GameDay, Ep. 24: Mercedes-Benz Stadiums Cole Gallagher - Supporting Shows in the ATL

In-venue and creative video staffers at the professional and collegiate level ha...

17/07/2026

Brooklyn Bowl Williamsburg Stagehands Vote to Join IATSE Local 4

Production workers at Brooklyn Bowl's Williamsburg location voted 15-1 to join IATSE Local 4. The bargaining unit covers 24 production workers at the venue,...

17/07/2026

DAZN and ADI Predictstreet Announce Exclusive Global Prediction Market Partnership

DAZN and ADI Predictstreet have announced an exclusive global strategic partners...

17/07/2026

Zixi and Comcast Technology Solutions Announce Integration for C-Band Satellite Replacement

Zixi and Comcast Technology Solutions (CTS) have announced a strategic integrati...

17/07/2026

Professional Fighters League Announces Multi-Year Partnership with ESPN in Brazil

Professional Fighters League (PFL) has announced a multi-year partnership with E...

17/07/2026

Spectrum Business Launches Spectrum TV Control Pro for Multi-Screen Venue Management

Spectrum Business has announced Spectrum TV Control Pro, a centralized app-based...

17/07/2026

Clark Wire and Cable Appoints Rick Fernandez as Latin American Representative

Clark Wire and Cable has announced that Rick Fernandez, Managing Director of Axxion Consulting, will serve as Independent Manufacturers Representative for Centr...

View most recent headlines