
The mics were live and tape was rolling in the studio where the Miles Davis Quintet was recording dozens of tunes in 1956 for Prestige Records.
When an engineer asked for the next song's title, Davis shot back, I'll play it, and tell you what it is later.
Like the prolific jazz trumpeter and composer, researchers have been generating AI models at a feverish pace, exploring new architectures and use cases. Focused on plowing new ground, they sometimes leave to others the job of categorizing their work.
A team of more than a hundred Stanford researchers collaborated to do just that in a 214-page paper released in the summer of 2021.
In a 2021 paper, researchers reported that foundation models are finding a wide array of uses. They said transformer models, large language models (LLMs) and other neural networks still being built are part of an important new category they dubbed foundation models.
Foundation Models Defined A foundation model is an AI neural network - trained on mountains of raw data, generally with unsupervised learning - that can be adapted to accomplish a broad range of tasks, the paper said.
The sheer scale and scope of foundation models from the last few years have stretched our imagination of what's possible, they wrote.
Two important concepts help define this umbrella category: Data gathering is easier, and opportunities are as wide as the horizon.
No Labels, Lots of Opportunity Foundation models generally learn from unlabeled datasets, saving the time and expense of manually describing each item in massive collections.
Earlier neural networks were narrowly tuned for specific tasks. With a little fine-tuning, foundation models can handle jobs from translating text to analyzing medical images.
Foundation models are demonstrating impressive behavior, and they're being deployed at scale, the group said on the website of its research center formed to study them. So far, they've posted more than 50 papers on foundation models from in-house researchers alone.
I think we've uncovered a very small fraction of the capabilities of existing foundation models, let alone future ones, said Percy Liang, the center's director, in the opening talk of the first workshop on foundation models.
AI's Emergence and Homogenization In that talk, Liang coined two terms to describe foundation models:
Emergence refers to AI features still being discovered, such as the many nascent skills in foundation models. He calls the blending of AI algorithms and model architectures homogenization, a trend that helped form foundation models. (See chart below.)
The field continues to move fast.
A year after the group defined foundation models, other tech watchers coined a related term - generative AI. It's an umbrella term for transformers, large language models, diffusion models and other neural networks capturing people's imaginations because they can create text, images, music, software and more.
Generative AI has the potential to yield trillions of dollars of economic value, said executives from the venture firm Sequoia Capital who shared their views in a recent AI Podcast.
A Brief History of Foundation Models We are in a time where simple methods like neural networks are giving us an explosion of new capabilities, said Ashish Vaswani, an entrepreneur and former senior staff research scientist at Google Brain who led work on the seminal 2017 paper on transformers.
That work inspired researchers who created BERT and other large language models, making 2018 a watershed moment for natural language processing, a report on AI said at the end of that year.
Google released BERT as open-source software, spawning a family of follow-ons and setting off a race to build ever larger, more powerful LLMs. Then it applied the technology to its search engine so users could ask questions in simple sentences.
In 2020, researchers at OpenAI announced another landmark transformer, GPT-3. Within weeks, people were using it to create poems, programs, songs, websites and more.
Language models have a wide range of beneficial applications for society, the researchers wrote.
Their work also showed how large and compute-intensive these models can be. GPT-3 was trained on a dataset with nearly a trillion words, and it sports a whopping 175 billion parameters, a key measure of the power and complexity of neural networks.
The growth in compute demands for foundation models. (Source: GPT-3 paper) I just remember being kind of blown away by the things that it could do, said Liang, speaking of GPT-3 in a podcast.
The latest iteration, ChatGPT - trained on 10,000 NVIDIA GPUs - is even more engaging, attracting over 100 million users in just two months. Its release has been called the iPhone moment for AI because it helped so many people see how they could use the technology.
One timeline describes the path from early AI research to ChatGPT. (Source: blog.bytebytego.com) From Text to Images About the same time ChatGPT debuted, another class of neural networks, called diffusion models, made a splash. Their ability to turn text descriptions into artistic images attracted casual users to create amazing images that went viral on social media.
The first paper to describe a diffusion model arrived with little fanfare in 2015. But like transformers, the new technique soon caught fire.
Researchers posted more than 200 papers on diffusion models last year, according to a list maintained by James Thornton, an AI researcher at the University of Oxford.
In a tweet, Midjourney CEO David Holz revealed that his diffusion-based, text-to-image service has more than 4.4 million users. Serving them requires more than 10,000 NVIDIA GPUs mainly for AI inference, he said in an interview (subscription required).
Dozens of Models in Use Hundreds of foundation models are now available
Most recent headlines
04/09/2025
Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...
13/05/2025
Berklee Artists Take Center Stage at Top Music Festivals Thanks to the Berklee Popular Music Institute, students will perform at Lollapalooza, Governors Ball,...
13/05/2025
Solid Start to the Year Luxembourg, 30 April 2025 -- SES S.A. announces financial results for the three months ended 31 March 2025.
Revenue of 509 million (...
13/05/2025
LONDON IBC2025 is now open for registration, with the global media, entertainment and technology community set to gather at the RAI Amsterdam September 12 to 1...
13/05/2025
NEW YORK NBCUniversal at its upfront presentation laid outplans for new product features coming to Peacock for its NBA coverage, promising an unprecedented lev...
13/05/2025
NEW YORK In another example of how pay TV operators are looking to strengthen their bundled offerings by adding streaming services, Altice USA's Optimum has...
13/05/2025
Bolin Technology, producers of professional PTZ cameras and AV over IP solutions today announced a new distribution partnership with DigiBox in the UK and Irela...
13/05/2025
Leading South American internet subscription TV service provider Zapping Sports Ecuador has invested in a PlayBox Neo Channel-in-a-Box HD server to power its su...
13/05/2025
Pioneering DRM-based Solution Delivers Interactive Learning to Schools Without Internet Access
Encompass Digital Media has announced the completion of the worl...
13/05/2025
Techex, the UK-based expert in live video solutions over IP and cloud, will showcase its latest advances in software-defined broadcast technology at the Media P...
13/05/2025
Alfalite, the only European manufacturer of LED screens, will participate in CABSAT 2025 (May 13 15, Dubai World Trade Centre) to present its latest innovations...
13/05/2025
Test & measurement innovator, Leader Electronics, will showcase its LPX500 Quad-input Waveform Monitor, featuring 100G-IP and 12G-SDI toolsets, at BroadcastAsia...
13/05/2025
Larry Jordan Meets With Sam Bogoch of Axle.ai to Talk About the Power of AI Powe...
13/05/2025
As the manufacturing industry faces challenges - such as labor shortages, reshor...
13/05/2025
AI agents powered by large language models (LLMs) have grown past their FAQ chatbot beginnings to become true digital teammates capable of planning, reasoning a...
13/05/2025
Facebook
Twitter
LinkedIn
On the occasion of DEFEA, Greece's premier defence exhibition, Thales, a global leader in advanced technologies for the Defe...
13/05/2025
Facebook
Twitter
LinkedIn
After the success of the deployment in Brest in ...
13/05/2025
Facebook
Twitter
LinkedIn
Thales unveils its new multi-mission Primary Sur...
12/05/2025
As newsrooms around the world begin to experiment with artificial intelligence, many are asking the same question: how do we move beyond isolated pilots and emb...
12/05/2025
The Sundance Film Festival: CDMX 2025 program is composed of 15 feature films an...
12/05/2025
And They're Off! The Pure Drama of the Cycling Grand Tours Continues on SBS,...
12/05/2025
NEW YORK In an important step towards creating a unified framework for measurement and reporting of attention to media, the Interactive Advertising Bureau (IAB)...
12/05/2025
NEW YORK and LOS ANGELES Fox Corp. said it will call its direct-to-consumer streaming service Fox One....
12/05/2025
OWC Launches My OWC App to Further Streamline Setup, Support, and Ownership Expe...
12/05/2025
NUGEN Audio Unveils DialogCheck Speech Intelligibility Software
Brie Clayton May 12, 2025
0 Comments
Latest Plug-in Identifies Poor Dialog Clarity
NU...
12/05/2025
Building a Home Recording Studio? Here's What You Need. Experts break down the home studio setup and share tips to create a pro-level recording space on a...
12/05/2025
Following a successful mid-April NAB Show in Las Vegas, DHD will promote examples from its wide range of digital audio content creation products at the upcoming...
12/05/2025
With a body of international award-winning and premier festival-showing films, American Cinematographer David McFarland ( 12 Mighty Orphans , Mafak , The Ball...
12/05/2025
10 05 2025 - Media release Australian Filmmaker Lucy Mckendrick Set For Directin...
12/05/2025
Unlock Creative Flexibility with Shorts, Loops, and Stems in PremiumBeat Tracks
Brie Clayton May 12, 2025
0 Comments
PremiumBeat's studio-quality ...
12/05/2025
NVIDIA today received multiple accolades at COMPUTEX's Best Choice Awards, in recognition of innovation across the company.
The NVIDIA GeForce RTX 5090 GPU...
12/05/2025
12 May 2025
VEON Joins GSMA Advance's People Excellence Partner Program as ...
12/05/2025
May 12th, 2025 Press Materials Available Here
TRIBECA X Announces 2025 Speaker Lineup And Official Award Selections
Bryan Cranston, Paris Hilton, Lena Waithe...
12/05/2025
Food, Sports, Adventure and Relationship Content Drive 61 Percent Share For Warn...
12/05/2025
Perfect partners: Kicking off World Sevens Football's goals for its first br...
12/05/2025
Canon's Paul McAniff Shows the Latest in PTZ Auto-Tracking Tech The company showcased select solutions at the 2025 SVG Esports Production Summit in Seattle ...
12/05/2025
Fox Corporation Announces New Direct-to-Consumer Streaming Service: FOX One The platform will offer all of FOX's sports, news, and entertainment programming...
12/05/2025
Live from Eurovision: The greatest show on Earth returns with a focus on enhanci...
12/05/2025
Back to All News
Netflix ISP Speed Index for April 2025
Product
12 May 2025
Global
Link copied to clipboard
Three percent of Internet Service Providers (I...
12/05/2025
Riyadh, Saudi Arabia & Montreal, Canada - May 12, 2025 - Arabsat, a leading glob...
12/05/2025
Fox Corporation Reports Third Quarter Fiscal 2025 Financial Results NEW YORK, NY, May 12, 2025 - Fox Corporation (Nasdaq: FOXA, FOX; FOX or the Company ) t...
12/05/2025
FOX Unveils Name of New Streaming Service FOX One Featuring all of FOX's Premium News, Sports and Entertainment Programming in One Dynamic Platform New ...
12/05/2025
FOX Orders Propulsive, Edge-Of-Your-Seat Thriller Memory of a Killer For 2025-20...
12/05/2025
FOX Announces Schedule for 2025-26 Season #1 NETWORK IN A18-49 DELIVERS SLATE FILLED WITH FUN, IRREVERENT CONTENT AND BOLD, SIGNATURE FOX CHARACTERS ALL-NEW ...
12/05/2025
What is success? How is it achieved? And what does it really cost? These are the...
11/05/2025
Berklee Honors Andr 3000 and Sara Bareilles at 2025 Commencement This years honorary doctorate recipients were recognized for their profound influence as art...
10/05/2025
The National Film and Video Foundation (NFVF), an agency of the Department of Sp...
10/05/2025
NEW YORK During the IAB NewFronts 2025, Samsung Ads announced the debut of STN, the Samsung Television Network, a FAST channel with live content that will be av...
10/05/2025
SEATTLE T-Mobile has announced that it set a new uplink speed record of 550 Mbps in sub-6 GHz spectrum using cutting-edge 5G Advanced tech....
10/05/2025
WASHINGTON Twenty-two U.S. Senators have sent a letter to Federal Communications Commission chair Brendan Carr urging him to modernize ownership caps that are...