
How Scaling Laws Drive Smarter, More Powerful AI

12/02/2025

Just as there are widely understood empirical laws of nature - for example, what goes up must come down, or every action has an equal and opposite reaction - the field of AI was long defined by a single idea: that more compute, more training data and more parameters make a better AI model.

However, AI has since grown to need three distinct laws that describe how applying compute resources in different ways impacts model performance. Together, these AI scaling laws - pretraining scaling, post-training scaling and test-time scaling, also called long thinking - reflect how the field has evolved with techniques to use additional compute in a wide variety of increasingly complex AI use cases.

The recent rise of test-time scaling - applying more compute at inference time to improve accuracy - has enabled AI reasoning models, a new class of large language models (LLMs) that perform multiple inference passes to work through complex problems, while describing the steps required to solve a task. Test-time scaling requires intensive amounts of computational resources to support AI reasoning, which will drive further demand for accelerated computing.

What Is Pretraining Scaling?

Pretraining scaling is the original law of AI development. It demonstrated that by increasing training dataset size, model parameter count and computational resources, developers could expect predictable improvements in model intelligence and accuracy.

Each of these three elements - data, model size, compute - is interrelated. Per the pretraining scaling law, outlined in this research paper, when larger models are fed with more data, the overall performance of the models improves. To make this feasible, developers must scale up their compute - creating the need for powerful accelerated computing resources to run those larger training workloads.
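One way to make the relationship between these three elements concrete is a power-law loss curve. The sketch below is a minimal illustration, assuming a loss of the form L(N, D) = E + A / N^alpha + B / D^beta and the common rule of thumb that training costs roughly 6 * N * D floating-point operations. The functional form, the function names and every constant are placeholder assumptions chosen for illustration, not values taken from the research paper referenced above.

```python
# Illustrative only: the functional form and all constants below are
# assumptions, not fitted values from any paper.
E, A, B = 2.0, 100.0, 100.0   # hypothetical irreducible loss and scale factors
ALPHA, BETA = 0.3, 0.3        # hypothetical power-law exponents


def predicted_loss(n_params: float, n_tokens: float) -> float:
    """Loss falls as a power law in parameter count and training tokens."""
    return E + A / n_params**ALPHA + B / n_tokens**BETA


def training_flops(n_params: float, n_tokens: float) -> float:
    """Rule-of-thumb compute estimate: about 6 FLOPs per parameter per token."""
    return 6.0 * n_params * n_tokens


if __name__ == "__main__":
    # Scale model size and dataset size together and watch loss and compute.
    for n_params, n_tokens in [(1e9, 2e10), (1e10, 2e11), (1e11, 2e12)]:
        loss = predicted_loss(n_params, n_tokens)
        flops = training_flops(n_params, n_tokens)
        print(f"N={n_params:.0e} D={n_tokens:.0e} loss={loss:.3f} FLOPs={flops:.1e}")
```

Under these assumed constants, each tenfold increase in both parameters and tokens lowers the predicted loss while multiplying the compute estimate by 100, which is the trade-off the paragraph above describes.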

This principle of pretraining scaling led to large models that achieved groundbreaking capabilities. It also spurred major innovations in model architecture, including the rise of billion- and trillion-parameter transformer models, mixture of experts models and new distributed training techniques - all demanding significant compute.

And the relevance of the pretraining scaling law continues - as humans continue to produce growing amounts of multimodal data, this trove of text, images, audio, video and sensor information will be used to train powerful future AI models.

Pretraining scaling is the foundational principle of AI development, linking the size of models, datasets and compute to AI gains. Mixture of experts, depicted above, is a popular model architecture for AI training.

What Is Post-Training Scaling?

Pretraining a large foundation model isn't for everyone - it takes significant investment, skilled experts and datasets. But once an organization pretrains and releases a model, they lower the barrier to AI adoption by enabling others to use their pretrained model as a foundation to adapt for their own applications.

This post-training process drives additional cumulative demand for accelerated computing across enterprises and the broader developer community. Popular open-source models can have hundreds or thousands of derivative models, trained across numerous domains.

Developing this ecosystem of derivative models for a variety of use cases could take around 30x more compute than pretraining the original foundation model.

Post-training techniques can further improve a model's specificity and relevance for an organization's desired use case. While pretraining is like sending an AI model to school to learn foundational skills, post-training enhances the model with skills applicable to its intended job. An LLM, for example, could be post-trained to tackle a task like sentiment analysis or translation - or understand the jargon of a specific domain, like healthcare or law.

The post-training scaling law posits that a pretrained model's performance can further improve - in computational efficiency, accuracy or domain specificity - using techniques including fine-tuning, pruning, quantization, distillation, reinforcement learning and synthetic data augmentation.

Fine-tuning uses additional training data to tailor an AI model for specific domains and applications. This can be done using an organization's internal datasets, or with pairs of sample model inputs and outputs.

Distillation requires a pair of AI models: a large, complex teacher model and a lightweight student model. In the most common distillation technique, called offline distillation, the student model learns to mimic the outputs of a pretrained teacher model.
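The idea of a student mimicking a teacher's outputs can be sketched as a loss function. The example below is a minimal illustration, assuming the student is trained to match the teacher's temperature-softened output distribution by minimizing a KL divergence, which is one common formulation. The function names, the temperature value and the sample logits are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence from the teacher's softened distribution to the student's."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return sum(
        p * (math.log(p) - math.log(q))
        for p, q in zip(p_teacher, p_student)
        if p > 0.0
    )


if __name__ == "__main__":
    teacher = [4.0, 1.0, 0.5]  # hypothetical teacher logits for one input
    print(distillation_loss([4.0, 1.0, 0.5], teacher))  # student matches teacher
    print(distillation_loss([0.5, 1.0, 4.0], teacher))  # student disagrees
```

The loss is zero when the student reproduces the teacher's distribution and grows as the two diverge, so minimizing it over many inputs pushes the lightweight model toward the larger model's behavior.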

Reinforcement learning, or RL, is a machine learning technique that uses a reward model to train an agent to make decisions that align with a specific use case. The agent aims to make decisions that maximize cumulative rewards over time as it interacts with an environment - for example, a chatbot LLM that is positively reinforced by thumbs-up reactions from users. This technique is known as reinforcement learning from human feedback (RLHF). Another, newer technique, reinforcement learning from AI feedback (RLAIF), instead uses feedback from AI models to guide the learning process, streamlining post-training efforts.

Best-of-n sampling generates multiple outputs from a language model and selects the one with the highest reward score based on a reward model. It's often used to improve an AI's outputs without modifying model parameters, offering an alternative to fine-tuning with reinforcement learning.
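The procedure described above fits in a few lines. This is a minimal sketch, assuming the language model and the reward model are available as plain callables. Here `toy_generate` and `toy_reward` are hypothetical stand-ins, not real model APIs.

```python
import random
from typing import Callable, List


def best_of_n(
    prompt: str,
    generate: Callable[[str], str],
    reward: Callable[[str, str], float],
    n: int = 4,
) -> str:
    """Sample n candidates and return the one the reward model scores highest."""
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda text: reward(prompt, text))


# Hypothetical stand-ins for a language model and a reward model.
def toy_generate(prompt: str) -> str:
    return random.choice(["ok", "a fuller answer", "a much more detailed answer"])


def toy_reward(prompt: str, text: str) -> float:
    return float(len(text))  # placeholder scoring: longer answers score higher


if __name__ == "__main__":
    print(best_of_n("Explain scaling laws.", toy_generate, toy_reward, n=4))
```

No model parameters change anywhere in this loop. The extra cost is the n generation calls plus n reward evaluations.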

Search methods explore a range of potential decision paths before selecting a final output. This post-training technique can iteratively improve the model's responses.
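Beam search is one example of such a method. The sketch below is a minimal illustration, assuming a hypothetical `expand` function that proposes next steps for a partial path and a hypothetical `score` function that rates a path. Neither is taken from the source.

```python
from typing import Callable, List, Sequence, Tuple

Path = Tuple[str, ...]


def beam_search(
    start: Path,
    expand: Callable[[Path], Sequence[str]],
    score: Callable[[Path], float],
    beam_width: int = 2,
    max_steps: int = 3,
) -> Path:
    """Keep the beam_width best partial paths at each step; return the best one."""
    beam: List[Path] = [start]
    for _ in range(max_steps):
        # Extend every surviving path by every proposed next step.
        candidates = [path + (step,) for path in beam for step in expand(path)]
        if not candidates:
            break
        # Prune to the highest-scoring paths before the next round.
        beam = sorted(candidates, key=score, reverse=True)[:beam_width]
    return max(beam, key=score)


if __name__ == "__main__":
    # Toy problem: each step is "a" or "b", and paths with more "b" score higher.
    best = beam_search((), lambda path: ["a", "b"], lambda path: path.count("b"))
    print(best)
```

A wider beam explores more decision paths at the cost of more scoring calls per step.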
LINK: https://blogs.nvidia.com/blog/ai-scaling-laws/...