
Just as there are widely understood empirical laws of nature - for example, what goes up must come down, or every action has an equal and opposite reaction - the field of AI was long defined by a single idea: that more compute, more training data and more parameters make a better AI model.
However, AI has since grown to need three distinct laws that describe how applying compute resources in different ways impacts model performance. Together, these AI scaling laws - pretraining scaling, post-training scaling and test-time scaling, also called long thinking - reflect how the field has evolved with techniques to use additional compute in a wide variety of increasingly complex AI use cases.
The recent rise of test-time scaling - applying more compute at inference time to improve accuracy - has enabled AI reasoning models, a new class of large language models (LLMs) that perform multiple inference passes to work through complex problems, while describing the steps required to solve a task. Test-time scaling requires intensive amounts of computational resources to support AI reasoning, which will drive further demand for accelerated computing.
What Is Pretraining Scaling?
Pretraining scaling is the original law of AI development. It demonstrated that by increasing training dataset size, model parameter count and computational resources, developers could expect predictable improvements in model intelligence and accuracy.
Each of these three elements - data, model size, compute - is interrelated. Per the pretraining scaling law, outlined in this research paper, when larger models are fed with more data, the overall performance of the models improves. To make this feasible, developers must scale up their compute - creating the need for powerful accelerated computing resources to run those larger training workloads.
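The predictable improvement described above is often modeled as a power law. The sketch below is only an illustration under stated assumptions: the functional form `(n_c / n) ** alpha`, the function name `power_law_loss` and the constants `n_c` and `alpha` are hypothetical choices for this example, not values taken from the research paper the article refers to.

```python
def power_law_loss(n: float, n_c: float = 1e13, alpha: float = 0.08) -> float:
    """Hypothetical loss as a function of scale n (for example, parameter count).

    n_c and alpha are illustrative constants, not fitted values.
    """
    if n <= 0:
        raise ValueError("scale must be positive")
    return (n_c / n) ** alpha


# Evaluate the assumed law at scales that each grow by 10x.
scales = [1e8, 1e9, 1e10, 1e11]
losses = [power_law_loss(n) for n in scales]

for n, loss in zip(scales, losses):
    print(f"scale={n:.0e}  loss={loss:.3f}")
```

Under this assumed form, every 10x increase in scale multiplies the loss by the same factor, `10 ** -alpha`, which is what makes the gains predictable and also why each further improvement demands much more compute.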
This principle of pretraining scaling led to large models that achieved groundbreaking capabilities. It also spurred major innovations in model architecture, including the rise of billion- and trillion-parameter transformer models, mixture of experts models and new distributed training techniques - all demanding significant compute.
And the relevance of the pretraining scaling law continues - as humans continue to produce growing amounts of multimodal data, this trove of text, images, audio, video and sensor information will be used to train powerful future AI models.
Pretraining scaling is the foundational principle of AI development, linking the size of models, datasets and compute to AI gains. Mixture of experts, depicted above, is a popular model architecture for AI training.
What Is Post-Training Scaling?
Pretraining a large foundation model isn't for everyone - it takes significant investment, skilled experts and datasets. But once an organization pretrains and releases a model, it lowers the barrier to AI adoption by enabling others to use its pretrained model as a foundation to adapt for their own applications.
This post-training process drives additional cumulative demand for accelerated computing across enterprises and the broader developer community. Popular open-source models can have hundreds or thousands of derivative models, trained across numerous domains.
Developing this ecosystem of derivative models for a variety of use cases could take around 30x more compute than pretraining the original foundation model.
Post-training techniques can further improve a model's specificity and relevance for an organization's desired use case. While pretraining is like sending an AI model to school to learn foundational skills, post-training enhances the model with skills applicable to its intended job. An LLM, for example, could be post-trained to tackle a task like sentiment analysis or translation - or understand the jargon of a specific domain, like healthcare or law.
The post-training scaling law posits that a pretrained model's performance can further improve - in computational efficiency, accuracy or domain specificity - using techniques including fine-tuning, pruning, quantization, distillation, reinforcement learning and synthetic data augmentation.
Fine-tuning uses additional training data to tailor an AI model for specific domains and applications. This can be done using an organization's internal datasets, or with pairs of sample model inputs and outputs.
Distillation requires a pair of AI models: a large, complex teacher model and a lightweight student model. In the most common distillation technique, called offline distillation, the student model learns to mimic the outputs of a pretrained teacher model.
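The teacher-student setup above can be sketched with a soft-target loss. This is a minimal illustration, assuming one common formulation (KL divergence between temperature-softened output distributions); the article does not say which loss is used, and the function names and logit values below are made up for the example.

```python
import math
from typing import List


def softmax(logits: List[float], temperature: float = 1.0) -> List[float]:
    """Convert logits to probabilities; a higher temperature gives a softer distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits: List[float],
                      student_logits: List[float],
                      temperature: float = 2.0) -> float:
    """KL(teacher || student) over temperature-softened distributions."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical logits for a single three-class example.
teacher = [4.0, 1.0, 0.5]
close_student = [3.8, 1.1, 0.4]   # nearly mimics the teacher
far_student = [0.5, 1.0, 4.0]     # disagrees with the teacher

loss_close = distillation_loss(teacher, close_student)
loss_far = distillation_loss(teacher, far_student)
print(f"close student: {loss_close:.4f}, far student: {loss_far:.4f}")
```

Training the student to minimize a loss of this kind pushes its output distribution toward the teacher's, which is the sense in which the student "mimics" the pretrained teacher.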
Reinforcement learning, or RL, is a machine learning technique that uses a reward model to train an agent to make decisions that align with a specific use case. The agent aims to make decisions that maximize cumulative rewards over time as it interacts with an environment - for example, a chatbot LLM that is positively reinforced by thumbs up reactions from users. When the reward signal comes from people in this way, the technique is known as reinforcement learning from human feedback (RLHF). Another, newer technique, reinforcement learning from AI feedback (RLAIF), instead uses feedback from AI models to guide the learning process, streamlining post-training efforts.
Best-of-n sampling generates multiple outputs from a language model and selects the one with the highest reward score based on a reward model. It's often used to improve an AI's outputs without modifying model parameters, offering an alternative to fine-tuning with reinforcement learning.
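Best-of-n sampling as described above can be sketched in a few lines. The generator and reward function here are toy stand-ins (hypothetical names `toy_generate` and `toy_reward`); in practice they would be a language model and a trained reward model, neither of which the article specifies.

```python
import random
from typing import Callable, List


def best_of_n(prompt: str,
              generate: Callable[[str], str],
              reward: Callable[[str, str], float],
              n: int = 4) -> str:
    """Sample n candidate responses and return the one the reward function scores highest."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda response: reward(prompt, response))


# Toy stand-ins, for illustration only.
rng = random.Random(0)
CANNED = ["ok", "a short answer", "a longer and more detailed answer"]


def toy_generate(prompt: str) -> str:
    """Pretend model: returns a random canned response."""
    return rng.choice(CANNED)


def toy_reward(prompt: str, response: str) -> float:
    """Pretend reward model: simply prefers longer responses."""
    return float(len(response))


best = best_of_n("Explain scaling laws", toy_generate, toy_reward, n=8)
print(best)
```

Note that the model's parameters are never updated; the extra compute goes into generating and scoring more candidates, which is why the approach is an alternative to fine-tuning with reinforcement learning.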
Search methods explore a range of potential decision paths before selecting a final output. This post-training technique can iteratively improve the model's responses.
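The article does not name a particular search algorithm. As one possible illustration, the sketch below uses beam search, which keeps the few best partial paths at each step rather than committing to a single one. The tree, rewards and function names are hypothetical, chosen so that a greedy choice misses the best final path.

```python
from typing import Callable, Dict, List, Tuple

Path = Tuple[str, ...]

# Hypothetical decision tree: each partial path maps to next tokens and their rewards.
TOY_TREE: Dict[Path, Dict[str, float]] = {
    (): {"a": 1.0, "b": 0.9},
    ("a",): {"c": 0.1},
    ("b",): {"d": 2.0},
}


def expand(path: Path) -> List[Path]:
    """Return every one-step extension of a partial path."""
    return [path + (token,) for token in TOY_TREE.get(path, {})]


def score(path: Path) -> float:
    """Sum the rewards collected along the path."""
    return sum(TOY_TREE[path[:i]][path[i]] for i in range(len(path)))


def beam_search(start: Path,
                expand_fn: Callable[[Path], List[Path]],
                score_fn: Callable[[Path], float],
                beam_width: int = 2,
                depth: int = 2) -> Path:
    """Keep the beam_width best partial paths at each step, then return the best one."""
    beam = [start]
    for _ in range(depth):
        candidates = [nxt for path in beam for nxt in expand_fn(path)]
        if not candidates:
            break
        candidates.sort(key=score_fn, reverse=True)
        beam = candidates[:beam_width]
    return max(beam, key=score_fn)


greedy = beam_search((), expand, score, beam_width=1)
wide = beam_search((), expand, score, beam_width=2)
print(greedy, wide)
```

With a beam width of 1 the search commits to the locally best first step and ends on the weaker path, while a width of 2 keeps the alternative alive long enough to find the higher-scoring one, at the cost of evaluating more candidates.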