Sony Pixel Power calrec Sony

How Scaling Laws Drive Smarter, More Powerful AI

12/02/2025

Just as there are widely understood empirical laws of nature - for example, what goes up must come down, or every action has an equal and opposite reaction - the field of AI was long defined by a single idea: that more compute, more training data and more parameters makes a better AI model.

However, AI has since grown to need three distinct laws that describe how applying compute resources in different ways impacts model performance. Together, these AI scaling laws - pretraining scaling, post-training scaling and test-time scaling, also called long thinking - reflect how the field has evolved with techniques to use additional compute in a wide variety of increasingly complex AI use cases.

The recent rise of test-time scaling - applying more compute at inference time to improve accuracy - has enabled AI reasoning models, a new class of large language models (LLMs) that perform multiple inference passes to work through complex problems, while describing the steps required to solve a task. Test-time scaling requires intensive amounts of computational resources to support AI reasoning, which will drive further demand for accelerated computing.

What Is Pretraining Scaling? Pretraining scaling is the original law of AI development. It demonstrated that by increasing training dataset size, model parameter count and computational resources, developers could expect predictable improvements in model intelligence and accuracy.

Each of these three elements - data, model size, compute - is interrelated. Per the pretraining scaling law, outlined in this research paper, when larger models are fed with more data, the overall performance of the models improves. To make this feasible, developers must scale up their compute - creating the need for powerful accelerated computing resources to run those larger training workloads.

This principle of pretraining scaling led to large models that achieved groundbreaking capabilities. It also spurred major innovations in model architecture, including the rise of billion- and trillion-parameter transformer models, mixture of experts models and new distributed training techniques - all demanding significant compute.

And the relevance of the pretraining scaling law continues - as humans continue to produce growing amounts of multimodal data, this trove of text, images, audio, video and sensor information will be used to train powerful future AI models.

Pretraining scaling is the foundational principle of AI development, linking the size of models, datasets and compute to AI gains. Mixture of experts, depicted above, is a popular model architecture for AI training. What Is Post-Training Scaling? Pretraining a large foundation model isn't for everyone - it takes significant investment, skilled experts and datasets. But once an organization pretrains and releases a model, they lower the barrier to AI adoption by enabling others to use their pretrained model as a foundation to adapt for their own applications.

This post-training process drives additional cumulative demand for accelerated computing across enterprises and the broader developer community. Popular open-source models can have hundreds or thousands of derivative models, trained across numerous domains.

Developing this ecosystem of derivative models for a variety of use cases could take around 30x more compute than pretraining the original foundation model.

Developing this ecosystem of derivative models for a variety of use cases could take around 30x more compute than pretraining the original foundation model.

Post-training techniques can further improve a model's specificity and relevance for an organization's desired use case. While pretraining is like sending an AI model to school to learn foundational skills, post-training enhances the model with skills applicable to its intended job. An LLM, for example, could be post-trained to tackle a task like sentiment analysis or translation - or understand the jargon of a specific domain, like healthcare or law.

The post-training scaling law posits that a pretrained model's performance can further improve - in computational efficiency, accuracy or domain specificity - using techniques including fine-tuning, pruning, quantization, distillation, reinforcement learning and synthetic data augmentation.

Fine-tuning uses additional training data to tailor an AI model for specific domains and applications. This can be done using an organization's internal datasets, or with pairs of sample model input and outputs.

Distillation requires a pair of AI models: a large, complex teacher model and a lightweight student model. In the most common distillation technique, called offline distillation, the student model learns to mimic the outputs of a pretrained teacher model.

Reinforcement learning, or RL, is a machine learning technique that uses a reward model to train an agent to make decisions that align with a specific use case. The agent aims to make decisions that maximize cumulative rewards over time as it interacts with an environment - for example, a chatbot LLM that is positively reinforced by thumbs up reactions from users. This technique is known as reinforcement learning from human feedback (RLHF). Another, newer technique, reinforcement learning from AI feedback (RLAIF), instead uses feedback from AI models to guide the learning process, streamlining post-training efforts.

Best-of-n sampling generates multiple outputs from a language model and selects the one with the highest reward score based on a reward model. It's often used to improve an AI's outputs without modifying model parameters, offering an alternative to fine-tuning with reinforcement learning.

Search methods explore a range of potential decision paths before selecting a final output. This post-training technique can iteratively improve the model's responses
LINK: https://blogs.nvidia.com/blog/ai-scaling-laws/...
See more stories from nvidia

Most recent headlines

27/03/2025

ESPN Platforms Scores Strong Ratings for NCAA Women's Tournament

Women's college basketball continued to produce hefty audiences in the first and second rounds of the NCAA Women's Tournament, with ESPN platforms repor...

27/03/2025

Sony To Feature New VENICE Extension System Mini At 2025 NAB Show

SYDNEY Sony Electronics will showcase its newly announced VENICE Extension System Mini (CBK-3621XS), the latest addition to its CineAlta lineup, during the 2025...

27/03/2025

The WNET Group Names Dana Roberson GM, Thirteen and Production Operations

NEW YORK The WNET Group, the parent company of the PBS station Thirteen, has announced the appointment of Dana Roberson to general manager, Thirteen and product...

27/03/2025

NBCU to Launch 40 FAST Channels on LG Channels

NEW YORK NBCUniversal and LG Electronics (LG) have announced a deal that will make a wide variety of content available from NBCU on LG smart TVs and add e-comme...

27/03/2025

DirecTV Joins IBCAP Anti-Piracy Group

DENVER The International Broadcaster Coalition Against Piracy (IBCAP) has announced the addition of DirecTV, a leading video distribution company in the U.S., a...

26/03/2025

Inside the Archives: Celebrating the Power of Women-Driven Cinema

Director Gina Prince-Bythewood reviews the script for her beloved film Love and Basketball, which she workshopped at the 1998 Directors Lab before it premiere...

26/03/2025

Svenska artister genererar rekordutbetalningar frn Spotify: 1,9 miljarder under 2024

nda sedan Spotify grundades har vi arbetat h rt f r att ka betalningsviljan f r...

26/03/2025

Meet the New Class of Artists Shaping the Future of Spanish Music

Spotify's RADAR program spotlights rising talent from around the world, and in Spain, it's been making waves since 2020. Five years in, RADAR Spain has ...

26/03/2025

The Secret DNA of Us premieres 17 April

The Secret DNA of Us premieres 17 April 20 March, 2025 Media releases Family secrets, royal relationships and hidden histories revealed What would you disc...

26/03/2025

L3Harris' New Advanced Large Solid Rocket Motors Power Successful US Missile Defense Test

A Medium Range Ballistic Missile with a Hypersonic Target Vehicle (HTV) - 1 fron...

26/03/2025

WPLG Selects Ikegami HDK-X500 Cameras For New Production Studio

MIAMI WPLG, the Berkshire Hathaway-owned station, has purchased four complete Ikegami HDK-X500 camera systems, including a BSX-1000 base station and OCP-300 ope...

26/03/2025

GatesAir to Debut ASTC 3.0 Modulator for LPTV at 2025 NAB Show

CINCINNATI GatesAir has introduced a new cost-efficient ATSC 3.0-ready platform that prepares its low-power TV transmission products for NextGen TV opportunitie...

26/03/2025

Nab Show: Appear to Introduce VX Media Gateway

OSLO, Norway Appear has announced that it will introduce the VX Media Gateway at the 2025 NAB Show 2025 and that it will be showing the solution at its Booth W2...

26/03/2025

Very little for creative industries' in Spring Statement 2025

The Chancellor presented her speech in the House of Commons today By Matthew Corrigan Published: March 26, 2025 The Chancellor presented her speech in the...

26/03/2025

Disguise to Showcase Immersive Sports Programming Technologies at NAB 2025

Disguise to Showcase Immersive Sports Programming Technologies at NAB 2025 Brie Clayton March 26, 2025 0 Comments See Demos Across Partner Booths; Wat...

26/03/2025

Archiware PresentsUpcoming P5 Version 7.4 at NAB Show 2025

Archiware PresentsUpcoming P5 Version 7.4 at NAB Show 2025 Brie Clayton March 26, 2025 0 Comments Archiware, a leading provider of data management sof...

26/03/2025

Avid Redefines Digital-First News Production at NAB Show 2025

Avid Redefines Digital-First News Production at NAB Show 2025 Brie Clayton March 26, 2025 0 Comments Avid and Wolftech Debuting Seamless, Digital-Firs...

26/03/2025

Adobe, Avid, AWS, Dolby join MovieLabs Industry Forum Leadership Council

The council will guide key focus areas to help grow a more secure, interoperable and effective media creation ecosystem, said the organisation By Matthew Corri...

26/03/2025

Meet the executive vice president

Donna Thomas, executive vice president, Vubiquity talks to TVBEurope about how nurturing an early entrepreneurial streak paved the way for her career By Matthe...

26/03/2025

Is MediaForEurope preparing to launch its bid for ProSieben?

The Italian company has reportedly called a board meeting for today to evaluate its options By Jenny Priestley Published: March 26, 2025 The Italian compa...

26/03/2025

Spring Statement 2025: UK Chancellor outlines plans ahead of June Spending Review

The Chancellor presented her speech in the House of Commons today By Matthew Co...

26/03/2025

Cine Gear Expo to be Held at Universal Studios Lot Univer...

Los Angeles, California: The industry's premier filmmaking exposition has chosen the world-renowned Universal Studios Lot for Cine Gear Expo LA 2025. Regist...

26/03/2025

MovieLabs Announces Industry Forum Leadership Council and...

MovieLabs, the technology joint venture of the major Hollywood studios today announced the formation of a Leadership Council and the inaugural member companies ...

26/03/2025

Magewell to Showcase Newest Innovations at 2025 NAB Show

Magewell, developer of innovative, high-performance video I/O and IP workflow solutions, will be showcasing its latest innovations at the 2025 NAB Show, Las Veg...

26/03/2025

Breaking the Bottleneck - Building Time to Value Solution...

The media industry's rapid transformation presents organizations with both unprecedented opportunities and challenges. Over the years, Dalet has been at the...

26/03/2025

Pixel Power Announces Brad Rochon as Senior Business Deve...

Pixel Power, a Rohde & Schwarz company, is pleased to announce that Brad Rochon has recently joined the company in the newly created position of Senior Business...

26/03/2025

Avid Redefines Digital-First News Production at NAB Show...

Avid will showcase its most advanced news production solutions, designed to accelerate digital-first, story-driven journalism and broaden audience reach, at NA...

26/03/2025

Net Insight and Globecast power Premier Padels global exp...

Net Insight and Globecast are providing Premier Padel, the leading professional padel tour, with a cutting-edge IP and cloud-based distribution solution. Levera...

26/03/2025

Vizrt delivers modern broadcast MAM demands with Viz One...

Vizrt, the leader in real-time graphics and live production solutions for content creators, today announces Viz One 8, the biggest update to its Enterprise Medi...

26/03/2025

Limecraft Teams with DPG Media to Launch New Capabilities...

With the 2025 NAB Show approaching, Limecraft announces the release of the second in a series of eight major platform updates planned for this year. Building on...

26/03/2025

Kiloview to Showcase Its Innovative AV-over-IP Solutions...

Kiloview has announced to unveil its most complete and lightweight broadcast solutions at NAB 2025 in Las Vegas. Located at SL9413, the company will showcase it...

26/03/2025

Avid Discusses New Leadership, NAB Show Plans

BURLINGTON, Mass. At the 2025 NAB Show, April 6-9 in Las Vegas, Avid will showcase new features to its MediaCentral platform, featuring the latest AI-powered ne...

26/03/2025

YouTube Sees Record Viewing, Beats Disney in TV Viewing Share

NEW YORK YouTube hit record share of monthly TV viewing in February and had the largest share of TV viewing by the major media companies, according to Nielsen&#...

26/03/2025

MovieLabs Announces Industry Forum Leadership Council

SAN FRANCISCO MovieLabs, the technology joint venture of the major Hollywood studios today announced the formation of a Leadership Council and the inaugural mem...

26/03/2025

New Music USA and Berklee Institute of Jazz and Gender Justice Announce 2025 Next Jazz Legacy Cohort

New Music USA and Berklee Institute of Jazz and Gender Justice Announce 2025 Nex...

26/03/2025

VEON Appoints Anand Ramachandran as Corporate Development Officer

26 Mar 2025 VEON Appoints Anand Ramachandran as Corporate Development Officer Dubai, March 26, 2025: VEON Ltd. (Nasdaq: VEON), a global digital operator ( VEON...

26/03/2025

Telos Alliance Reveals New AudioTools Server Features at 2025 NAB Show

Telos Alliance Reveals New AudioTools Server Features at 2025 NAB Show Search Cleveland, Ohio (March 26, 2025) Telos Alliance , trusted global leader i...

26/03/2025

Hologic WTA Tour Clay Tournament Makes Move to Electronic Line Calling

Hologic WTA Tour Clay Tournament Makes Move to Electronic Line Calling By SVG Staff Wednesday, March 26, 2025 - 9:19 am Print This Story | Subscribe Sto...

26/03/2025

FUJIFILM Intros New 4K Broadcast Zoom Lens Designed for Portability, Ease of Use

FUJIFILM Intros New 4K Broadcast Zoom Lens Designed for Portability, Ease of Use By SVG Staff Wednesday, March 26, 2025 - 9:40 am Print This Story | Subsc...

26/03/2025

Immersive Experiences That Wow: How Stadiums, Arenas, and Venues Are Upping Their Game in Audience Engagement

Immersive Experiences That Wow: How Stadiums, Arenas, and Venues Are Upping Thei...

26/03/2025

Intelligent Automation: Tackling In-Venue Graphic Design Without The Staff Burnout

Intelligent Automation: Tackling In-Venue Graphic Design Without The Staff Burno...

26/03/2025

Save the Date: SVG Venue & Teams Summit Goes to the Intuit Dome in Los Angeles on July 23

Save the Date: SVG Venue & Teams Summit Goes to the Intuit Dome in Los Angeles o...

26/03/2025

Trailer drops for the second season of The Walking Dead: Dead City, returning exclusively to Sky and NOW this summer

To view this content, please enable our use of cookies. To do so, click Privacy ...

26/03/2025

Resident Playbook' Trailer Promises Warmth, Humor, and Heartfelt Journeys of Young Doctors

Back to All News Resident Playbook' Trailer Promises Warmth, Humor, and He...

26/03/2025

Jnger Audio at NAB 2025

J nger Audio will be showcasing at the NAB 2025 trade show. You can find us in North Hall at the booth of our international distributor, Telos Alliance Stand N7...

26/03/2025

FOR-A America Delivers its Largest Commercial XR Wall to Date

Curved, 13 x32 Alfalite LED Wall Serves as Dynamic Backdrop at New ST 2110 Walmart TV Studio...

26/03/2025

FilmLight Cocktail Party. Las Vegas. Sunday April 6

Sunday April 6, from 6pm Easy's Cocktail Lounge / ARIA Resort & Casino Register now Join us between 6-9pm to meet up with the FilmLight team and your f...

26/03/2025

2025-03-26

For the finale, there was a lot of experimenting with structure and testing out different ideas about how to play out different scenes, says Richman. It was a...

26/03/2025

Buzz Solutions Uses Vision AI to Supercharge the Electric Grid

The reliability of the electric grid is critical. From handling demand surges and evolving power needs to preventing infrastructure failures that can cause wil...