
Math Test? No Problems: NVIDIA Team Scores Kaggle Win With Reasoning Model

15/04/2025

The final days of the AI Mathematical Olympiad's latest competition were a transcontinental relay for team NVIDIA.

Every evening, two team members on opposite ends of the U.S. would submit an AI reasoning model to Kaggle - the online Olympics of data science and machine learning. They'd wait a tense five hours before learning how well the model tackled a sample set of 50 complex math problems.

After seeing the results, the U.S. team would pass the baton to teammates waking up in Armenia, Finland, Germany and Northern Ireland, who would spend their day testing, modifying and optimizing different model versions.

"Every night I'd be so disappointed in our score, but then I'd wake up and see the messages that came in overnight from teammates in Europe," said Igor Gitman, senior applied scientist. "My hopes would go up and we'd try again."

While the team was disheartened by their lack of improvement on the public dataset during the competition's final days, the real test of an AI model is how well it can generalize to unseen data. That's where their reasoning model leapt to the top of the leaderboard - correctly answering 34 out of 50 Olympiad questions within a five-hour time limit using a cluster of four NVIDIA L4 GPUs.

"We got the magic in the end," said Northern Ireland-based team member Darragh Hanley, a Kaggle grandmaster and senior large language model (LLM) technologist.

Building a Winning Equation

The NVIDIA team competed under the name NemoSkills - a nod to their use of the NeMo-Skills collection of pipelines for accelerated LLM training, evaluation and inference. The seven members each contributed different areas of expertise, spanning LLM training, model distillation and inference optimization.

For the Kaggle challenge, over 2,200 participating teams submitted AI models tasked with solving 50 math questions - complex problems at the National Olympiad level, spanning algebra, geometry, combinatorics and number theory - within five hours.

https://blogs.nvidia.com/wp-content/uploads/2025/04/Sample-Reasoning-AI.mp4

The team's winning model uses a combination of natural language reasoning and Python code execution.

To complete this inference challenge on the small cluster of NVIDIA L4 GPUs available via Kaggle, the NemoSkills team had to get creative.

Their winning model used Qwen2.5-14B-Base, a foundation model with chain-of-thought reasoning capabilities, which the team fine-tuned on millions of synthetically generated solutions to math problems.

These synthetic solutions were primarily generated by two larger reasoning models - DeepSeek-R1 and QwQ-32B - and used to teach the team's foundation model via a form of knowledge distillation. The end result was a smaller, faster, long-thinking model capable of tackling complex problems using a combination of natural language reasoning and Python code execution.
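As a rough illustration of what combining natural language reasoning with Python code execution can look like, the sketch below pulls Python snippets out of a model response and runs them. It is not the team's implementation: the fenced-block convention, the `execute_code_blocks` helper and the sample response are all assumptions made for this example, and a real system would sandbox the code and enforce time limits.

```python
import contextlib
import io
import re

# Assumption: the model marks code with markdown-style fences.
# The fence marker is built at runtime so this example can itself sit inside a fence.
FENCE = "`" * 3
CODE_BLOCK = re.compile(FENCE + r"python\n(.*?)" + FENCE, re.DOTALL)


def execute_code_blocks(model_output: str) -> list:
    """Run each fenced Python block and return what each one printed.

    Hypothetical helper for illustration only; exec() on untrusted model
    output is unsafe without a sandbox.
    """
    outputs = []
    for match in CODE_BLOCK.finditer(model_output):
        buffer = io.StringIO()
        with contextlib.redirect_stdout(buffer):
            exec(match.group(1), {})  # fresh namespace for every block
        outputs.append(buffer.getvalue().strip())
    return outputs


# A made-up response that mixes prose with a short computation.
response = (
    "The sum of the first 100 positive integers can be checked directly.\n"
    + FENCE + "python\nprint(sum(range(1, 101)))\n" + FENCE + "\n"
    "So the answer is 5050."
)

print(execute_code_blocks(response))  # ['5050']
```

In a setup like this, the printed result would typically be fed back to the model so that it can continue reasoning with an exact value rather than an estimate.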

To further boost performance, the team's solution reasons through multiple long-thinking responses in parallel before determining a final answer. To optimize this process and meet the competition's time limit, the team also used an innovative early-stopping technique.

A reasoning model might, for example, be set to answer a math problem 12 different times before picking the most common response. Using the asynchronous processing capabilities of NeMo-Skills and NVIDIA TensorRT-LLM, the team was able to monitor inference and exit early if the model had already converged on the same answer four or more times.
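The early-stopping idea described above can be sketched with nothing but Python's standard `asyncio` module. This is a minimal illustration, not the team's NeMo-Skills or TensorRT-LLM code: `sample_answer` is a hypothetical stand-in for one model generation, and the values 12 and 4 are taken from the example in the text.

```python
import asyncio
from collections import Counter


async def sample_answer(answer: int, delay: float) -> int:
    """Hypothetical stand-in for one long-thinking generation."""
    await asyncio.sleep(delay)
    return answer


async def vote_with_early_stop(generations, stop_at: int = 4) -> int:
    """Run generations concurrently and stop once one answer appears `stop_at` times.

    Falls back to a plain majority vote if no answer reaches the threshold.
    """
    tasks = [asyncio.ensure_future(g) for g in generations]
    counts = Counter()
    try:
        for finished in asyncio.as_completed(tasks):
            answer = await finished
            counts[answer] += 1
            if counts[answer] >= stop_at:
                return answer  # enough agreement: skip the remaining generations
        return counts.most_common(1)[0][0]
    finally:
        for task in tasks:
            task.cancel()  # no effect on tasks that already finished
        await asyncio.gather(*tasks, return_exceptions=True)


def demo() -> int:
    # 12 simulated samples: the answer 42 arrives quickly, the others slowly.
    answers = [42, 42, 7, 42, 13, 42, 42, 7, 42, 13, 42, 42]
    generations = [sample_answer(a, 0.01 if a == 42 else 0.5) for a in answers]
    return asyncio.run(vote_with_early_stop(generations, stop_at=4))


print(demo())  # 42
```

In this toy run the vote returns as soon as four samples agree, and the slower generations are cancelled instead of being awaited, which is where the time saving would come from.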

TensorRT-LLM also enabled the team to harness FP8 quantization, a compression method that delivered a 1.5x speedup over the more commonly used FP16 format. ReDrafter, a speculative decoding technique developed by Apple, provided a further 1.8x speedup.
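As a back-of-the-envelope check, and assuming the two reported speedups are independent and compose multiplicatively (the article does not state this), the combined effect would be roughly 2.7x. The helper name below is illustrative.

```python
def combined_speedup(*factors: float) -> float:
    """Multiply speedup factors, assuming each one applies independently."""
    total = 1.0
    for factor in factors:
        total *= factor
    return total


fp8_speedup = 1.5        # FP8 vs. FP16, as reported
redrafter_speedup = 1.8  # ReDrafter speculative decoding, as reported

print(f"{combined_speedup(fp8_speedup, redrafter_speedup):.1f}x")  # prints 2.7x
```

If the two optimizations overlap in what they accelerate, the real combined gain would be smaller than this estimate.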

The final model performed even better on the competition's unseen final dataset than it did on the public dataset - a sign that the team successfully built a generalizable model and avoided overfitting their LLM to the sample data.

"Even without the Kaggle competition, we'd still be working to improve AI reasoning models for math," said Gitman. "But Kaggle gives us the opportunity to benchmark and discover how well our models generalize to a third-party dataset."

Sharing the Wealth

The team will soon release a technical report detailing the techniques used in their winning solution - and plans to share their dataset and a series of models on Hugging Face. The advancements and optimizations they made over the course of the competition have been integrated into NeMo-Skills pipelines available on GitHub.

Key data, technology, and insights from this pipeline were also used to train the just-released NVIDIA Llama Nemotron Ultra model.

"Throughout this collaboration, we used tools across the NVIDIA software stack," said Christof Henkel, a member of the Kaggle Grandmasters of NVIDIA, known as KGMON. "By working closely with our LLM research and development teams, we're able to take what we learn from the competition on a day-to-day basis and push those optimizations into NVIDIA's open-source libraries."

After the competition win, Henkel regained the title of Kaggle World Champion - ranking No. 1 among the platform's more than 23 million users. Another teammate, Finland-based Ivan Sorokin, earned the Kaggle Grandmaster title, held by just over 350 people around the world.

For their first-place win, the group also won a $262,144 prize that they're directing to the NVIDIA Foundation to support charitable organizations.

The full team: Igor Gitman, Darragh Hanley, Christof Henkel, Ivan Moshkov, Benedikt Schifferer, Ivan Sorokin and Shubham Toshniwal.

Sample math questions in the featured visual above are from the 2025 American Invitational Mathematics Examination. Find the full set of questions and solutions on the Art
LINK: https://blogs.nvidia.com/blog/reasoning-ai-math-olympiad/...