Sony Pixel Power calrec Sony

Math Test? No Problems: NVIDIA Team Scores Kaggle Win With Reasoning Model

15/04/2025

The final days of the AI Mathematical Olympiad's latest competition were a transcontinental relay for team NVIDIA.

Every evening, two team members on opposite ends of the U.S. would submit an AI reasoning model to Kaggle - the online Olympics of data science and machine learning. They'd wait a tense five hours before learning how well the model tackled a sample set of 50 complex math problems.

After seeing the results, the U.S. team would pass the baton to teammates waking up in Armenia, Finland, Germany and Northern Ireland, who would spend their day testing, modifying and optimizing different model versions.

Every night I'd be so disappointed in our score, but then I'd wake up and see the messages that came in overnight from teammates in Europe, said Igor Gitman, senior applied scientist. My hopes would go up and we'd try again.

While the team was disheartened by their lack of improvement on the public dataset during the competition's final days, the real test of an AI model is how well it can generalize to unseen data. That's where their reasoning model leapt to the top of the leaderboard - correctly answering 34 out of 50 Olympiad questions within a five-hour time limit using a cluster of four NVIDIA L4 GPUs.

We got the magic in the end, said Northern Ireland-based team member Darragh Hanley, a Kaggle grandmaster and senior large language model (LLM) technologist.

Building a Winning Equation The NVIDIA team competed under the name NemoSkills - a nod to their use of the NeMo-Skills collection of pipelines for accelerated LLM training, evaluation and inference. The seven members each contributed different areas of expertise, spanning LLM training, model distillation and inference optimization.

For the Kaggle challenge, over 2,200 participating teams submitted AI models tasked with solving 50 math questions - complex problems at the National Olympiad level, spanning algebra, geometry, combinatorics and number theory - within five hours.

https://blogs.nvidia.com/wp-content/uploads/2025/04/Sample-Reasoning-AI.mp4

The team's winning model uses a combination of natural language reasoning and Python code execution.

To complete this inference challenge on the small cluster of NVIDIA L4 GPUs available via Kaggle, the NemoSkills team had to get creative.

Their winning model used Qwen2.5-14B-Base, a foundation model with chain-of-thought reasoning capabilities which the team fine-tuned on millions of synthetically generated solutions to math problems.

These synthetic solutions were primarily generated by two larger reasoning models - DeepSeek-R1 and QwQ-32B - and used to teach the team's foundation model via a form of knowledge distillation. The end result was a smaller, faster, long-thinking model capable of tackling complex problems using a combination of natural language reasoning and Python code execution.

To further boost performance, the team's solution reasons through multiple long-thinking responses in parallel before determining a final answer. To optimize this process and meet the competition's time limit, the team also used an innovative early-stopping technique.

A reasoning model might, for example, be set to answer a math problem 12 different times before picking the most common response. Using the asynchronous processing capabilities of NeMo-Skills and NVIDIA TensorRT-LLM, the team was able to monitor and exit inference early if the model had already converged at the correct answer four or more times.

TensorRT-LLM also enabled the team to harness FP8 quantization, a compression method that resulted in a 1.5x speedup over using the more commonly used FP16 format. ReDrafter, a speculative decoding technique developed by Apple, was used for a further 1.8x speedup.

The final model performed even better on the competition's unseen final dataset than it did on the public dataset - a sign that the team successfully built a generalizable model and avoided overfitting their LLM to the sample data.

Even without the Kaggle competition, we'd still be working to improve AI reasoning models for math, said Gitman. But Kaggle gives us the opportunity to benchmark and discover how well our models generalize to a third-party dataset.

Sharing the Wealth The team will soon release a technical report detailing the techniques used in their winning solution - and plans to share their dataset and a series of models on Hugging Face. The advancements and optimizations they made over the course of the competition have been integrated into NeMo-Skills pipelines available on GitHub.

Key data, technology, and insights from this pipeline were also used to train the just-released NVIDIA Llama Nemotron Ultra model.

Throughout this collaboration, we used tools across the NVIDIA software stack, said Christof Henkel, a member of the Kaggle Grandmasters of NVIDIA, known as KGMON. By working closely with our LLM research and development teams, we're able to take what we learn from the competition on a day-to-day basis and push those optimizations into NVIDIA's open-source libraries.

After the competition win, Henkel regained the title of Kaggle World Champion - ranking No. 1 among the platform's over 23 million users. Another teammate, Finland-based Ivan Sorokin, earned the Kaggle Grandmaster title, held by just over 350 people around the world.

For their first-place win, the group also won a $262,144 prize that they're directing to the NVIDIA Foundation to support charitable organizations.

Meet the full team - Igor Gitman, Darragh Hanley, Christof Henkel, Ivan Moshkov, Benedikt Schifferer, Ivan Sorokin and Shubham Toshniwal - in the video below:

Sample math questions in the featured visual above are from the 2025 American Invitational Mathematics Examination. Find the full set of questions and solutions on the Art
LINK: https://blogs.nvidia.com/blog/reasoning-ai-math-olympiad/...
See more stories from nvidia

More from Nvidia

22/05/2025

Sale Into Summer With 40% Off GeForce NOW Six-Month Performance Memberships

GeForce NOW is turning up the heat this summer with a hot new deal. For a limited time, save 40% on six-month Performance memberships and enjoy premium GeForce ...

21/05/2025

NVIDIA and SAP Bring AI Agents to the Physical World

As robots increasingly make their way to the largest enterprises' manufacturing plants and warehouses, the need for access to critical business and operatio...

20/05/2025

Siemens Makes Factory Floors Smarter With Industrial AI

Industrial AI is transforming how factories operate, innovate and scale. The convergence of AI, simulation and digital twins is poised to unlock new levels of ...

19/05/2025

NVIDIA and Microsoft Accelerate Agentic AI Innovation, From Cloud to PC

Agentic AI is redefining scientific discovery and unlocking research breakthroughs and innovations across industries. Through deepened collaboration, NVIDIA and...

19/05/2025

NVIDIA Research Breakthroughs Put Advanced Robots in Motion

Across robot training and development, NVIDIA Research is uncovering breakthroughs in areas such as multimodal generative AI and synthetic data generation. The...

19/05/2025

NVIDIA and Microsoft Advance Development on RTX AI PCs

Generative AI is transforming PC software into breakthrough experiences - from digital humans to writing assistants, intelligent agents and creative tools. NVI...

18/05/2025

NVIDIA CEO Envisions AI Infrastructure Industry Worth Trillions of Dollars'

Electricity. The Internet. Now it's time for another major technology, AI, to sweep the globe. NVIDIA founder and CEO Jensen Huang took the stage at a pack...

18/05/2025

NVIDIA Expands Omniverse Blueprint for AI Factory Digital Twins With New Ecosystem Integrations, Development Tools

Empowering engineering teams with more tools for building AI factories, NVIDIA t...

18/05/2025

AI Blueprint for Video Search and Summarization Now Available to Deploy Video Analytics AI Agents Across Industries

The age of video analytics AI agents is here. Video is one of the defining feat...

18/05/2025

Semiconductor Industry Accelerates Design Manufacturing With NVIDIA Blackwell and CUDA-X

TSMC, Cadence, KLA, Siemens and Synopsys are advancing semiconductor manufacturi...

18/05/2025

NVIDIA Grace CPU C1 Gains Broad Support in Edge, Telco and Storage

NVIDIA is highlighting significant momentum for its new Grace CPU C1 this week at the COMPUTEX trade show in Taipei, with a strong showing of support from key o...

18/05/2025

NVIDIA-Powered Supercomputer to Enable Quantum Leap for Taiwan Research

Researchers across Taiwan are tackling complex challenges in AI development, climate science and quantum computing. Their work will soon be boosted by a new sup...

18/05/2025

That's One Smart Hospital! Taiwan Medical Centers Deploy Life-Saving Innovations With NVIDIA System-Builder Partners

Leading healthcare organizations across the globe are using agentic AI, robotics...

18/05/2025

NVIDIA Grows Quantum Computing Ecosystem With Taiwan Manufacturers and Supercomputing

Quantum computing promises to shorten the path to solving some of the world'...

15/05/2025

Into the Omniverse: Computational Fluid Dynamics Simulation Finds Smoothest Flow With AI-Driven Digital Twins

Editor's note: This post is part of Into the Omniverse, a series focused on ...

15/05/2025

Exploring the Revenue-Generating Potential of AI Factories

AI is creating value for everyone - from researchers in drug discovery to quantitative analysts navigating financial market changes. The faster an AI system ca...

15/05/2025

Time to Slay: DOOM: The Dark Ages' Looms on GeForce NOW

Steel clashes and war drums thunder as a new age of battle dawns - one that will test even the mightiest Slayer. This GFN Thursday, DOOM: The Dark Ages - the b...

14/05/2025

Visa Makes Payments Personalized and Secure With AI

Think tap to pay - but smarter and safer. Visa is tapping into AI to enhance services for its global network of customers, focused on fraud prevention, personal...

14/05/2025

Press Play on Don Diablo's Music Video - Created With NVIDIA RTX-Powered Generative AI

Electronic music icon Don Diablo is known for pushing the boundaries of music, v...

13/05/2025

NVIDIA Partners Showcase Cutting-Edge Robotic and Industrial AI Solutions at Automate 2025

As the manufacturing industry faces challenges - such as labor shortages, reshor...

13/05/2025

How Reasoning AI Agents Transform High-Stakes Decision Making

Editor's note: This post is part of the AI On blog series, which explores the latest techniques and real-world applications of agentic AI, chatbots and copi...

12/05/2025

NVIDIA Scores COMPUTEX Best Choice Awards

NVIDIA today received multiple accolades at COMPUTEX's Best Choice Awards, in recognition of innovation across the company. The NVIDIA GeForce RTX 5090 GPU...

08/05/2025

Wildfire Prevention: AI Startups Support Prescribed Burns, Early Alerts

Artificial intelligence is helping identify and treat diseases faster with better results for humankind. Natural disasters like wildfires are next. Fires in th...

08/05/2025

Join the Family: GeForce NOW Welcomes 2K's Acclaimed Mafia' Franchise to the Cloud

Calling all wiseguys - 2K's acclaimed Mafia franchise is available to stream...

08/05/2025

LM Studio Accelerates LLM Performance With NVIDIA GeForce RTX GPUs and CUDA 12.8

As AI use cases continue to expand - from document summarization to custom software agents - developers and enthusiasts are seeking faster, more flexible ways t...

07/05/2025

Cadence Taps NVIDIA Blackwell to Accelerate AI-Driven Engineering Design and Scientific Simulation

A new supercomputer offered by Cadence, a leading provider of technology for ele...

07/05/2025

NVIDIA's Rama Akkiraju on How AI Platform Architects Help Bridge Business Vision and Technical Execution

Enterprises across industries are exploring AI to rethink problem-solving and re...

02/05/2025

NVIDIA Experts Share Top 5 Tips for Standing Out in the AI Job Market

With graduation season approaching, a new cohort of students is embarking on next steps, aiming to use their passions and skills to make a real, tangible impact...

01/05/2025

May the Cloud Be With You: GeForce NOW Unveils 21 New Games This Month

May brings more than just rainbows and sunshine - it's also time for fresh adventures and epic battles. This GFN Thursday spotlights 20 can't-miss games...

01/05/2025

Wandercraft Begins Clinical Trials for Physical AI-Powered Personal Exoskeleton

For Nicolas Simon, advancing the field of robotics is a personal mission that could change his siblings' lives. Two-thirds of Simon's family members us...

30/04/2025

From Kitchen to Drive-Thru: How Yum! Brands Is Accelerating Restaurant Innovation With AI

The quick-service restaurant (QSR) industry is being reinvented by AI. For exam...

30/04/2025

Control the Composition of AI-Generated Images With the NVIDIA AI Blueprint for 3D-Guided Generative AI

AI-powered image generation has progressed at a remarkable pace - from early exa...

28/04/2025

NVIDIA Brings Cybersecurity to Every AI Factory

As enterprises increasingly adopt AI, securing AI factories - where complex, agentic workflows are executed - has never been more critical. NVIDIA is bringing ...

28/04/2025

How Agentic AI Enables the Next Leap in Cybersecurity

Agentic AI is redefining the cybersecurity landscape - introducing new opportunities that demand rethinking how to secure AI while offering the keys to addressi...

28/04/2025

Oracle Cloud Infrastructure Deploys Thousands of NVIDIA Blackwell GPUs for Agentic AI and Reasoning Models

Oracle has stood up and optimized its first wave of liquid-cooled NVIDIA GB200 N...

24/04/2025

NVIDIA Research at ICLR - Pioneering the Next Wave of Multimodal Generative AI

Advancing AI requires a full-stack approach, with a powerful foundation of computing infrastructure - including accelerated processors and networking technologi...

24/04/2025

All Roads Lead Back to Oblivion: Bethesda's The Elder Scrolls IV: Oblivion Remastered' Arrives on GeForce NOW

Get the controllers ready and clear the calendar - it's a jam-packed GFN Thu...

23/04/2025

How the Economics of Inference Can Maximize AI Value

As AI models evolve and adoption grows, enterprises must perform a delicate balancing act to achieve maximum value. That's because inference - the process ...

23/04/2025

Capital One Banks on AI for Financial Services

Financial services has long been at the forefront of adopting technological innovations. Today, generative AI and agentic systems are redefining the industry, f...

23/04/2025

Project G-Assist Plug-In Builder Lets Anyone Customize AI on GeForce RTX AI PCs

AI is rapidly reshaping what's possible on a PC - whether for real-time image generation or voice-controlled workflows. As AI capabilities grow, so does the...

23/04/2025

Enterprises Onboard AI Teammates Faster With NVIDIA NeMo Tools to Scale Employee Productivity

An AI agent is only as accurate, relevant and timely as the data that powers it....

22/04/2025

Keeping AI on the Planet: NVIDIA Technologies Make Every Day About Earth Day

Whether at sea, land or in the sky - even outer space - NVIDIA technology is helping research scientists and developers alike explore and understand oceans, wil...

22/04/2025

Chill Factor: NVIDIA Blackwell Platform Boosts Water Efficiency by Over 300x

Traditionally, data centers have relied on air cooling - where mechanical chillers circulate chilled air to absorb heat from servers, helping them maintain opti...

22/04/2025

Making Brain Waves: AI Startup Speeds Disease Research With Lab in the Loop

About 15% of the world's population - over a billion people - are affected by neurological disorders, from commonly known diseases like Alzheimer's and ...

17/04/2025

AI Bites Back: Researchers Develop Model to Detect Malaria Amid Venezuelan Gold Rush

Gold prospecting in Venezuela has led to a malaria resurgence, but researchers h...

17/04/2025

Spring Into Action With 11 New Games on GeForce NOW

As the days grow longer and the flowers bloom, GFN Thursday brings a fresh lineup of games to brighten the week. Dive into thrilling hunts and dark fantasy adv...

16/04/2025

Isomorphic Labs Rethinks Drug Discovery With AI

Isomorphic Labs is reimagining the drug discovery process with an AI-first approach. At the heart of this work is a new way of thinking about biology. Max Jade...

16/04/2025

Into the Omniverse: How Digital Twins Are Scaling Industrial AI

Editor's note: This post is part of Into the Omniverse, a series focused on how developers, 3D practitioners, and enterprises can transform their workflows ...

15/04/2025

Math Test? No Problems: NVIDIA Team Scores Kaggle Win With Reasoning Model

The final days of the AI Mathematical Olympiad's latest competition were a transcontinental relay for team NVIDIA. Every evening, two team members on oppos...

15/04/2025

Everywhere, All at Once: NVIDIA Drives the Next Phase of AI Growth

Every company and country wants to grow and create economic opportunity - but they need virtually limitless intelligence to do so. Working with its ecosystem pa...