Sony Pixel Power calrec Sony

NVIDIA's AI Masters Sweep KDD Cup 2024 Data Science Competition

22/07/2024

Team NVIDIA has triumphed at the Amazon KDD Cup 2024, securing first place Friday across all five competition tracks.

The team - consisting of NVIDIANs Ahmet Erdem, Benedikt Schifferer, Chris Deotte, Gilberto Titericz, Ivan Sorokin and Simon Jegou - demonstrated its prowess in generative AI, winning in categories that included text generation, multiple-choice questions, name entity recognition, ranking, and retrieval.

The competition, themed Multi-Task Online Shopping Challenge for LLMs, asked participants to solve various challenges using limited datasets.

The new trend in LLM competitions is that they don't give you training data, said Deotte, a senior data scientist at NVIDIA. They give you 96 example questions - not enough to train a model - so we came up with 500,000 questions on our own.

Deotte explained that the NVIDIA team generated a variety of questions by writing some themselves, using a large language model to create others, and transforming existing e-commerce datasets.

Once we had our questions, it was straightforward to use existing frameworks to fine-tune a language model, he said.

The competition organizers hid the test questions to ensure participants couldn't exploit previously known answers. This approach encourages models that generalize well to any question about e-commerce, proving the model's capability to handle real-world scenarios effectively.

Despite these constraints, Team NVIDIA's innovative approach outperformed all competitors by using Qwen2-72B, a just-released LLM with 72 billion parameters, fine-tuned on eight NVIDIA A100 Tensor Core GPUs, and employing QLoRA, a technique for fine-tuning models with datasets.

About the KDD Cup 2024 The KDD Cup, organized by the Association for Computing Machinery's Special Interest Group on Knowledge Discovery and Data Mining, or ACM SIGKDD, is a prestigious annual competition that promotes research and development in the field.

This year's challenge, hosted by Amazon, focused on mimicking the complexities of online shopping with the goal of making it a more intuitive and satisfying experience using large language models. Organizers utilized the test dataset ShopBench - a benchmark that replicates the massive challenge for online shopping with 57 tasks and about 20,000 questions derived from real-world Amazon shopping data - to evaluate participants' models.

The ShopBench benchmark focused on four key shopping skills, along with a fifth all-in-one challenge:

Shopping Concept Understanding: Decoding complex shopping concepts and terminologies.

Shopping Knowledge Reasoning: Making informed decisions with shopping knowledge.

User Behavior Alignment: Understanding dynamic customer behavior.

Multilingual Abilities: Shopping across languages.

All-Around: Solving all tasks from the previous tracks in a unified solution.

NVIDIA's Winning Solution NVIDIA's winning solution involved creating a single model for each track.

The team fine-tuned the just-released Qwen2-72B model using eight NVIDIA A100 Tensor Core GPUs for approximately 24 hours. The GPUs provided fast and efficient processing, significantly reducing the time required for fine-tuning.

First, the team generated training datasets based on the provided examples and synthesized additional data using Llama 3 70B hosted on build.nvidia.com.

Next, they employed QLoRA (Quantized Low-Rank Adaptation), a training process using the data created in step one. QLoRA modifies a smaller subset of the model's weights, allowing efficient training and fine-tuning.

The model was then quantized - making it smaller and able to run on a system with a smaller hard drive and less memory - with AWQ 4-bit and used the vLLM inference library to predict the test datasets on four NVIDIA T4 Tensor Core GPUs within the time constraints.

This approach secured the top spot in each individual track and the overall first place in the competition-a clean sweep for NVIDIA for the second year in a row.

The team plans to submit a detailed paper on its solution next month and plans to present its findings at KDD 2024 in Barcelona.
LINK: https://blogs.nvidia.com/blog/nvidia-ai-masters-kdd-cup-2024/...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

04/08/2026

Dalet Announces Commercial Availability of Dalia, Bringing Media-Aware Agentic AI to Enterprise Productions

Dalet, a leading technology and service provider for media-rich organizations, t...

04/07/2026

Detective Conan: Fallen Angel of the Highway Opens in Dolby Cinemas Across Japan, Presented in Dolby Atmos and Dolby ...

April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...

25/06/2026

Creative Remote to Open London Offline Facility

Creative Remote, the provider of remote and hybrid offline editing infrastructure, today announced the opening of 41, its new offline edit facility located at 4...

25/06/2026

Rise Announces 2026 Worldwide Mentoring Cohorts Supportin...

Rise, the award-winning advocacy group for gender diversity in the broadcast and media technology sector, is pleased to announce the global mentoring cohort for...

25/06/2026

Emergent Partners with ROCKET to Expand Canadian Operatio...

Emergent, a pioneer in browser-based, AI-enhanced content production environments, today announced a strategic partnership with ROCKET, a premier media-centric ...

25/06/2026

Mobile Television Group Launches MTVG Full-Stack Production Platform

Share Copy link Facebook X Linkedin Bluesky Email...

25/06/2026

NAB Updates FCC on ATSC 3.0 Alerting Advances

Share Copy link Facebook X Linkedin Bluesky Email...

25/06/2026

Tegna Elevates Four Executives to Senior VP

Share Copy link Facebook X Linkedin Bluesky Email...

25/06/2026

Comedian Joe McGucken hosts new RT podcast series Ramble

Launching today (Thursday 25 June), new RT podcast Ramble with Joe McGucken is a series of curiosity-driven conversations where actor, writer and comedian Joe ...

25/06/2026

June 24, 2026

Immune molecule may drive excessive drinking in alcohol use disorder Scripps Research scientists showed that blocking an immune molecule tied to inflammation r...

24/06/2026

NoiseWorks Audio add Mouth De-Click to VoiceAssist

Plus: VoiceAssist Basic now available to UA LUNA users NoiseWorks Audio have just released an update that adds a new Mouth De-Click module to the Advanced t...

24/06/2026

Gator introduce the Frameworks Studio Mic Boom 2000

New heavy-duty mic stand joins range The latest arrival to Gator's Frameworks family introduces a new heavy-duty boom stand that's been designed for...

24/06/2026

Waves V17 now available

Latest major plug-in update goes live Waves have just announced that the latest major update for their hugely popular plug-in range is now officially availa...

24/06/2026

Why Four Bars of Signal Doesn't Always Mean Good Performance

When assessing cellular coverage, many people look at the signal bars displayed on a smartphone, router or modem. More bars are often assumed to mean better per...

24/06/2026

Rohde & Schwarz THORIS sets new standard for counterUAS defense

Rohde & Schwarz THORIS sets new standard for counter UAS defense At Eurosatory 2026, Rohde & Schwarz is unveiling THORIS, a German engineered, sovereign count...

24/06/2026

Rohde & Schwarz expands voice communications modernization program for Egyptian air traffic control

Rohde & Schwarz expands voice communications modernization program for Egyptian ...

24/06/2026

Clear-Com FreeSpeak Cell Successfully Tested by RTL Deutschland in 5G Network at...

eds3_5_jq(document).ready(function($) { $(#eds_sliderM519).chameleonSlider_2_1({...

24/06/2026

Nielsen's Q1 2026 Ad Supported Gauge

Streaming sets record high of 46.6% of ad supported TV viewing, driven by Super Bowl and Winter Olympics; overall share of ad supported TV remains steady NEW Y...

24/06/2026

FCC Flooded with Nearly 28K Comments Regarding Its Probe of 'The View'

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

Hearst Television Brings Ad Addressability to Local Broadcast TV

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

FCC Raises $3.5 Billion in AWS-3 Wireless Auction

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

RE:Vision Effects Announces Twixtor Standalone v 8.1 and a Sale!

RE:Vision Effects Announces Twixtor Standalone v 8.1 and a Sale! Brie Clayton June 24, 2026 0 Comments Twixtor v8.1 Standalone adds support for variab...

24/06/2026

Dreamtek Uses Full Blackmagic Workflow for Vercel Next JS Event

Dreamtek Uses Full Blackmagic Workflow for Vercel Next JS Event Brie Clayton June 24, 2026 0 Comments Blackmagic cameras, switchers, routers, recorder...

24/06/2026

Chyron LIVE Unveils New Features: Haivision StreamHub Integration, SCTE-35 Ad Insertion, and Refined Switching Tools

Chyron LIVE Unveils New Features: Haivision StreamHub Integration, SCTE-35 Ad In...

24/06/2026

Mapping an Education

Mapping an Education How composer Chloe Clarke Smith navigated her Boston Conservatory experience and brought new meaning to her work June 24, 2026 By Sara...

24/06/2026

The Next Act

The Next Act Dean Krisha Marcano's vision for a connected Theater Division, and the fund making it possible June 24, 2026 Photo by Eric Antoniou The Or...

24/06/2026

Announcing STAGES Magazine 2026

Announcing STAGES Magazine 2026 Marking a decade since Boston Conservatory and Berklee College of Music joined forces, this issue spotlights some of the groun...

24/06/2026

Rede Legislativa Chooses Appear to Support Brazil TV Ver...

In Brazil's TV 3.0 Trials, Appear's X5 is transporting live signals from Bras lia to S o Paulo over the public internet using secure, reliable next-gene...

24/06/2026

Mediaproxy partners with HVS for US broadcast market

Melbourne, Australia - 24 June 2026: Mediaproxy, the global standard for software-based IP compliance monitoring and multiviewing solutions, has named Heartland...

24/06/2026

Gray Media Launches Political 360 Digital Advertising Solution

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

Walmart to Pay $1.4 Billion to Acquire Ad Tech Firm Vibe.co

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

FCC Flooded with Nearly 28K Comments on 'The View'

Share Copy link Facebook X Linkedin Bluesky Email...

24/06/2026

First Rush Brings SDI Multicam ProRes Recording to Apple Silicon Macs

First Rush Brings SDI Multicam ProRes Recording to Apple Silicon Macs Brie Clayton June 23, 2026 0 Comments First Rush is a native macOS application d...

24/06/2026

Vertical Drama Beneath Crimson Sails Created with Blackmagic Design

Vertical Drama Beneath Crimson Sails Created with Blackmagic Design Brie Clayton June 23, 2026 0 Comments Thunder Child Productions relies on cameras&...

24/06/2026

X-Rite Pantone Adds MSI PRO MAX Displays to Pantone Validated Program for Proven Color Fidelity for Real World Colors

X-Rite Pantone Adds MSI PRO MAX Displays to Pantone Validated Program for Proven...

24/06/2026

Dara Briain forms Dara's Dull Appreciation Society for U

24th June 2026, London: Dara Briain will host brand new comedy entertainment series Dara's Dull Appreciation Society (4x40') celebrating members of th...

24/06/2026

Sky reveals wild new trailer and images for Shaun the Sheep: The Beast of Mossy Bottom

Wednesday 24 June 2026 Sky reveals wild new trailer and images for Shaun the Sh...

24/06/2026

Seven paradoxes shaping the next era of media production - Episode 1

Why Dynamic Media Facilities Matter In this series, we explore the technologies, architectures and operational realities shaping modern media operations. Along ...

24/06/2026

Comscore Announces Partnership with Amazon DSP to Expand Content Addressability to Drive Campaign Performance

Comscore Announces Partnership with Amazon DSP to Expand Content Addressability ...

24/06/2026

RT is supporting 21 Arts and Cultural Events all over Ireland this July

From Cork to Limerick to Earagail: RT is supporting 21 Arts and Cultural Events all over Ireland this July RT Supporting the Arts is delighted to spotlight...

23/06/2026

Case Study: YES Networks IP Transition Expands Production Possibilities and Redefines Workflows

When we began planning our transition from an SDI-based infrastructure to a new ...

23/06/2026

Imagine Communications Appoints Greg Garmon as SVP, Americas Video Sales

Imagine Communications has announced the appointment of Greg Garmon as Senior Vice President, Americas Video Sales. Garmon will oversee account growth and busin...

23/06/2026

Snap Promotes Emma Wakely to Head of Sports and Media Partnerships, Americas

Snap has promoted Emma Wakely to Head of Sports and Media Partnerships, Americas, succeeding Anmol Malhotra, who has been elevated to Global Head of Content and...

23/06/2026

YES Network and Gotham Sports App to Air MI New York Major League Cricket Matches

YES Network and The Gotham Sports App will air MI New York's Major League Cr...

23/06/2026

HAND Issues Persistent Digital IDs to 2026 NBA Draft Class

The Universal Talent Identifier (HAND) has issued HAND IDs to 34 top projected prospects in the 2026 NBA Draft class, including AJ Dybantsa, Cameron Boozer, and...

23/06/2026

World Boxing Launches World Boxing TV Streaming Platform

World Boxing has announced the launch of World Boxing TV, a subscription-based streaming platform built on the Joymo platform, offering live events, on-demand c...

23/06/2026

FloRacing to Stream 32 Off-Road Motorcycle Racing Events Including AMA Amateur National Motocross Championship

FloSports will stream 32 off-road motorcycle racing events on FloRacing, includi...

23/06/2026

SES Adds 14 Regional Channels and New Set-Top Boxes to ASTRA TV in Spain

SES has announced the expansion of its ASTRA TV platform in Spain with the addition of 14 regional channels in HD and UHD quality and the launch of new hybrid s...