Sony Pixel Power calrec Sony

Pinterest Boosts Home Feed Engagement 16% With Switch to GPU Acceleration of Recommenders

04/08/2022

Pinterest has engineered a way to serve its photo-sharing community more of the images they love.

The social-image service, with more than 400 million monthly active users, has trained bigger recommender models for improved accuracy at predicting people's interests.

Pinterest handles hundreds of millions of user requests an hour on any given day. And it must also narrow down relevant images from roughly 300 billion images on the site to roughly 50 for each person.

The last step - ranking the most relevant and engaging content for everyone using Pinterest - required a leap in acceleration to run heftier models, with minimal latency, for better predictions.

Pinterest has improved the accuracy of its recommender models powering people's home feeds and other areas, increasing engagement by as much as 16%.

The leap was enabled by switching from CPUs to NVIDIA GPUs, which could easily be applied next to other areas, including advertising images, according to Pinterest.

Normally we would be happy with a 2% increase, and 16% is just a beginning for home feeds. We see additional gains - it opens a lot of doors for opportunities, said Pong Eksombatchai, a software engineer at Pinterest.

Transformer models capable of better predictions are shaking up industries from retail to entertainment and advertising. But their leaps in performance gains of the past few years have come with a need to serve models that are some 100x bigger as their number of model parameters and computations skyrockets.

Huge Inference Gains, Same Infrastructure Cost Like many, Pinterest engineers wanted to tap into state-of-the-art recommender models to increase engagement. But serving these massive models on CPUs presented a 100x increase in cost and latency. That wasn't going to maintain its magical user experience - fresh and more appealing images - occurring within a fraction of a second.

If that latency happened, then obviously our users wouldn't like that very much because they would have to wait forever, said Eksombatchai. We are pretty close to the limit of what we can do on CPU basically.

The challenge was to serve these hundredfold larger recommender models within the same cost and latency constraints.

Working with NVIDIA, Pinterest engineers began architectural changes to optimize their inference pipeline and recommender models to enable the transition from CPU to GPU cloud instances. The technology transition began late last year and required major changes to how the company manages workloads. The result is a 100x gain in inference efficiency on the same IT budget, meeting their goals.

We are starting to use really, really big models now. And that is where the GPU comes in - to help make these models possible, Eksombatchai said.

Tapping Into cuCollections Switching from CPUs to GPUs required rethinking its inference systems architecture. Among other issues, engineers had to change how they send workloads to their inference servers. Fortunately, there are tools to assist in making the transition easier.

The Pinterest inference server built for CPUs had to be altered because it was set up to send smaller batch sizes to its servers. GPUs can handle much larger workloads, so it's necessary to set up larger batch requests to increase efficiency.

One area where this comes into play is with its embedding table lookup module. Embedding tables are used to track interactions between various context-specific features and interests of user profiles. They can track where you navigate, and what people Pin on Pinterest, share or numerous other actions, helping refine predictions on what users might like to click on next.

They are used to incrementally learn user preference based on context in order to make better content recommendations to those using Pinterest. Its embedding table lookup module required two computation steps repeated hundreds of times because of the number of features tracked.

Pinterest engineers greatly reduced this number of operations using a GPU-accelerated concurrent hash table from NVIDIA cuCollections. And they set up a custom consolidated embedding lookup module so they could merge requests into a single lookup. Better results were seen immediately.

Using cuCollections helped us to remove bottlenecks, said Eksombatchai.

Enlisting CUDA Graphs Pinterest relied on CUDA Graphs to eliminate what was remaining of the small batch operations, further optimizing its inference models.

CUDA Graphs helps reduce the CPU interactions when launching on GPUs. They're designed to enable workloads to be defined as graphs rather than single operations. They provide a mechanism to launch multiple GPU operations through a single CPU operation, reducing CPU overheads.

Pinterest enlisted CUDA Graphs to represent the model inference process as a static graph of operation instead of as those individually scheduled. This enabled the computation to be handled as a single unit without any kernel launching overhead.

The company now supports CUDA Graph as a new backend of its model server. When a model is first loaded, the model server runs the model inference once to build the graph instance. This graph can then be run repeatedly in inference to show content on its app or site.

Implementing CUDA Graphs helped Pinterest to significantly reduce inference latency of its recommender models, according to its engineers.

GPUs have enabled Pinterest to do something that was impossible with CPUs on the same budget, and by doing this they can make changes that have a direct impact on various business metrics.

Learn about Pinterest's GPU-driven inference and optimizations at its GTC session, Serving 100x Bigger Recommender Models, and in the Pinterest Engineering blog.

Register for GTC, running Sept. 19-22, for free to attend sessions with NVIDIA and dozens of industry leaders.
LINK: https://blogs.nvidia.com/blog/2022/08/04/pinterest-gpu-acceleration-re...
See more stories from nvidia

Most recent headlines

03/12/2025

ToolsOnAir Composition Builder 2025 Boilerplate

ToolsOnAir Composition Builder 2025 Boilerplate More Details: The Composition Builder 2025 application for macOS enables TV stations and Live Event broadcast...

03/12/2025

ToolsOnAr just:live pro 2025 Boilerplate

ToolsOnAr just:live pro 2025 Boilerplate More Details: just:live pro 2025 is a Single Channel Live Production Playout solution for video and static or real-t...

03/12/2025

ToolsOnAr just:play pro 2025 Boilerplate

ToolsOnAr just:play pro 2025 Boilerplate More Details: just:play pro 2025 is a Single Channel automated 24/7 Master Control playout solution with SD, HD and ...

03/12/2025

ToolsOnAr live:cut 2025 Boilerplate

ToolsOnAr live:cut 2025 Boilerplate More Details: live:cut is an option to just:in mac pro 2025 and enables multicamera production workflows for up to 16 cam...

03/12/2025

ToolsOnAir Just In Mac Lite NDI 2025 Boilerplate

ToolsOnAir Just In Mac Lite NDI 2025 Boilerplate More Details: The Just In Mac Lite NDI application is a streamlined media capture solution designed specific...

03/12/2025

ToolsOnAir Just In Mac Lite 2025 Boilerplate

ToolsOnAir Just In Mac Lite 2025 Boilerplate More Details: The Just In Mac Lite application is a streamlined media capture solution designed specifically for...

03/12/2025

ToolsOnAir just:in mac pro 2025 Boilerplate

ToolsOnAir just:in mac pro 2025 Boilerplate More Details: just:in mac pro is a macOS-based client-server multichannel capture solution to record SDI, HDMI, N...

03/12/2025

Opportunity awaits Young Journalist of the Year winner

Tracy Bonareri Onchoke, Thomson's Young Journalist of the year 2025 is hoping the accolade will be a springboard to more cross-border collaboration between ...

03/12/2025

MLS Cup 2025 Production To Feature Four iPhone 17 Pros as Game-Coverage Cameras

MLS Cup 2025 Production To Feature Four iPhone 17 Pros as Game-Coverage CamerasStay tuned to SVG on Friday for our in-depth story on this year's MLS Cup pro...

03/12/2025

SVG LIVE! 2025: All Sessions Now Available to Watch on SVG PLAY

SVG LIVE! 2025: All Sessions Now Available to Watch on SVG PLAYThe inaugural event placed a spotlight on the exciting world of live entertainmentBy SVG Staff ...

03/12/2025

Endless Cookie Is an Animated Documentary Unlike Any Other

(L-R) Peter Scriver and Seth Scriver introduce their documentary Endless Cookie for its premiere at the Egyptian Theatre in Park City. (Photo by Andrew H. Wa...

03/12/2025

Listeners Worldwide Crown Bad Bunny Global Top Artist for the Fourth Time and His Latest Release Takes Global Top Album

For the fourth time, Bad Bunny is the most-streamed Wrapped artist on Spotify gl...

03/12/2025

2025 Wrapped Is Here With More Layers, Stories, and Connection Than Ever Before

The wait is over. It's time to look back at the audio that defined your year with 2025 Spotify Wrapped, our annual celebration for fans, artists, creators, ...

03/12/2025

Los oyentes del mundo coronan a Bad Bunny como el Top Artista Global por cuarta vez y su ltimo lanzamiento se lleva el Top lbum Global

Por cuarta vez, Bad Bunny es el Top Artista Global de Wrapped en Spotify, con 19...

03/12/2025

These Music Trends Took 2025 in Surprising New Directions

Spotify Wrapped is back, and as always, it's powered by the billions of streams that fans around the world delivered throughout the year. From the artists w...

03/12/2025

The 2025 Wrapped Guide for Artists, Songwriters, Creators, Authors, and Advertisers

Spotify Wrapped is the moment when hundreds of millions of fans around the world...

03/12/2025

The Top Artists, Songs, Albums, Podcasts, and Audiobooks of 2025

With more than 700 million listeners around the world turning to Spotify to soundtrack their lives, it's time to look back at the audio that defined the yea...

03/12/2025

Los top artistas, canciones, lbumes, pdcasts y audiolibros de 2025

Con m s de 700 millones de oyentes en todo el mundo usando Spotify para acompa ar su d a a d a, es momento de mirar hacia atr s y ver el audio que marc el a o....

03/12/2025

Inside the 2025 Audiobook Trends on Spotify: Romantasy, Modern Classics, and What's Next

From page-turning thrillers to inspiring memoirs, audiobooks are becoming a core...

03/12/2025

Os Top Artistas, Msicas, lbuns, Podcasts e Audiolivros de 2025

Com mais de 700 milh es de ouvintes em todo o mundo recorrendo ao Spotify para embalar seu dia a dia, chega o momento de revisitar o udio que marcou o ano. Nos...

03/12/2025

How Our 2025 Wrapped Campaign Turns Your Year in Listening Into a Global Celebration

Spotify Wrapped celebrates the audio that defined our year, and the annual globa...

03/12/2025

Lionsgate, Debmar-Mercury Turn to LTN to Distribute MovieSphereGold

COLUMBIA, Md. Lionsgate and its TV syndicator subsidiary Debmar-Mercury have selected LTN to launch and deliver the new MovieSphereGold all-movie digital networ...

03/12/2025

Bitmovin, ThinkAnalytics Partner to Expand AI Capabilities

VIENNA, Austria Video streaming solutions provider Bitmovin and ThinkAnalytics, a provider of AI-powered data analytics for TV, have formed a strategic partners...

03/12/2025

Dolby and NFM to Debut First-Ever Dolby Home Experience

SAN FRANCISCO & THE COLONY, Texas Dolby Laboratories is making what it is calling a new chapter in its retail efforts as part of an agreement with NFM (Nebras...

03/12/2025

Chinese Broadcaster Takes Delivery of Native-IP Outside Broadcast Vehicle

MONTREAL Grass Valley has delivered a 4K Ultra-High-Definition (UHD) outside broadcast (OB) truck to Guangdong Radio and Television (GRT), in partnership with B...

03/12/2025

Whats New in Maxon One

Our users spoke and we listened. December's Maxon One release delivers long-awaited improvements across Cinema 4D, Redshift, ZBrush, and Red Giant. Whether...

03/12/2025

DHD Highlights Latest-Generation Advances in Broadcast Au...

DHD audio reports a successful 2025 with new additions to its range of broadcast-quality audio production, post-production and routing equipment. DHD innovatio...

03/12/2025

Grass Valley Delivers Breakthrough 4K UHD OB Truck to Gua...

Grass Valley recently delivered a cutting-edge 4K Ultra-High-Definition (UHD) outside broadcast (OB) truck to Guangdong Radio and Television (GRT), in partnersh...

03/12/2025

New GatesAir ATSC 3.0 Exciter Targets Global Markets

CINCINNATI GatesAir has introduced Maxiva XTK, an update to its Maxiva XTE software-defined TV exciter. XTK is a new cost-efficient model primarily developed fo...

03/12/2025

Riedel, Haivision Partner on Wireless Video Transmission Solutions

WUPPERTAL, Germany Riedel Communications is partnering with Haivision, a global provider of mission-critical, real-time video networking and visual collaboratio...

03/12/2025

FCC Sets Deadlines for Comments on Nexstar Acquisition of Tegna

WASHINGTON The Federal Communications Commission has opened a docket for comments on the proposed $6.2 billion Nexstar acquisition for Tegna and set deadlines f...

03/12/2025

Brightcove Unveils New AI Features

MILAN, Italy Brightcove has released seven new features designed to expand global reach, improve audience engagement, enhance live-streaming quality and streaml...

03/12/2025

Great American Media to Launch Pure Flix Familia in 2026

NEW YORK Great American Media said it plans to launch Pure Flix Familia, a dedicated Spanish-language platform, in 2026....

03/12/2025

FanDuel Sports Network Launches on Vizio

SOUTHPORT, Conn. Main Street Sports Group has announced that the FanDuel Sports Network app is now available directly on Vizio and on smart TVs with Vizio OS. T...

03/12/2025

Telia Taps Harmonic for Broadband Upgrade in Norway

SAN JOSE, Calif. Harmonic has announced that Telia, the second-largest telecom operator in Norway, is modernizing its broadband network with the company's c...

03/12/2025

Todd Ziegler to Take Reins of Sinclair's Green Bay, Wis., Stations

GREEN BAY, Wis. Sinclair said Jay Zollar, vice president and general manager of WLUK-WCWF here, will retire Dec. 31 after 26 years running the stations. Station...

03/12/2025

Riedel and Haivision Join Forces to Advance Wireless Video Transmission

Wuppertal December 3, 2025 Riedel and Haivision Join Forces to Advance Wireless Video TransmissionRiedel Communications today announced a new partnership with...

03/12/2025

Netflix Strengthens Longstanding Commitment to Southeast Asia Storytelling at JAFF 2025

Back to All News Netflix Strengthens Longstanding Commitment to Southeast Asia ...

03/12/2025

Breaking the Trend: Small Business Creation Jumps 69% as...

Breaking the Trend: Small Business Creation Jumps 69% as Entrepreneurs Bet Big on Growth Published on Dec 3, 2025 Categories: Data and insights LinkedIn Co...

03/12/2025

Take Five with Emily

Tell us a little bit about your job I am a Digital Marketing Executive working on Paid Social, PPC, and Organic Social. I joined the team in May 2025 and am enj...

03/12/2025

Harmonic and Normann Engineering Achieve Major Milestone with 20 Broadband Deployments Across Europe

SAN JOSE, Calif. and VIENNA - Dec. 3, 2025 - Harmonic (NASDAQ: HLIT) and Normann...

03/12/2025

Mixture of Experts Powers the Most Intelligent Frontier AI Models, Runs 10x Faster on NVIDIA Blackwell NVL72

The top 10 most intelligent open-source models all use a mixture-of-experts arch...

03/12/2025

Tidings of sport across RT this Christmas

S an Nollaig Celebrate 2025's top Irish sporting stars with RT Sport Awards live from RT Studios on RT One and RT Player RT pays tribute to Ror...

02/12/2025

Case Study: How Mid-Atlantic Sports Network Moved to All-IP Distribution in 60 Days

Case Study: How Mid-Atlantic Sports Network Moved to All-IP Distribution in 60 D...

02/12/2025

2025 Sports Broadcasting Hall of Fame: Lee Corso, Coach, Commentator, Firebrand

2025 Sports Broadcasting Hall of Fame: Lee Corso, Coach, Commentator, FirebrandBy Ken Kerschbaumer Tuesday, December 2, 2025 - 7:00 am Print This Story | S...

02/12/2025

SVG All-Stars: Dan Nabors, Senior Director, Remote Engineering, TNT Sports

SVG All-Stars: Dan Nabors, Senior Director, Remote Engineering, TNT SportsThe veteran tech leader is helping guide Warner Bros. Discovery's at-home' re...

02/12/2025

Eubank Jr v Benn II: DAZN on Bringing the Epic Rematch to Spectacular Life with 1080p HDR, Cinematic Cameras and Drones

Epic rematch: DAZN on bringing Eubank Jr v Benn II to spectacular life with 1080...

02/12/2025

National Lacrosse League Opens Season With New Cloud-Based Official Replay-Review System

National Lacrosse League Opens Season With New Cloud-Based Official Replay-Revie...

02/12/2025

Platinum White Paper: The Cinematic Look in Live Production - Bridging Aesthetics and Real-Time Broadcast Technology with Grass Valley

Platinum White Paper: The Cinematic Look in Live Production - Bridging Aesthetic...