Sony Pixel Power calrec Sony

Pinterest Boosts Home Feed Engagement 16% With Switch to GPU Acceleration of Recommenders

04/08/2022

Pinterest has engineered a way to serve its photo-sharing community more of the images they love.

The social-image service, with more than 400 million monthly active users, has trained bigger recommender models for improved accuracy at predicting people's interests.

Pinterest handles hundreds of millions of user requests an hour on any given day. And it must also narrow down relevant images from roughly 300 billion images on the site to roughly 50 for each person.

The last step - ranking the most relevant and engaging content for everyone using Pinterest - required a leap in acceleration to run heftier models, with minimal latency, for better predictions.

Pinterest has improved the accuracy of its recommender models powering people's home feeds and other areas, increasing engagement by as much as 16%.

The leap was enabled by switching from CPUs to NVIDIA GPUs, which could easily be applied next to other areas, including advertising images, according to Pinterest.

Normally we would be happy with a 2% increase, and 16% is just a beginning for home feeds. We see additional gains - it opens a lot of doors for opportunities, said Pong Eksombatchai, a software engineer at Pinterest.

Transformer models capable of better predictions are shaking up industries from retail to entertainment and advertising. But their leaps in performance gains of the past few years have come with a need to serve models that are some 100x bigger as their number of model parameters and computations skyrockets.

Huge Inference Gains, Same Infrastructure Cost Like many, Pinterest engineers wanted to tap into state-of-the-art recommender models to increase engagement. But serving these massive models on CPUs presented a 100x increase in cost and latency. That wasn't going to maintain its magical user experience - fresh and more appealing images - occurring within a fraction of a second.

If that latency happened, then obviously our users wouldn't like that very much because they would have to wait forever, said Eksombatchai. We are pretty close to the limit of what we can do on CPU basically.

The challenge was to serve these hundredfold larger recommender models within the same cost and latency constraints.

Working with NVIDIA, Pinterest engineers began architectural changes to optimize their inference pipeline and recommender models to enable the transition from CPU to GPU cloud instances. The technology transition began late last year and required major changes to how the company manages workloads. The result is a 100x gain in inference efficiency on the same IT budget, meeting their goals.

We are starting to use really, really big models now. And that is where the GPU comes in - to help make these models possible, Eksombatchai said.

Tapping Into cuCollections Switching from CPUs to GPUs required rethinking its inference systems architecture. Among other issues, engineers had to change how they send workloads to their inference servers. Fortunately, there are tools to assist in making the transition easier.

The Pinterest inference server built for CPUs had to be altered because it was set up to send smaller batch sizes to its servers. GPUs can handle much larger workloads, so it's necessary to set up larger batch requests to increase efficiency.

One area where this comes into play is with its embedding table lookup module. Embedding tables are used to track interactions between various context-specific features and interests of user profiles. They can track where you navigate, and what people Pin on Pinterest, share or numerous other actions, helping refine predictions on what users might like to click on next.

They are used to incrementally learn user preference based on context in order to make better content recommendations to those using Pinterest. Its embedding table lookup module required two computation steps repeated hundreds of times because of the number of features tracked.

Pinterest engineers greatly reduced this number of operations using a GPU-accelerated concurrent hash table from NVIDIA cuCollections. And they set up a custom consolidated embedding lookup module so they could merge requests into a single lookup. Better results were seen immediately.

Using cuCollections helped us to remove bottlenecks, said Eksombatchai.

Enlisting CUDA Graphs Pinterest relied on CUDA Graphs to eliminate what was remaining of the small batch operations, further optimizing its inference models.

CUDA Graphs helps reduce the CPU interactions when launching on GPUs. They're designed to enable workloads to be defined as graphs rather than single operations. They provide a mechanism to launch multiple GPU operations through a single CPU operation, reducing CPU overheads.

Pinterest enlisted CUDA Graphs to represent the model inference process as a static graph of operation instead of as those individually scheduled. This enabled the computation to be handled as a single unit without any kernel launching overhead.

The company now supports CUDA Graph as a new backend of its model server. When a model is first loaded, the model server runs the model inference once to build the graph instance. This graph can then be run repeatedly in inference to show content on its app or site.

Implementing CUDA Graphs helped Pinterest to significantly reduce inference latency of its recommender models, according to its engineers.

GPUs have enabled Pinterest to do something that was impossible with CPUs on the same budget, and by doing this they can make changes that have a direct impact on various business metrics.

Learn about Pinterest's GPU-driven inference and optimizations at its GTC session, Serving 100x Bigger Recommender Models, and in the Pinterest Engineering blog.

Register for GTC, running Sept. 19-22, for free to attend sessions with NVIDIA and dozens of industry leaders.
LINK: https://blogs.nvidia.com/blog/2022/08/04/pinterest-gpu-acceleration-re...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

07/01/2026

New AES Technical Document Focuses on Dialogue Intelligibility

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

07/01/2026

CTA: U.S. Consumer Tech Revenue to Hit $565 Billion in 2026

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

07/01/2026

NBCUniversal Upgrades AI-Powered Guide for 2026 Winter Olympics

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

07/01/2026

Parks: Smart TVs Are Primary Streaming Device in U.S. Homes

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

07/01/2026

Clear-Com to Feature 4-Channel HelixNet Beltpack at ISE 2026

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

07/01/2026

Cable Center to Sell Its Building to University of Denver

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

07/01/2026

CES: Aktas AI-First Video Platform Adds New Capabilities

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

07/01/2026

From Warehouse to Wallet: New State of AI in Retail and CPG Survey Uncovers How AI Is Rewiring Supply Chains and Customer Experiences

AI has transformed retail and consumer packaged goods (CPG) operations, enhancin...

06/01/2026

Peacock Bringing Dolby Vision and Dolby Atmos to Live Sports Content

Peacock Bringing Dolby Vision and Dolby Atmos to Live Sports ContentPeacock is the first streamer to embrace Dolbys full suite of advanced picture/sound innovat...

06/01/2026

Milano Cortina 2026: Listening to the Sounds of Powder and Ice With a Behind the Scenes Tour of OBS and NBC's Audio Set Ups

SVG Europe Audio: Listening to the sounds of powder and ice at Milano Cortina wi...

06/01/2026

Quintar Meta Spatial SDK Integration Promises Next-Level XR Experiences

Quintar Meta Spatial SDK Integration Promises Next-Level XR ExperiencesBy Ken Kerschbaumer, Editorial Director Tuesday, January 6, 2026 - 9:41 am Print This...

06/01/2026

Milano Cortina 2026: BBC Sport Previews Broadcast Ops, Studio Setup, Social Media Plans, and More

Milano Cortina 2026: BBC Sport Previews Broadcast Ops, Studio Setup, Social Medi...

06/01/2026

Dolby's Jason Power on Elevating Live Sports Through Immersive Audio and HDR Imagery

Dolby's Jason Power on Elevating Live Sports Through Immersive Audio and HDR...

06/01/2026

DAZN's Global CRO Walker Jacobs on the Streamer's Breakout Year in the U.S.

DAZN's Global CRO Walker Jacobs on the Streamer's Breakout Year in the U...

06/01/2026

California Dreamin': LA28's SVP of Media Jim Bell Previews the Olympic and Paralympic Games' Return to the U.S.

California Dreamin': LA28's SVP of Media Jim Bell Previews the Olympic a...

06/01/2026

One Month Out From Winter Olympics Opening Ceremony, NBC Sports in Final Prep for a Legendary February'

One Month Out From Winter Olympics Opening Ceremony, NBC Sports in Final Prep fo...

06/01/2026

Advanced HDR by Technicolor and Zinwell Integrated onto ATSC 3.0 Conversion Boxes

Advanced HDR by Technicolor and Zinwell Integrated onto ATSC 3.0 Conversion Boxe...

06/01/2026

Fox Sports President and COO Mark Silverman Shifts to Consulting Role

Fox Sports President and COO Mark Silverman Shifts to Consulting RoleBy Ken Kerschbaumer, Editorial Director Tuesday, January 6, 2026 - 2:37 pm Print This S...

06/01/2026

Spotify Celebrates Podcasting's Big Moment Ahead of First-Ever Golden Globes Category

Spotify is launching a week-long celebration spotlighting creators at the center...

06/01/2026

Spotify's New Gov Ball Experience Turns the 2026 Lineup Into a Personalized Data Story for Every Fan

Lorde. A$AP Rocky. JENNIE. Baby Keem. KATSEYE. That's just a taste of who...

06/01/2026

L3Harris CFO and Missile Solutions President Appears on CNBC

In a live broadcast, L3Harris CFO and Missile Solutions President Ken Bedingfield joined Morgan Brennan on CNBCs Closing Bell: Overtime. Bedingfield discussed...

06/01/2026

Index Exchange Launches Gracenote-Powered Show-Level Reporting with Spectrum Reach

First-of-its-kind SSP capability delivers program-level insight and brand suitab...

06/01/2026

iWedia, Skyworth Partner on Turnkey Solutions for NextGen TV

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Hub: Younger Viewers More Receptive to Ads on Streaming Services

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Zixi Hires Heather Mellish as Vice President, Global Sales

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Scripps Names Amira Lewally as Senior Director, Original Programming

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Carr Appoints New FCC Chief Economist

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

iWedia Highlights NextGen TV Software Stack at CES

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Boston Conservatory at Berklee Kicks Off Spring Center Stage Season in February

Boston Conservatory at Berklee Kicks Off Spring Center Stage Season in February Boston Conservatory at Berklee's Center Stage season continues this spring...

06/01/2026

Operative Announces the New AOS Services Platform Foundat...

Operative, the preferred advertising management solution provider for the world's leading media brands, today announced a new AOS Services Platform, advanci...

06/01/2026

Beam Dynamics Expands Leadership to Drive Product and Ent...

Beam Dynamics has strengthened its leadership team with two appointments to support the company's next stage of growth. Jonathan Rollman will lead product s...

06/01/2026

MNC Software strengthens global sales team with appointme...

MNC Software, a global leader in network management and operational support systems tailored to the broadcast and media industry, has today announced the appoin...

06/01/2026

JAS teams with Friend MTS to protect Premier League

Friend MTS, the number one anti-piracy provider and video cybersecurity partner in entertainment, media and sports, today announced a partnership with Jasmine I...

06/01/2026

Kiloview New Innovations at ISE 2026 - Advancing Its Inte...

Kiloview, a global leader in AV-over-IP solutions, will showcase its latest innovations at ISE 2026, highlighting the continued evolution of its complete, light...

06/01/2026

Opponents Urge FCC to Reject Nexstar-Tegna Takeover

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

DirecTV Launches Streaming Solution for Small Businesses

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Public Broadcasters Deploy Advanced HDR by Technicolor

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Comcast's Versant Spinoff Goes Public

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Board Votes to Dissolve Corporation for Public Broadcasting

Share Share by: Copy link Facebook X Whatsapp Pinterest Flipboard...

06/01/2026

Channel 4 and UKTV Deliver Unmissable Entertainment and More Choice for Viewers

Channel 4 and UKTV are giving viewers even more reasons to stream, with UKTV's U service set to feature thousands of hours of free, unmissable and bingeable...

06/01/2026

An update on our Sky Mobile prices

Tuesday 6 January 2026 An update on our Sky Mobile prices Devesh Raj, Chief Operating Officer, Sky Today, we've announced some changes to the prices of ...

06/01/2026

The Wait is Finally Over... 'Museum of Innocence' Premiering on Netflix February 13th

Back to All News The Wait is Finally Over... Museum of Innocence Premiering on ...

06/01/2026

Comscore Launches Daily Program-Level Reporting with Deduplicated Insights on Shows and Episodes across CTV and Linear TV

Comscore Launches Daily Program-Level Reporting with Deduplicated Insights on Sh...

06/01/2026

Comscore Completes Recapitalization Transaction with Preferred Stockholders Following Approval from Common Stockholders

Comscore Completes Recapitalization Transaction with Preferred Stockholders Foll...

05/01/2026

NFL's Blake Jones on How the League's Latest Video Additions Change How Viewers Watch Football

NFL's Blake Jones on How the League's Latest Video Additions Change How ...

05/01/2026

Spiideo's Scott Bushman on How AI Is Already Powering Live Sports Production

Spiideo's Scott Bushman on How AI Is Already Powering Live Sports ProductionFrom a line of camera systems to its automated production platform, the tech ven...