Sony Pixel Power calrec Sony

Pinterest Boosts Home Feed Engagement 16% With Switch to GPU Acceleration of Recommenders

04/08/2022

Pinterest has engineered a way to serve its photo-sharing community more of the images they love.

The social-image service, with more than 400 million monthly active users, has trained bigger recommender models for improved accuracy at predicting people's interests.

Pinterest handles hundreds of millions of user requests an hour on any given day. And it must also narrow down relevant images from roughly 300 billion images on the site to roughly 50 for each person.

The last step - ranking the most relevant and engaging content for everyone using Pinterest - required a leap in acceleration to run heftier models, with minimal latency, for better predictions.

Pinterest has improved the accuracy of its recommender models powering people's home feeds and other areas, increasing engagement by as much as 16%.

The leap was enabled by switching from CPUs to NVIDIA GPUs, which could easily be applied next to other areas, including advertising images, according to Pinterest.

Normally we would be happy with a 2% increase, and 16% is just a beginning for home feeds. We see additional gains - it opens a lot of doors for opportunities, said Pong Eksombatchai, a software engineer at Pinterest.

Transformer models capable of better predictions are shaking up industries from retail to entertainment and advertising. But their leaps in performance gains of the past few years have come with a need to serve models that are some 100x bigger as their number of model parameters and computations skyrockets.

Huge Inference Gains, Same Infrastructure Cost Like many, Pinterest engineers wanted to tap into state-of-the-art recommender models to increase engagement. But serving these massive models on CPUs presented a 100x increase in cost and latency. That wasn't going to maintain its magical user experience - fresh and more appealing images - occurring within a fraction of a second.

If that latency happened, then obviously our users wouldn't like that very much because they would have to wait forever, said Eksombatchai. We are pretty close to the limit of what we can do on CPU basically.

The challenge was to serve these hundredfold larger recommender models within the same cost and latency constraints.

Working with NVIDIA, Pinterest engineers began architectural changes to optimize their inference pipeline and recommender models to enable the transition from CPU to GPU cloud instances. The technology transition began late last year and required major changes to how the company manages workloads. The result is a 100x gain in inference efficiency on the same IT budget, meeting their goals.

We are starting to use really, really big models now. And that is where the GPU comes in - to help make these models possible, Eksombatchai said.

Tapping Into cuCollections Switching from CPUs to GPUs required rethinking its inference systems architecture. Among other issues, engineers had to change how they send workloads to their inference servers. Fortunately, there are tools to assist in making the transition easier.

The Pinterest inference server built for CPUs had to be altered because it was set up to send smaller batch sizes to its servers. GPUs can handle much larger workloads, so it's necessary to set up larger batch requests to increase efficiency.

One area where this comes into play is with its embedding table lookup module. Embedding tables are used to track interactions between various context-specific features and interests of user profiles. They can track where you navigate, and what people Pin on Pinterest, share or numerous other actions, helping refine predictions on what users might like to click on next.

They are used to incrementally learn user preference based on context in order to make better content recommendations to those using Pinterest. Its embedding table lookup module required two computation steps repeated hundreds of times because of the number of features tracked.

Pinterest engineers greatly reduced this number of operations using a GPU-accelerated concurrent hash table from NVIDIA cuCollections. And they set up a custom consolidated embedding lookup module so they could merge requests into a single lookup. Better results were seen immediately.

Using cuCollections helped us to remove bottlenecks, said Eksombatchai.

Enlisting CUDA Graphs Pinterest relied on CUDA Graphs to eliminate what was remaining of the small batch operations, further optimizing its inference models.

CUDA Graphs helps reduce the CPU interactions when launching on GPUs. They're designed to enable workloads to be defined as graphs rather than single operations. They provide a mechanism to launch multiple GPU operations through a single CPU operation, reducing CPU overheads.

Pinterest enlisted CUDA Graphs to represent the model inference process as a static graph of operation instead of as those individually scheduled. This enabled the computation to be handled as a single unit without any kernel launching overhead.

The company now supports CUDA Graph as a new backend of its model server. When a model is first loaded, the model server runs the model inference once to build the graph instance. This graph can then be run repeatedly in inference to show content on its app or site.

Implementing CUDA Graphs helped Pinterest to significantly reduce inference latency of its recommender models, according to its engineers.

GPUs have enabled Pinterest to do something that was impossible with CPUs on the same budget, and by doing this they can make changes that have a direct impact on various business metrics.

Learn about Pinterest's GPU-driven inference and optimizations at its GTC session, Serving 100x Bigger Recommender Models, and in the Pinterest Engineering blog.

Register for GTC, running Sept. 19-22, for free to attend sessions with NVIDIA and dozens of industry leaders.
LINK: https://blogs.nvidia.com/blog/2022/08/04/pinterest-gpu-acceleration-re...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

18/03/2026

Neutrik To Showcase opticalCON ADVANCED Connectors At 2026 NAB Show

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

SMPTE Details 2026 NAB Show Educational Sessions

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

Ben Bradshaw Joins PSSI as Director, Product and Network Development

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

Peter Thordarson Joins ASG as Technical Account Executive

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

Survey: Voters Trust TV News Over AI, Social and Search

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

2026 NAB Show Exhibitor Insight: Amazon Web Services (AWS)

Share Copy link Facebook X Linkedin Bluesky Email...

18/03/2026

SMPTE Unveils 2026 NAB Show Educational Presentations

SMPTE , the home of media professionals, technologists, and engineers, today unveiled its educational presentations for the 2026 NAB Show. This year SMPTE will ...

18/03/2026

Maxon Marks Its Official Entry Into the AEC Market With I...

Maxon, maker of powerful, approachable software solutions for creators working in 2D and 3D design, motion graphics, visual effects, gaming, and more, today ann...

18/03/2026

Digital Alert Systems NAB Preview 2026

Digital Alert Systems Preview 2026 NAB Show April 19 - 22 Booth C3452 At the 2026 NAB Show, Digital Alert Systems will showcase Version 6.0 of its DASDEC ...

18/03/2026

Setplex Transforms Video Streaming with AI and Super Aggr...

Setplex today announced that it will showcase its complete, fully integrated Zapflex platform for the first time at the 2026 NAB Show, introducing powerful new ...

18/03/2026

SES Announces Extension of Tender Offer

THIS ANNOUNCEMENT RELATES TO THE DISCLOSURE OF INFORMATION THAT QUALIFIED OR MAY HAVE QUALIFIED AS INSIDE INFORMATION WITHIN THE MEANING OF ARTICLE 7(1) OF THE ...

18/03/2026

COW Jobs: Seeking DP for Low Budget Dramedy - Chicago

COW Jobs: Seeking DP for Low Budget Dramedy - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Director of Photography for Low Budget Dramedy Fe...

18/03/2026

COW Jobs: Seeking Gaffer for Low Budget Dramedy - Chicago

COW Jobs: Seeking Gaffer for Low Budget Dramedy - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Gaffer for Low Budget Dramedy Feature Film- I...

18/03/2026

COW Jobs: Seeking Location, Sound for Low Budget Dramedy - Chicago

COW Jobs: Seeking Location, Sound for Low Budget Dramedy - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Location/Sound for Low Budget Dramed...

18/03/2026

COW Jobs: Seeking Child Wrangler for Low Budget Film - Chicago

COW Jobs: Seeking Child Wrangler for Low Budget Film - Chicago Brie Clayton March 17, 2026 0 Comments Seeking Child Wrangler for Low Budget Dramedy Fe...

18/03/2026

Calrec Redefines Broadcast Workflows at NAB 2026 with its Most Powerful Hardware, Virtual and Hybrid Audio Lineup Yet

Calrec Redefines Broadcast Workflows at NAB 2026 with its Most Powerful Hardware...

18/03/2026

Oscar Nominated Two People Exchanging Saliva Posted with DaVinci Resolve Studio

Oscar Nominated Two People Exchanging Saliva Posted with DaVinci Resolve Studio Brie Clayton March 17, 2026 0 Comments DaVinci Resolve Studio handle...

18/03/2026

Boston Conservatory Presents Celebrated Musical Satire Urinetown

Boston Conservatory Presents Celebrated Musical Satire Urinetown Performances for this Center Stage production will take place at Boston Conservatory Theater ...

18/03/2026

Charlie Puth Joins Switched On Pop at Berklee NYC

Charlie Puth Joins Switched On Pop at Berklee NYC The Berklee alum spoke with host and Berklee NYC professor Charlie Harding for a live taping, answering audi...

17/03/2026

NASA+ Prepares To Live Stream Historic Artemis II Mission, Bringing Deep-Space Exploration to Global Audiences

NASA+'s Rebecca Sirmons and Brittany Brown offer unique look at live streami...

17/03/2026

BBright's TTML & SMPTE ST 2110-43: One Single Stream For the Whole World

The transition to IP has fundamentally reshaped professional media infrastructures. Video, audio, and increasingly metadata now circulate as independent, precis...

17/03/2026

Op-Ed: How Generative AI Is Transforming Live Sports Streaming Optimization

Live sports streaming can push every element in your video delivery chain to its limit, exposing every potential weakness in seconds. When the Super Bowl, the O...

17/03/2026

Dell Case Study: Powering the Future of Sports Media One Experience at a Time at UT Austin

Texas Athletics sought to modernize its media production, enhance fan experience...

17/03/2026

NAB 2026: Ikegami to Showcase Latest Generation TV Production Cameras, Controllers and Monitors

Ikegami USA will demonstrate the latest additions to its wide range of broadcast...

17/03/2026

TNA Wrestling and iHeartMedia Announce Major Multi-Platform Collaboration

TNA Wrestling and iHeartMedia announces a new multi-platform collaboration that will integrate iHeartMedia across TNA's premium live events, weekly televisi...

17/03/2026

The Miami Dolphins and Dell Boost Fan Experience, Safety, and Efficiency at Hard Rock Stadium

The goal was to transform Hard Rock Stadium into a global leader in sports and e...

17/03/2026

Spectrum Launches Multiview for NCAA Basketball Tournaments

Spectrum has announced the launch of its new Multiview feature in the Spectrum TV App, giving customers the ability to watch up to four NCAA men's or women&...

17/03/2026

Pac-12 Inks Integrity/Data Deals With Genius Sports, IC360

Genius Sports deal also covers data technology, AI, fan engagement, and performance analysis....

17/03/2026

Rede Massa Chooses Net Insight to Enable State-Wide Centralized Operations

Net Insight is supporting the rollout of a new state-wide centralized operation with Rede Massa, which is an SBT affiliate, the Brazilian regional television ne...

17/03/2026

F1 The Movie' Wins the Academy Award for Best Sound

Featuring audio from practice sessions, qualifying races, and Grand Prix races, the film represents Apple's sports-media ambitions At Sunday night's Ac...

17/03/2026

SVG New Sponsor Spotlight: Oracle's Mark Ramberg on the Future of Live Broadcast in the Cloud with OCI

Live broadcast has always been one of the most demanding environments in media a...

17/03/2026

DIRECTV Adds Multiview and Sports Central Features Ahead of NCAA Tournament

DirecTV is introducing several new viewing features, including a multi-screen March Madness Mix channel and an updated Sports Central mobile app hub, ahead of...

17/03/2026

Deltatre and ATP Media Announce Multi-Year Broadcast Graphics/Data Partnership

Deltatre has announced a multi-year partnership with ATP Media, the media arm of the ATP Tour, covering broadcast graphics, data, and production across the 2026...

17/03/2026

Detroit Pistons, Scripps Sports To Air Five Games Free Over the Air on TV-20 Detroit

The Detroit Pistons have announced a third consecutive season partnering with Sc...

17/03/2026

How Fresh Finds Africa Propelled Rapper Zaylevelten to a Breakout Year

Fresh Finds Africa spotlights emerging artists and movements across the continent and its global diaspora, with listeners tuning in to discover new Afro-forward...

17/03/2026

Spotify Sparked Viral Moments at G27: genie fest, Driving the Discovery of Thai Rock

Last month, more than 60,000 fans piled into Bangkok's Rajamangala National ...

17/03/2026

Black Rooster Audio release VWB-1X

Vintage-inspired channel strip joins line-up Black Rooster Audio's latest plug-in provides an all-in-one mixing tool inspired by classic analogue consol...

17/03/2026

Accentize unveil dxSplit

Level and EQ voice, reverb and noise independently The latest plug-in to join Accentize's collection is said to take a new approach to dialogue processi...

17/03/2026

RF Spectrum Threat: OFCOM Survey

UHF radio mic & IEM bandwidth at risk Once again, the UHF bandwidth that is currently allocated to RF audio gear is at risk of being reassigned to high-spee...

17/03/2026

SGL Carbon hosts Bavaria's first pilot training course for prospective plant fire department squad leaders

Last Friday, the first Plant Fire Department Training Week in Bavaria successf...

17/03/2026

New campaign from NAATI and SBS CulturalConnnect highlights how we all deserve to be understood'

New campaign from NAATI and SBS CulturalConnnect highlights how we all deserve ...

17/03/2026

NAB Appoints Two New Members to Television Board of Directors

Share Copy link Facebook X Linkedin Bluesky Email...

17/03/2026

PMVG's TechConnect Goes Virtual for 2026

Share Copy link Facebook X Linkedin Bluesky Email...

17/03/2026

Miris unlocks high-fidelity 3D asset streaming at scale

3D streaming infrastructure provider Miris today announced the launch of a public beta for its new 3D asset streaming platform. Miris is building the infrastruc...