
Pinterest has engineered a way to serve its photo-sharing community more of the images they love.
The social-image service, with more than 400 million monthly active users, has trained bigger recommender models for improved accuracy at predicting people's interests.
Pinterest handles hundreds of millions of user requests an hour on any given day. For each person, it must also narrow roughly 300 billion images on the site down to roughly 50 relevant ones.
The last step - ranking the most relevant and engaging content for everyone using Pinterest - required a leap in acceleration to run heftier models, with minimal latency, for better predictions.
Pinterest has improved the accuracy of its recommender models powering people's home feeds and other areas, increasing engagement by as much as 16%.
The leap was enabled by switching from CPUs to NVIDIA GPUs, an approach that could easily be applied next to other areas, including advertising images, according to Pinterest.
"Normally we would be happy with a 2% increase, and 16% is just a beginning for home feeds. We see additional gains - it opens a lot of doors for opportunities," said Pong Eksombatchai, a software engineer at Pinterest.
Transformer models capable of better predictions are shaking up industries from retail to entertainment and advertising. But their performance leaps of the past few years have come with a need to serve models that are some 100x bigger, as the number of model parameters and computations skyrockets.
Huge Inference Gains, Same Infrastructure Cost
Like many others, Pinterest engineers wanted to tap into state-of-the-art recommender models to increase engagement. But serving these massive models on CPUs presented a 100x increase in cost and latency. That would not have preserved Pinterest's magical user experience - fresh and more appealing images - delivered within a fraction of a second.
"If that latency happened, then obviously our users wouldn't like that very much because they would have to wait forever," said Eksombatchai. "We are pretty close to the limit of what we can do on CPU basically."
The challenge was to serve these hundredfold larger recommender models within the same cost and latency constraints.
Working with NVIDIA, Pinterest engineers began architectural changes to optimize their inference pipeline and recommender models to enable the transition from CPU to GPU cloud instances. The technology transition began late last year and required major changes to how the company manages workloads. The result is a 100x gain in inference efficiency on the same IT budget, meeting their goals.
"We are starting to use really, really big models now. And that is where the GPU comes in - to help make these models possible," Eksombatchai said.
Tapping Into cuCollections
Switching from CPUs to GPUs required Pinterest to rethink its inference systems architecture. Among other issues, engineers had to change how they send workloads to their inference servers. Fortunately, there are tools to make the transition easier.
The Pinterest inference server built for CPUs had to be altered because it was set up to send smaller batch sizes to its servers. GPUs can handle much larger workloads, so it's necessary to set up larger batch requests to increase efficiency.
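The batching idea can be illustrated with a minimal sketch. The article does not describe how Pinterest's server forms batches, so the function name `merge_into_batches`, the `max_batch_size` parameter and the sample candidate IDs below are all hypothetical; the sketch only shows many small requests being merged into fewer, larger ones.

```python
from typing import Iterator, Sequence


def merge_into_batches(requests: Sequence[list], max_batch_size: int) -> Iterator[list]:
    """Concatenate small per-request item lists into larger batches."""
    if max_batch_size <= 0:
        raise ValueError("max_batch_size must be positive")
    batch: list = []
    for items in requests:
        for item in items:
            batch.append(item)
            if len(batch) == max_batch_size:
                yield batch
                batch = []
    if batch:  # flush the final, possibly smaller, batch
        yield batch


# Six small requests (hypothetical candidate IDs) become two larger batches.
small_requests = [[1, 2], [3], [4, 5, 6], [7], [8, 9], [10]]
batches = list(merge_into_batches(small_requests, max_batch_size=8))
print([len(b) for b in batches])
```

A real server would also need to cap how long a request waits for its batch to fill, since larger batches trade a little queueing delay for higher throughput.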
One area where this comes into play is the embedding table lookup module. Embedding tables are used to track interactions between various context-specific features and the interests of user profiles. They can track where people navigate, what they Pin or share on Pinterest, and numerous other actions, helping refine predictions of what users might like to click on next.
They are used to incrementally learn user preferences based on context in order to make better content recommendations to those using Pinterest. Pinterest's embedding table lookup module required two computation steps repeated hundreds of times because of the number of features tracked.
Pinterest engineers greatly reduced this number of operations using a GPU-accelerated concurrent hash table from NVIDIA cuCollections. And they set up a custom consolidated embedding lookup module so they could merge requests into a single lookup. Better results were seen immediately.
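As a rough, CPU-only illustration of merging many per-feature lookups into a single one, the sketch below uses a plain Python dictionary as a stand-in for the hash table. cuCollections itself is a C++/CUDA library and is not called here; the feature names, IDs, vector values and the `lookup_all` helper are hypothetical, and the article does not describe the internals of Pinterest's module.

```python
from typing import Dict, List, Tuple

Key = Tuple[str, int]   # (feature name, ID within that feature)
Vector = List[float]

# Hypothetical per-feature embedding tables with made-up values.
per_feature_tables: Dict[str, Dict[int, Vector]] = {
    "board": {7: [0.1, 0.2], 9: [0.3, 0.4]},
    "topic": {7: [0.5, 0.6]},
}

# Build one consolidated table keyed by (feature, id), so the same ID
# under different features cannot collide.
consolidated: Dict[Key, Vector] = {
    (feature, idx): vec
    for feature, table in per_feature_tables.items()
    for idx, vec in table.items()
}


def lookup_all(keys: List[Key], dim: int = 2) -> List[Vector]:
    """Resolve every feature's IDs in one pass over a single table.

    Keys that are not in the table fall back to a zero vector.
    """
    zero = [0.0] * dim
    return [consolidated.get(key, zero) for key in keys]


request = [("board", 7), ("topic", 7), ("board", 42)]
vectors = lookup_all(request)
print(vectors)
```

The design point is that one call covers all features, instead of one lookup per feature repeated hundreds of times.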
"Using cuCollections helped us to remove bottlenecks," said Eksombatchai.
Enlisting CUDA Graphs
Pinterest relied on CUDA Graphs to eliminate the remaining small-batch operations, further optimizing its inference models.
CUDA Graphs help reduce CPU interactions when launching work on GPUs. They're designed to enable workloads to be defined as graphs rather than single operations. They provide a mechanism to launch multiple GPU operations through a single CPU operation, reducing CPU overheads.
Pinterest enlisted CUDA Graphs to represent the model inference process as a static graph of operations instead of as individually scheduled operations. This enabled the computation to be handled as a single unit, without the overhead of launching each kernel individually.
The company now supports CUDA Graphs as a new backend of its model server. When a model is first loaded, the model server runs inference once to build the graph instance. This graph can then be run repeatedly in inference to show content on its app or site.
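The load-once, replay-many pattern can be shown with a toy, CPU-only analogy. The sketch does not call the CUDA runtime and is not Pinterest's backend; the `RecordedGraph` class, the `capture` function and the three-step "model" are all hypothetical. It only illustrates recording a fixed sequence of operations during one warm-up run and then replaying it with a single call per request.

```python
from typing import Callable, List

Op = Callable[[float], float]


class RecordedGraph:
    """Toy stand-in for a captured graph: a fixed sequence of operations."""

    def __init__(self) -> None:
        self.ops: List[Op] = []

    def replay(self, x: float) -> float:
        # One call from the caller runs the whole recorded sequence.
        for op in self.ops:
            x = op(x)
        return x


def capture(model_ops: List[Op], warmup_input: float) -> RecordedGraph:
    """Run the model once, recording each operation as it executes."""
    graph = RecordedGraph()
    x = warmup_input
    for op in model_ops:
        x = op(x)             # execute during the warm-up pass
        graph.ops.append(op)  # and record it for later replays
    return graph


# Hypothetical three-step "model": scale, shift, clamp at zero.
model_ops: List[Op] = [lambda x: x * 2.0, lambda x: x + 1.0, lambda x: max(x, 0.0)]
graph = capture(model_ops, warmup_input=0.0)      # built once at load time
outputs = [graph.replay(x) for x in (1.0, -3.0)]  # reused for each request
print(outputs)
```

On a GPU the benefit comes from submitting the whole recorded sequence with one launch from the CPU; this analogy mirrors only the control flow, not the performance effect.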
Implementing CUDA Graphs helped Pinterest to significantly reduce inference latency of its recommender models, according to its engineers.
GPUs have enabled Pinterest to do something that was impossible with CPUs on the same budget, and by doing this the company can make changes that have a direct impact on various business metrics.
Learn about Pinterest's GPU-driven inference and optimizations at its GTC session, Serving 100x Bigger Recommender Models, and in the Pinterest Engineering blog.
Register for GTC, running Sept. 19-22, for free to attend sessions with NVIDIA and dozens of industry leaders.