
Pinterest has engineered a way to serve its photo-sharing community more of the images they love.
The social-image service, with more than 400 million monthly active users, has trained bigger recommender models for improved accuracy at predicting people's interests.
Pinterest handles hundreds of millions of user requests an hour on any given day. And it must also narrow down relevant images from roughly 300 billion images on the site to roughly 50 for each person.
The last step - ranking the most relevant and engaging content for everyone using Pinterest - required a leap in acceleration to run heftier models, with minimal latency, for better predictions.
Pinterest has improved the accuracy of its recommender models powering people's home feeds and other areas, increasing engagement by as much as 16%.
The leap was enabled by switching from CPUs to NVIDIA GPUs, an approach that could readily be applied next to other areas, including advertising images, according to Pinterest.
"Normally we would be happy with a 2% increase, and 16% is just a beginning for home feeds. We see additional gains - it opens a lot of doors for opportunities," said Pong Eksombatchai, a software engineer at Pinterest.
Transformer models capable of better predictions are shaking up industries from retail to entertainment and advertising. But their performance leaps of the past few years have come with a need to serve models that are some 100x bigger, as their parameter counts and computations skyrocket.
Huge Inference Gains, Same Infrastructure Cost
Like many, Pinterest engineers wanted to tap into state-of-the-art recommender models to increase engagement. But serving these massive models on CPUs presented a 100x increase in cost and latency. That wasn't going to maintain its magical user experience - fresh and more appealing images - occurring within a fraction of a second.
"If that latency happened, then obviously our users wouldn't like that very much because they would have to wait forever," said Eksombatchai. "We are pretty close to the limit of what we can do on CPU basically."
The challenge was to serve these hundredfold larger recommender models within the same cost and latency constraints.
Working with NVIDIA, Pinterest engineers began architectural changes to optimize their inference pipeline and recommender models to enable the transition from CPU to GPU cloud instances. The technology transition began late last year and required major changes to how the company manages workloads. The result is a 100x gain in inference efficiency on the same IT budget, meeting their goals.
"We are starting to use really, really big models now. And that is where the GPU comes in - to help make these models possible," Eksombatchai said.
Tapping Into cuCollections
Switching from CPUs to GPUs required Pinterest to rethink its inference systems architecture. Among other issues, engineers had to change how they send workloads to their inference servers. Fortunately, there are tools that make the transition easier.
The Pinterest inference server built for CPUs had to be altered because it was set up to send smaller batch sizes to its servers. GPUs can handle much larger workloads, so it's necessary to set up larger batch requests to increase efficiency.
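The idea of sending larger batch requests can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Pinterest's server code: the names `make_batches`, `serve` and `score_batch` are hypothetical, and requests are reduced to plain numbers. It shows the model being called once per batch rather than once per request.

```python
from typing import Callable, List, Sequence


def make_batches(requests: Sequence[float], max_batch_size: int) -> List[List[float]]:
    """Group consecutive requests into batches of at most max_batch_size."""
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")
    return [
        list(requests[i:i + max_batch_size])
        for i in range(0, len(requests), max_batch_size)
    ]


def serve(
    requests: Sequence[float],
    score_batch: Callable[[List[float]], List[float]],  # hypothetical model call
    max_batch_size: int,
) -> List[float]:
    """Score every request, invoking the model once per batch."""
    scores: List[float] = []
    for batch in make_batches(requests, max_batch_size):
        scores.extend(score_batch(batch))
    return scores
```

With five requests and `max_batch_size=2`, the model function is invoked three times instead of five. On a GPU the batch size would typically be far larger, so the fixed cost of each call is spread across many more requests.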
One area where this comes into play is with its embedding table lookup module. Embedding tables are used to track interactions between various context-specific features and interests of user profiles. They can track where people navigate and what they Pin, share or otherwise act on, helping refine predictions of what users might like to click on next.
They are used to incrementally learn user preferences based on context in order to make better content recommendations to those using Pinterest. The embedding table lookup module required two computation steps repeated hundreds of times because of the number of features tracked.
Pinterest engineers greatly reduced this number of operations using a GPU-accelerated concurrent hash table from NVIDIA cuCollections. And they set up a custom consolidated embedding lookup module so they could merge requests into a single lookup. Better results were seen immediately.
"Using cuCollections helped us to remove bottlenecks," said Eksombatchai.
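A consolidated embedding lookup of the kind described in this section might look roughly like the following CPU-only Python sketch. It is an assumption-laden illustration: the class name `ConsolidatedEmbeddingTable`, the `(feature, id)` key layout and the toy vectors are all hypothetical, and an ordinary Python `dict` stands in for the GPU hash table. It does not use the cuCollections API. The point is that one merged hash table and one gather replace a separate lookup-and-gather pair per feature.

```python
from typing import Dict, List, Tuple

Key = Tuple[str, int]  # (feature name, raw ID); hypothetical key layout


class ConsolidatedEmbeddingTable:
    """Merge per-feature embedding tables behind one hash-table lookup."""

    def __init__(self, per_feature_tables: Dict[str, Dict[int, List[float]]]):
        self.index: Dict[Key, int] = {}    # single hash table for all features
        self.rows: List[List[float]] = []  # single combined embedding matrix
        for feature, table in per_feature_tables.items():
            for raw_id, vector in table.items():
                self.index[(feature, raw_id)] = len(self.rows)
                self.rows.append(vector)

    def lookup(self, keys: List[Key]) -> List[List[float]]:
        # Step 1: one pass through the merged hash table for every feature.
        # An unknown key raises KeyError in this sketch.
        row_ids = [self.index[key] for key in keys]
        # Step 2: one gather from the combined matrix.
        return [self.rows[i] for i in row_ids]
```

Because the feature name is part of the key, the same raw ID can appear under different features without colliding, for example `("pin", 1)` and `("board", 1)`.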
Enlisting CUDA Graphs
Pinterest relied on CUDA Graphs to eliminate the remaining small-batch operations, further optimizing its inference models.
CUDA Graphs help reduce CPU interactions when launching work on GPUs. They're designed to enable workloads to be defined as graphs rather than single operations. They provide a mechanism to launch multiple GPU operations through a single CPU operation, reducing CPU overheads.
Pinterest enlisted CUDA Graphs to represent the model inference process as a static graph of operations rather than as individually scheduled operations. This enabled the computation to be handled as a single unit, without the overhead of launching each kernel individually.
The company now supports CUDA Graphs as a new backend of its model server. When a model is first loaded, the model server runs the model inference once to build the graph instance. This graph can then be run repeatedly in inference to show content on its app or site.
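The capture-once, replay-many pattern described here can be illustrated with a plain-Python analogy. This is not the CUDA Graphs API and runs nothing on a GPU: `LaunchCounter`, `run_eagerly` and `CapturedGraph` are hypothetical names, and simple functions stand in for GPU kernels. The sketch only counts how often the "CPU" has to submit work in each mode.

```python
from typing import Callable, Sequence

Op = Callable[[float], float]  # stand-in for one GPU kernel


class LaunchCounter:
    """Counts how many times the CPU side submits work."""

    def __init__(self) -> None:
        self.launches = 0


def run_eagerly(ops: Sequence[Op], x: float, counter: LaunchCounter) -> float:
    """Schedule each operation individually: one launch per operation."""
    for op in ops:
        counter.launches += 1
        x = op(x)
    return x


class CapturedGraph:
    """Freeze a sequence of operations once so it can be replayed as a unit."""

    def __init__(self, ops: Sequence[Op]) -> None:
        self._ops = tuple(ops)  # fixed at capture time, like a static graph

    def replay(self, x: float, counter: LaunchCounter) -> float:
        counter.launches += 1  # a single launch covers the whole graph
        for op in self._ops:
            x = op(x)
        return x
```

For a three-operation model, each eager run counts three launches while each replay counts one, and both produce the same result. In this analogy the graph would be built once when the model is loaded and then replayed for every inference request.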
Implementing CUDA Graphs helped Pinterest to significantly reduce inference latency of its recommender models, according to its engineers.
GPUs have enabled Pinterest to do something that was impossible with CPUs on the same budget, and by doing this they can make changes that have a direct impact on various business metrics.
Learn about Pinterest's GPU-driven inference and optimizations at its GTC session, Serving 100x Bigger Recommender Models, and in the Pinterest Engineering blog.
Register for GTC, running Sept. 19-22, for free to attend sessions with NVIDIA and dozens of industry leaders.