Pinterest Boosts Home Feed Engagement 16% With Switch to GPU Acceleration of Recommenders

04/08/2022

Pinterest has engineered a way to serve its photo-sharing community more of the images they love.

The social-image service, with more than 400 million monthly active users, has trained bigger recommender models for improved accuracy at predicting people's interests.

Pinterest handles hundreds of millions of user requests an hour on any given day. And it must also narrow down relevant images from roughly 300 billion images on the site to roughly 50 for each person.

The last step - ranking the most relevant and engaging content for everyone using Pinterest - required a leap in acceleration to run heftier models, with minimal latency, for better predictions.

Pinterest has improved the accuracy of its recommender models powering people's home feeds and other areas, increasing engagement by as much as 16%.

The leap was enabled by switching from CPUs to NVIDIA GPUs, which could easily be applied next to other areas, including advertising images, according to Pinterest.

Normally we would be happy with a 2% increase, and 16% is just a beginning for home feeds. We see additional gains - it opens a lot of doors for opportunities, said Pong Eksombatchai, a software engineer at Pinterest.

Transformer models capable of better predictions are shaking up industries from retail to entertainment and advertising. But their leaps in performance gains of the past few years have come with a need to serve models that are some 100x bigger as their number of model parameters and computations skyrockets.

Huge Inference Gains, Same Infrastructure Cost Like many, Pinterest engineers wanted to tap into state-of-the-art recommender models to increase engagement. But serving these massive models on CPUs presented a 100x increase in cost and latency. That wasn't going to maintain its magical user experience - fresh and more appealing images - occurring within a fraction of a second.

If that latency happened, then obviously our users wouldn't like that very much because they would have to wait forever, said Eksombatchai. We are pretty close to the limit of what we can do on CPU basically.

The challenge was to serve these hundredfold larger recommender models within the same cost and latency constraints.

Working with NVIDIA, Pinterest engineers began architectural changes to optimize their inference pipeline and recommender models to enable the transition from CPU to GPU cloud instances. The technology transition began late last year and required major changes to how the company manages workloads. The result is a 100x gain in inference efficiency on the same IT budget, meeting their goals.

We are starting to use really, really big models now. And that is where the GPU comes in - to help make these models possible, Eksombatchai said.

Tapping Into cuCollections Switching from CPUs to GPUs required rethinking its inference systems architecture. Among other issues, engineers had to change how they send workloads to their inference servers. Fortunately, there are tools to assist in making the transition easier.

The Pinterest inference server built for CPUs had to be altered because it was set up to send smaller batch sizes to its servers. GPUs can handle much larger workloads, so it's necessary to set up larger batch requests to increase efficiency.

One area where this comes into play is with its embedding table lookup module. Embedding tables are used to track interactions between various context-specific features and interests of user profiles. They can track where you navigate, and what people Pin on Pinterest, share or numerous other actions, helping refine predictions on what users might like to click on next.

They are used to incrementally learn user preference based on context in order to make better content recommendations to those using Pinterest. Its embedding table lookup module required two computation steps repeated hundreds of times because of the number of features tracked.

Pinterest engineers greatly reduced this number of operations using a GPU-accelerated concurrent hash table from NVIDIA cuCollections. And they set up a custom consolidated embedding lookup module so they could merge requests into a single lookup. Better results were seen immediately.

Using cuCollections helped us to remove bottlenecks, said Eksombatchai.

Enlisting CUDA Graphs Pinterest relied on CUDA Graphs to eliminate what was remaining of the small batch operations, further optimizing its inference models.

CUDA Graphs helps reduce the CPU interactions when launching on GPUs. They're designed to enable workloads to be defined as graphs rather than single operations. They provide a mechanism to launch multiple GPU operations through a single CPU operation, reducing CPU overheads.

Pinterest enlisted CUDA Graphs to represent the model inference process as a static graph of operation instead of as those individually scheduled. This enabled the computation to be handled as a single unit without any kernel launching overhead.

The company now supports CUDA Graph as a new backend of its model server. When a model is first loaded, the model server runs the model inference once to build the graph instance. This graph can then be run repeatedly in inference to show content on its app or site.

Implementing CUDA Graphs helped Pinterest to significantly reduce inference latency of its recommender models, according to its engineers.

GPUs have enabled Pinterest to do something that was impossible with CPUs on the same budget, and by doing this they can make changes that have a direct impact on various business metrics.

Learn about Pinterest's GPU-driven inference and optimizations at its GTC session, Serving 100x Bigger Recommender Models, and in the Pinterest Engineering blog.

Register for GTC, running Sept. 19-22, for free to attend sessions with NVIDIA and dozens of industry leaders.

LINK:	https://blogs.nvidia.com/blog/2022/08/04/pinterest-gpu-acceleration-re...
	See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

07/10/2026

Dalet Flex LTS Delivers Smarter Media Operations from Ingest to Distribution

Dalet, a leading technology and service provider for media-rich organizations, today announced the latest Long-Term Supported (LTS) release of Dalet Flex. Build...

06/09/2026

Dolby and MagentaTV Bring Fans Closer to the FIFA World Cup 2026 in Germany with Dolby Vision and Dolby Atmos

June 9 2026, 23:00 (PDT) Dolby and MagentaTV Bring Fans Closer to the FIFA Worl...

06/08/2026

Hisense Adds Dolby Vision 2 to Select Models

Share Copy link Facebook X Linkedin Bluesky Email...

06/08/2026

MediaKind to showcase one unified portfolio at IBC2026

At IBC2026, MediaKind will make its first major appearance as a unified global powerhouse in video, showcasing one of the world's most comprehensive video i...

06/08/2026

Big Blue Marble to unveil AI-assisted piracy detection an...

Big Blue Marble (#5.A63) will demonstrate how its integrated technology and operational expertise help media companies scale premium services with less complexi...

06/08/2026

NIH expected to award Scripps Research nearly $4.2 million over 5 years to advance tools for vaccine design

LA JOLLA, CA-Scripps Research has received more than $500,000 in first-year fund...

06/08/2026

Improving vaccine design for Ebola, HIV and more

LA JOLLA, CA-Viruses are masters at invading our cells thanks to specialized proteins that coat their surfaces. When scientists design vaccines, they often crea...

06/08/2026

How a chemical reaction triggers brain inflammation in Alzheimer's disease

LA JOLLA, CA-The brain has its own immune system, which detects threats and mounts a defense. A growing body of evidence has shown that in Alzheimer's disea...

06/08/2026

Jin-Quan Yu elected to the National Academy of Sciences

LA JOLLA, CA-Scripps Research chemist Jin-Quan Yu has been elected to the National Academy of Sciences (NAS), one of the highest honors a scientist can achieve....

06/08/2026

Scripps Research ranks third in 2026 Cure Innovation Index

LA JOLLA, CA-Scripps Research ranked third in the inaugural 2026 Cure Innovation Index recognizing the top-performing institutes and centers across the United S...

06/08/2026

Scripps Research chemist Benjamin Cravatt elected to American Philosophical Society

Benjamin Cravatt, the Gilula Chair of Chemical Biology and a professor of chemis...

06/08/2026

Scripps Research immunologist Dennis Burton elected to American Academy of Arts and Sciences

LA JOLLA, CA-Dennis Burton, professor and the James & Jessie Minor Chair in Immu...

06/08/2026

How changes to proteins can alter drug interactions for new precision therapies

LA JOLLA, CA-Inside every human cell, proteins are constantly being tagged with small chemical modifications after they're produced. Known as post-translati...

06/08/2026

Scripps Research establishes endowed chair honoring renowned structural biologist Ian Wilson

LA JOLLA-Scripps Research has established the Ian Wilson Endowed Chair, a new fa...

06/08/2026

Scripps Research's Skaggs Graduate School awards doctoral degrees to 34th graduating class

Scripps Research's Skaggs Graduate School of Chemical and Biological Science...

06/08/2026

Scripps Research chemist Jin-Quan Yu is named a Fellow of the Royal Society

LA JOLLA, CA-Professor Jin-Quan Yu of Scripps Research has been elected to the Fellowship of the Royal Society, the U.K.'s national academy of sciences and ...

06/08/2026

Experimental HIV vaccine achieves a long-sought goal

LA JOLLA, CA-For years, researchers have been hoping for vaccines that protect people against not just one strain of HIV, but every strain of the quickly mutati...

06/08/2026

Calibr-Skaggs advances CLF065, a regenerative GLP-2 therapy, into two Phase 2 IBD studies

LA JOLLA, CA-The Calibr-Skaggs Institute for Innovative Medicines, the nonprofit...

06/08/2026

Chemists snap together complex 3D molecules from highly reactive radicals'-without losing their shape

LA JOLLA, CA-Building the complex 3D molecules needed for new medicines has alwa...

06/08/2026

A fentanyl countermeasure that adapts to combat future black-market drugs

LA JOLLA, CA-Fentanyl and related variants of the synthetic opioid kill more Americans each year than car accidents and gun violence combined. In too-high doses...

06/08/2026

Two Scripps Research assistant professors named 2026 Baxter Young Investigators

LA JOLLA, CA-What do decoding communication between organs and reimagining the future of genome editing have in common? They're among the scientific questio...

06/08/2026

Calibr-Skaggs awarded $5.1M by NIH to develop long-acting hepatitis B virus therapy

LA JOLLA, CA-Of the 1.2 million people living with HIV in the United States, app...

06/08/2026

Lab studies explain how new cancer drug works as it enters patient testing

LA JOLLA, CA-For some people, cancer immunotherapies are life-changing. These treatments can turn the body's own immune system against a tumor, either elimi...

06/08/2026

Newly identified molecule strengthens the eye's response to damage in retinal disease

LA JOLLA, CA-Many conditions that cause vision loss share a common feature: the ...

06/08/2026

Molecular scissors caught in action: A structural blueprint for RNA therapeutics

LA JOLLA, CA-RNA interference is a natural mechanism for living cells to control whether specific genes are being used or not. Crowned with the 2006 Nobel Priz...

06/08/2026

Immune molecule may drive excessive drinking in alcohol use disorder

LA JOLLA, CA-The drugs that keep rheumatoid arthritis in check may one day help people stop drinking. A new Scripps Research study shows that an anti-inflammato...

06/08/2026

Back in action: Researchers make drug-resistant bacteria vulnerable again

LA JOLLA, CA-Antibiotic resistance is one of the most urgent threats to global health, linked to an estimated 4.7 million deaths worldwide in 2019 alone. As mor...

06/08/2026

Scripps Research scientists demonstrate a faster, cheaper route to making critical drugs using common table sugar

LA JOLLA, CA-Some of the world's best-selling diabetes drugs depend on a che...

06/08/2026

Scripps Research scientists awarded $2M to advance global disease surveillance

LA JOLLA, CA-Detecting infectious disease threats early and responding quickly can dramatically alter the course of an infectious outbreak. Technologies such as...

06/08/2026

Joan Pulupa joins Scripps Research faculty to study the organization of DNA in brain cells and its links to neurodegeneration

LA JOLLA, CA-Molecular biophysicist Joan Pulupa will join Scripps Research in Ja...

06/08/2026

Scripps Research scientists train the immune system to make antibodies against numerous HIV strains

LA JOLLA, CA-HIV is globally so diverse, consisting of hundreds of thousands of ...

06/08/2026

ASG Ups Michele Ferreira to Chief Business Officer

Share Copy link Facebook X Linkedin Bluesky Email...

06/08/2026

WNBC and WNJU Expand New York Giants Deal

Share Copy link Facebook X Linkedin Bluesky Email...

06/08/2026

Utah Scientific to Highlight NBOSS at IBC 2026

Share Copy link Facebook X Linkedin Bluesky Email...

06/08/2026

Disney, TikTok Ink Global Short-Form Content-Sharing Deal

Share Copy link Facebook X Linkedin Bluesky Email...

06/08/2026

Kane Peterson Joins QuickLink's North America Team

Share Copy link Facebook X Linkedin Bluesky Email...

06/08/2026

FCC Returns $881 Million in Unused TV Broadcaster Relocation Funds

Share Copy link Facebook X Linkedin Bluesky Email...

06/08/2026

PlayBox Neo to highlight secure and scalable workflow del...

At this year's SET EXPO, PlayBox Neo will present recent innovations across its PlayBox Neo Suite and integrated range of broadcast media solutions. By show...

06/08/2026

Modern Streaming Solutions Private Limited Partners with...

Live demo at IBC2026 VisualOn Booth, Hall 5, Stand A55 Amsterdam, Netherlands Modern Streaming Solutions Private Limited, a rising force in India's dig...

06/08/2026

Screen Australia and YouTube Australia fund five new teams through eleventh Skip Ahead initiative

Screen Australia and YouTube Australia fund five new teams through eleventh Skip...

06/08/2026

How Karukera Studio Built a Media Production Hub with SNS EVO

How Karukera Studio Built a Media Production Hub with SNS EVO Melanie Ciotti August 5, 2026 0 Comments Hero images displays Karukera Studio in Sainte-...

06/08/2026

NHK drama series Rosanjin no Kamado shot on PYXIS 6K

NHK drama series Rosanjin no Kamado shot on PYXIS 6K Brie Clayton August 5, 2026 0 Comments Blackmagic PYXIS 6K and DaVinci Resolve Studio capture the...

06/08/2026

Camera Match: AutoSetup 3d Plugin / script for Photomontages- Cinema 4D

Camera Match: AutoSetup 3d Plugin / script for Photomontages- Cinema 4D Jamie Cardoso August 5, 2026 0 Comments If you do any kind of Architectural Vi...

06/08/2026

Documentary on the life and impact of Dnal Lunny to air on RT

In Time airs Monday 10 August at 9.35pm on RT One and RT Player In Time is the first film account of the life of D nal Lunny, one of the most influential Iri...

06/08/2026

Into the Omniverse: How Open World Models Push the Frontier of Physical AI

Editor's note: This post is part of Into the Omniverse, a series focused on how developers, 3D practitioners and enterprises can transform their workflows u...

06/08/2026

GeForce NOW Shakes Up August With 26 New Games

August is here, bringing 26 new games for GeForce NOW members. Command the seas in World of Warships: Legends and discover what's next in the GeForce NOW ...

05/08/2026

Thomson toolkit added to virtual hub built to strengthen climate storytelling

Thomson's Climate Crisis Toolkit for Media in Tanzania is being included in a new virtual library designed to ensure accurate and engaging storytelling arou...

05/08/2026

SVG Regional Sports Production Summit 2026: All Sessions Now Available to Watch on SVG PLAY

RSNs, teams, leagues, and streamers explore the present and future of local spor...

05/08/2026

SVG New Sponsor Spotlight: Hitachi Vantara's Lenny Khaitov on Building Resilient Data Infrastructure for Sports Production

As sports-production workflows generate larger volumes of unstructured data and ...

View most recent headlines