Sony Pixel Power calrec Sony

Pinterest Boosts Home Feed Engagement 16% With Switch to GPU Acceleration of Recommenders

04/08/2022

Pinterest has engineered a way to serve its photo-sharing community more of the images they love.

The social-image service, with more than 400 million monthly active users, has trained bigger recommender models for improved accuracy at predicting people's interests.

Pinterest handles hundreds of millions of user requests an hour on any given day. And it must also narrow down relevant images from roughly 300 billion images on the site to roughly 50 for each person.

The last step - ranking the most relevant and engaging content for everyone using Pinterest - required a leap in acceleration to run heftier models, with minimal latency, for better predictions.

Pinterest has improved the accuracy of its recommender models powering people's home feeds and other areas, increasing engagement by as much as 16%.

The leap was enabled by switching from CPUs to NVIDIA GPUs, which could easily be applied next to other areas, including advertising images, according to Pinterest.

Normally we would be happy with a 2% increase, and 16% is just a beginning for home feeds. We see additional gains - it opens a lot of doors for opportunities, said Pong Eksombatchai, a software engineer at Pinterest.

Transformer models capable of better predictions are shaking up industries from retail to entertainment and advertising. But their leaps in performance gains of the past few years have come with a need to serve models that are some 100x bigger as their number of model parameters and computations skyrockets.

Huge Inference Gains, Same Infrastructure Cost Like many, Pinterest engineers wanted to tap into state-of-the-art recommender models to increase engagement. But serving these massive models on CPUs presented a 100x increase in cost and latency. That wasn't going to maintain its magical user experience - fresh and more appealing images - occurring within a fraction of a second.

If that latency happened, then obviously our users wouldn't like that very much because they would have to wait forever, said Eksombatchai. We are pretty close to the limit of what we can do on CPU basically.

The challenge was to serve these hundredfold larger recommender models within the same cost and latency constraints.

Working with NVIDIA, Pinterest engineers began architectural changes to optimize their inference pipeline and recommender models to enable the transition from CPU to GPU cloud instances. The technology transition began late last year and required major changes to how the company manages workloads. The result is a 100x gain in inference efficiency on the same IT budget, meeting their goals.

We are starting to use really, really big models now. And that is where the GPU comes in - to help make these models possible, Eksombatchai said.

Tapping Into cuCollections Switching from CPUs to GPUs required rethinking its inference systems architecture. Among other issues, engineers had to change how they send workloads to their inference servers. Fortunately, there are tools to assist in making the transition easier.

The Pinterest inference server built for CPUs had to be altered because it was set up to send smaller batch sizes to its servers. GPUs can handle much larger workloads, so it's necessary to set up larger batch requests to increase efficiency.

One area where this comes into play is with its embedding table lookup module. Embedding tables are used to track interactions between various context-specific features and interests of user profiles. They can track where you navigate, and what people Pin on Pinterest, share or numerous other actions, helping refine predictions on what users might like to click on next.

They are used to incrementally learn user preference based on context in order to make better content recommendations to those using Pinterest. Its embedding table lookup module required two computation steps repeated hundreds of times because of the number of features tracked.

Pinterest engineers greatly reduced this number of operations using a GPU-accelerated concurrent hash table from NVIDIA cuCollections. And they set up a custom consolidated embedding lookup module so they could merge requests into a single lookup. Better results were seen immediately.

Using cuCollections helped us to remove bottlenecks, said Eksombatchai.

Enlisting CUDA Graphs Pinterest relied on CUDA Graphs to eliminate what was remaining of the small batch operations, further optimizing its inference models.

CUDA Graphs helps reduce the CPU interactions when launching on GPUs. They're designed to enable workloads to be defined as graphs rather than single operations. They provide a mechanism to launch multiple GPU operations through a single CPU operation, reducing CPU overheads.

Pinterest enlisted CUDA Graphs to represent the model inference process as a static graph of operation instead of as those individually scheduled. This enabled the computation to be handled as a single unit without any kernel launching overhead.

The company now supports CUDA Graph as a new backend of its model server. When a model is first loaded, the model server runs the model inference once to build the graph instance. This graph can then be run repeatedly in inference to show content on its app or site.

Implementing CUDA Graphs helped Pinterest to significantly reduce inference latency of its recommender models, according to its engineers.

GPUs have enabled Pinterest to do something that was impossible with CPUs on the same budget, and by doing this they can make changes that have a direct impact on various business metrics.

Learn about Pinterest's GPU-driven inference and optimizations at its GTC session, Serving 100x Bigger Recommender Models, and in the Pinterest Engineering blog.

Register for GTC, running Sept. 19-22, for free to attend sessions with NVIDIA and dozens of industry leaders.
LINK: https://blogs.nvidia.com/blog/2022/08/04/pinterest-gpu-acceleration-re...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

06/02/2026

Chris Myers Joins Net Insight as SVP of Sales, Americas

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

Sen. Cruz Announces Hearing on Broadcast Media Ownership Rules

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

NAB Show Relocates TV and Radio HQ To LVCC Central Hall

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

Sony Solutions Widely Deployed for Super Bowl LX in San Francisco

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

Telemundo Puerto Rico Launches In Mainland U.S.

Share Copy link Facebook X Linkedin Bluesky Email...

06/02/2026

RT announce details of live Winter Olympics 2026 coverage

The Winter Olympics 2026 in Milano Cortina officially get underway this evening (Friday 6 February) with the Opening Ceremony live on RT Player and RT News ch...

06/02/2026

February 05, 2026

How invisible vaccine scaffolding boosts HIV immune response Scripps Research scientists designed a DNA scaffold that carries HIV vaccine proteins into the bo...

05/02/2026

Tech Focus: Wireless Audio, Part 2 - RF Mics Have a Key Role in Sports Broadcasting

Three examples of how wireless microphones are deployed to bring fans in deep an...

05/02/2026

Samsung's Galaxy S25 Ultra Camera To Capture the Opening Ceremony

Broadcast coverage will include 25 cameras distributed around the venues, including to some athletes; Galaxy AI Interpreter will also be deployed The Opening C...

05/02/2026

Kiswe to Power Mountain West Conference's New Direct-to-Consumer Streaming Platform

Kiswe has partnered with the Mountain West Conference to power the next iteratio...

05/02/2026

NBCUniversal, Roku Launch the NBC Winter Olympics Experience

NBCUniversal and Roku announce the launch of the 2026 NBC Winter Olympics Experience, a destination delivering NBCUniversal's comprehensive CTV coverage of ...

05/02/2026

Vizrt Transforms Corporate Communications with AI-Powered Augmented Reality in Zoom

Vizrt, which specializes in live production technology as well as transforming v...

05/02/2026

Canon Intros RF7-14mm Fisheye Zoom, RF14mm Prime Lens

Canon USA has launched the RF7-14mm F2.8-3.5 L fisheye STM zoom lens and the RF14mm F1.4 L VCM prime lens. Building on Canon's legacy of innovative optics, ...

05/02/2026

UMass Lowell's Tsongas Center Upgrades with Ikegami UHK-X600 Cameras

The Paul E. Tsongas Center at UMass Lowell in Massachusetts has chosen Ikegami cameras for incorporation into its broadcast-quality television production facili...

05/02/2026

Exchange, NBCUniversal to Provide Service Members with Free Streaming of Winter Olympics

Once again, service members and Veterans worldwide will enjoy free access to NBC...

05/02/2026

Advanced Systems Group Appoints Industry Veteran Derek Pezzotti to Lead Sports and Venue Market Growth

Advanced Systems Group, LLC (ASG), a technology and services provider for media ...

05/02/2026

Broadcast Management Group Expands Management Team to Support Managed Services and Live Production Growth

Broadcast Management Group (BMG) is strengthening its leadership team to support...

05/02/2026

NBC Sports Selects Comcast Technology Solutions for Production of Winter Olympics

NBC Sports selects Comcast Technology Solutions (CTS) to provide multiscreen vid...

05/02/2026

AIM Sports Group Enhances AIM Sportsplex With Spiideo's Advanced Automated Video Technology

AIM Sports Group, a sports enterprise dedicated to elevating youth athletics thr...

05/02/2026

Inside the 2026 Milano Cortina IBC: How Tech Makes a Difference for Rightsholders, Fans, the Environment

Designed for efficient use of shared services and resources, the home of OBS pro...

05/02/2026

SVG Students To Watch: Brandon Malin, University of Michigan

The Yankees fan from Connecticut is executive producer of BTN StudentU for the Wolverines In the live-sports-video industry, the future is bright. Our series S...

05/02/2026

OBS Is Ready To Deliver for Milano Cortina Opening Ceremony

In an Olympic first, the ceremony will be held in four locations simultaneously...

05/02/2026

Remembering Charlie Jablonski, an Olympic Broadcasting Legend

Members of the broadcast and tech communities share four decades of memories of the technology leader The 2026 Milano Cortina Olympics are upon us, and every O...

05/02/2026

NBC Sports Has an Army of Technology Providers Supporting Winter Olympics Production

Key vendors include Appear, Audio-Technica, Canon, Chyron, Cisco, Comcast Techno...

05/02/2026

Spotify Partners With Bookshop.org and Debuts Page Match Feature to Bridge Physical, E-book, and Audio Formats

Since bringing audiobooks to Spotify in 2022, we've helped listeners discove...

05/02/2026

How to Use Page Match to Seamlessly Switch Between a Book and Its Audiobook on Spotify

Today, Spotify announced two new updates to give book lovers a more personalized...

05/02/2026

HC-130J Aircraft Enhances Coast Guard Readiness

A U.S. Coast Guard HC-130J aircraft during a test flight at L3Harris' facility in Waco, Texas....

05/02/2026

Al Seer Marine and L3Harris Deepen Strategic Agreement to Advance Maritime Unmanned Systems in the Middle East

Al Seer Marine and L3Harris have announced a strategic partnership combining UAE...

05/02/2026

Well-Thought-Out UX: The Quiet Power Behind Our Latest Improvements

AI should balance automation (replacing tasks) and augmentation (empowering humans). Automate the mundane and augment the creative by applying the right AI type...

05/02/2026

Football And Younger Viewers Drive Ad Supported TV Viewing To 2025 High, Nielsen's Q4 2025 Ad Supported Gauge Finds

During this period, streaming comprised the majority of ad supported TV (45.6%),...

05/02/2026

New Orlando TV Station Focuses On Puerto Rican Viewers

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Teads, Google TV Partner To Grow CTV HomeScreen Ad Availability

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Advanced Systems Group Appoints Industry Veteran Derek Pe...

Advanced Systems Group, LLC (ASG), a technology and services provider for media creatives and content owners, announced the appointment of Derek Pezzotti as Sen...

05/02/2026

Taurus Technologies Elevates Podcast Production with Brig...

Taurus Technologies, a Dallas-area professional AV systems integrator, has upgraded its in-house podcast studio with Brightline Lighting's AV/720 low-voltag...

05/02/2026

NBC Sports Selects Production Infrastructure and Signal P...

NBC Universal to Present XXV Olympic Winter Games Feb. 6-22 and Milan Cortina Paralympics March 6-15 NBC Sports to Utilize Grass Valley's Frame Rate Conver...

05/02/2026

Atomos Unveils All New Shogun AV-19

Atomos today announced Shogun AV-19, a rack-mountable, 19-inch 4K HDR monitor-recorder-switcher designed for professional live production, broadcast, and video ...

05/02/2026

Vizrt revolutionizes corporate communications with AI-pow...

Vizrt, the leader in live production technology, revolutionizing viewer experience and engagement, today introduces two brand new solutions in partnership with ...

05/02/2026

Appear Appoints Simon Frost as Chief Marketing Officer to...

Appear, a global leader in live production technology, today announced the appointment of Simon Frost in a newly created role as Chief Marketing Officer (CMO). ...

05/02/2026

Noah Chamis ICLS Illuminates Only Murders in the Building...

New York gaffer Noah Chamis, ICLS ( You Deserve Each Other , The Half of It , Project Runway ) practices a mix of technical precision and creative play in his...

05/02/2026

NBC Sports Deploys Audio-Technica Microphones for Winter Olympics

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

Hemisphere Media Group, Entravision Launch WAPA Orlando

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

SMT Providing Timing And Production Data Services for Winter Olympics

Share Copy link Facebook X Linkedin Bluesky Email...

05/02/2026

BBC Studios and UKTV appoint Karin Marelle as Global Head of Acquisitions

BBC Studios and UKTV have appointed Karin Marelle to lead their Global Acquisitions team, overseeing the sourcing of content across BBC Studios' global chan...

05/02/2026

The Miniature Wife, A Sky Exclusive comedy drama starring Elizabeth Banks and Matthew Macfadyen, to land on 9 April

Thursday 5 February 2026 The Miniature Wife, A Sky Exclusive comedy drama starr...

05/02/2026

Trailer Revealed for Sky Original Film FUZE starring Aaron Taylor-Johnson, Theo James, Gugu Mbatha-Raw and Sam Worthington

Thursday 5 February 2026 Trailer Revealed for Sky Original Film FUZE starring A...