
With persistence and the right tools, Deborah Tylor was able to do the impossible.
A data scientist, she was tasked to comb a 3+ terabyte dataset at the Internal Revenue Service for patterns that might help uncover fraud. But even when she let the job run all night on a large bank of CPU servers the data refused to line up.
She returned in the morning to find the job had failed, so she tried again. It failed again.
About that time, Nasheb Ismaily of Cloudera knocked on the door of Rahul Tikekar, manager of a technical team that supports data analysts at the IRS. The Cloudera solutions engineer asked if Tikekar's team had any uses for Cloudera Data Platform (CDP), implementing Apache Spark 3.0 software accelerated by GPUs.
I jumped at the opportunity, said Tikekar. We have NVIDIA graphics cards on standalone servers, but using Spark to run them on a distributed cluster had eluded us for a while, so this was perfect timing for us and Deb had the perfect use case, he said.
A Nerdy Knot Untied A quick test of the software immediately speeded up many parts of Tylor's work up to 5x with no code changes, but a few pieces still lagged.
Ismaily called in a team of data scientists at NVIDIA to examine the guts of the code. They quickly determined a few tasks with particularly gnarly data structures were still running on CPUs. They wrote code to handle those jobs and inserted it into Spark's software interface for RAPIDS, the open library for running data analytics on GPUs.
Tylor ran another test, and boom, it all went on the GPUs in a distributed Spark cluster and the speedup was remarkable - Deb's running the whole program on a four-node cluster right now, said Tikekar.
The Cloudera and NVIDIA integration will empower us to use data-driven insights to power mission-critical use cases, said Joe Ansaldi, technical branch chief of the research and applied analytics and statistics division at the IRS and Tikekar's boss.
We're currently implementing this integration, and already seeing over 20x speed improvements at half the cost for our data engineering and data science workflows, he added.
Spark 3.0 + GPUs = New Horizons The work promises several payoffs the IRS team is already exploring.
With a Spark cluster of GPU-powered servers, the group can accelerate all its current jobs and run others previously thought impractical. And those jobs can tackle big datasets the team has at its disposal.
Before Spark 3.0, this was not possible, but now we're upping the ante with GPUs and we can dream of solving problems that were once impossible, said Tikekar.
Charting a Course to AI The team plans to apply what it learned with its success in data preparation, the so-called extract/transform/load (ETL) work of data analytics. Its next big step is accelerating full-blown AI inference jobs.
The partnership with Cloudera and NVIDIA helped us harness GPUs in clusters. When such advances come along, it takes a while to realize their power and develop apps that can use them, so Deb is really charting a new course for us - she's definitely the hero of the story, Tikekar said.
Specifically, the team aims to provide this distributed Spark-GPU infrastructure to analysts. Together, they will build large deep learning neural networks to tackle natural language processing and other analytics jobs currently impossible on a single server.
Many Apps for Machine Learning It's the kind of transformation many enterprises are seeking today with machine learning.
My personal feeling is that machine learning brings an incredible potential to make things that were difficult to achieve possible, said Tikekar, a Ph.D. in computer science who spent a decade teaching at Southern Oregon University before joining the IRS more than 13 years ago.
For example, today we scan in forms and then apply optical character recognition to read pieces of them, but with AI we can do a much better job of reading forms and finding patterns that can help find ID theft or reduce waste - a lot of applications can benefit from AI in numerous ways, he added.
To learn more about accelerating Cloudera's CDP 7.1.6 with NVIDIA GPUs, watch a GTC talk (free to view with registration) from October 2020, when the two companies announced their partnership.
And view Cloudera's demo below of a 44x speed increase on a data science workload using NVIDIA GPUs and RAPIDS compared to CPUs.
Most recent headlines
09/11/2025
Dalet today announced a transformative leap forward for media operations: Agentic Artificial Intelligence (AI) that unifies the Dalet ecosystem under one natura...
17/10/2025
NEP Group Receives New Equity Investment From 26North Partners LP, Co-InvestorsCarlyle remains the largest shareholder as the company prepares for the futureBy ...
17/10/2025
Apple Lands Five-Year Deal for F1 Distribution in the U.S.Besides airing on Apple TV, the sport will be amplified on other Apple servicesBy Ken Kerschbaumer, Ed...
17/10/2025
SVG Sit-Down: Marshall Electronics' Bernie Keach on the Future of PTZ Camera...
17/10/2025
L2 Productions' REMI Facility in Austin Can Produce Content From AnywhereMusic festivals, sports events are produced via flypacks and remote control roomsBy...
17/10/2025
By Lucy Spicer
One of the most exciting things about the Sundance Film Festival...
17/10/2025
(L-R) Christopher Meyer, Addison Timlin, Cooper Raiff, Lili Reinhart, Alyah Chan...
17/10/2025
M sica e arte se uniram em uma noite especial na semana passada na ZIV Gallery, ...
17/10/2025
Music and art came together for one special night last week at ZIV Gallery, an i...
17/10/2025
Spotify and FC Barcelona are extending our partnership through 2030, continuing a collaboration that's redefining how fans, players, and artists connect. Th...
17/10/2025
MURRIETA, Calif. The Sports Fishing Championship (SFC) has deployed DigitalGlue's creative.space storage platform to streamline video production by centrali...
17/10/2025
BELLEVUE, Wash. Football continued to cement its reputation as a bulwark of TV advertising in Q3 2025 with new data from iSpot that showed both the NFL and coll...
17/10/2025
The Sports Fishing Championship (SFC), the premier competitive saltwater fishing series, has transformed its production workflow by adopting creative.space, the...
17/10/2025
QuickLink, a leading provider of award-winning multi-camera video productions and remote contribution solutions, announces the release of StudioPro Version 4, ...
17/10/2025
Although the annual Grammy Awards celebration is best known for recognizing achievements in the recording industry, the show often proves a visual spectacle as ...
17/10/2025
OpenDrives, Inc., a leading provider of software-defined data storage and data services, has promoted Alex Dunfey to Chief Technology Officer (CTO) from his for...
17/10/2025
The University of Arizona (UofA) has significantly upgraded its broadcast communication infrastructure with the integration of Riedel Communications' advanc...
17/10/2025
Harmonic (NASDAQ: HLIT) today announced that New England Sports Network (NESN), owned by Fenway Sports Group and Delaware North, has selected Harmonic as its en...
17/10/2025
Austin PBS has recently upgraded its facility-wide communications infrastructure, deploying Clear-Com 's Eclipse HX, FreeSpeak II beltpacks, and V-Series ...
17/10/2025
ZEISS announces an open call for the closed BETA testing phase of CinCraft Virtual Lens Technology, the innovative digital tool that brings authentic lens chara...
17/10/2025
Situated in the town of Kokkola, Centria University of Applied Sciences offers higher education across five core fields: engineering, business, social and healt...
17/10/2025
Public information channel in Georgia, USA, to implement a powerful, simple, and cost-effective playout automation platform.
Pebble, the leading automation, co...
17/10/2025
HBO Max is reporting that it has launched in 15 new markets, including Bangladesh, Cambodia, Macau, Pakistan, Sri Lanka and Ukraine, boosting the streaming serv...
17/10/2025
Netflix said it will make a major push into video podcasts, inking a wide-ranging deal with Spotify through which it will offer 16 podcasts in the U.S. starting...
17/10/2025
Lexington, Ky. As part of a push to highlight its advanced advertising capabilities, Viamedia has launched a new AI-powered ad tech platform and officially rebr...
17/10/2025
NEW YORK QuickLink has announced the release of StudioPro Version 4, which the company is calling the most significant upgrade yet to its flagship video product...
17/10/2025
NEW YORK and CUPERTINO, Calif. Apple and NBCUniversal said they will sell Apple TV and Peacock streaming bundles to U.S. subscribers starting Oct. 20....
17/10/2025
Q&A with Boston Conservatory Choral Conductor Stephen Spinelli How his research into the lost manuscripts of composer Florence Price led to a Grammy-winning c...
17/10/2025
Gexcon is a trusted safety and risk management partner for complex, high hazard environments. ICG has been a dedicated marketing partner to Gexcon since 2018, b...
17/10/2025
Here is your host, Patrick Kielty!
After an incredible breakthrough year, Kingf...
16/10/2025
SVG Sit-Down: FUJIFILM Execs on GFX ETERNA 55 Camera, Importance of Shallow-Dept...
16/10/2025
Squash's Most Ambitious Broadcast Production To Be Deployed at Comcast Busin...
16/10/2025
Main Street Sports Group Inks Deal With Omaha Productions, Launches Original-Con...
16/10/2025
A Historic Precursor? FIFA, HBS, DAZN Offer an Inside Look at Production of FIFA...
16/10/2025
Prime Video Offers Sneak Peak at New NBA on Prime StudioThe massive 13,000-sq-ft, two-story studio features a LED regulation half court and hoopBy Jason Dachman...
16/10/2025
SVG Remote Production Forum Draws Record Crowd for Visit to PGA TOUR Studios, De...
16/10/2025
BitFire's Ben Grafchik on How Growing Cloud Workflows Are Impacting the Live...
16/10/2025
AI technology is advancing quickly, bringing both new creative possibilities and...
16/10/2025
In 2017, Imani Ellis launched CultureCon, a conference that's become a must-attend event for more than 10,000 diverse creatives and Black professionals to c...
16/10/2025
It might still be a little early to break out the tinsel and mistletoe, but Spotify's already queuing up some holiday magic. This year's Spotify Singles...
16/10/2025
Earlier this year, our in-house publishing imprint, Spotify Audiobooks, put out ...
16/10/2025
VAMPIRE has been integrated onto GM Defenses Infantry Squad Vehicle (ISV), providing a mobile solution to effectively and affordably counter small drone threat...
16/10/2025
The AgilePod mounted on the host aircraft....
16/10/2025
60% say infotainment systems are a critical purchasing or leasing consideration,...
16/10/2025
NEW YORK The Broadcasters Foundation of America has launched its 2025 Year-End Giving Campaign, which seeks to raise donations from tax-deductible personal and ...
16/10/2025
Roku has added several new features to its user interface and operating system meant to make it easier to discover and personalize content, the company said....
16/10/2025
NEW YORK CNN has announced that its All Access subscription tier will launch in the U.S. on Oct. 28, providing audiences with a centralized destination for CNN&...
16/10/2025
Student Spotlight: Alan Villanueva The graduate student and saxophonist, who performed on Natalia Lafourcades Latin Grammy-nominated Live at Carnegie Hall alb...
16/10/2025
The XR Sports Alliance (XRSA) has announced the third cohort of members. The new members include Maple Leafs Sports & Entertainment (MLSE), the Vegas Golden Kni...
16/10/2025
NEW YORK The Broadcasters Foundation of America has launched its 2025 Year-End Giving Campaign which seeks to raise donations from tax-deductible personal and c...