
With persistence and the right tools, Deborah Tylor was able to do the impossible.
A data scientist, she was tasked to comb a 3+ terabyte dataset at the Internal Revenue Service for patterns that might help uncover fraud. But even when she let the job run all night on a large bank of CPU servers the data refused to line up.
She returned in the morning to find the job had failed, so she tried again. It failed again.
About that time, Nasheb Ismaily of Cloudera knocked on the door of Rahul Tikekar, manager of a technical team that supports data analysts at the IRS. The Cloudera solutions engineer asked if Tikekar's team had any uses for Cloudera Data Platform (CDP), implementing Apache Spark 3.0 software accelerated by GPUs.
I jumped at the opportunity, said Tikekar. We have NVIDIA graphics cards on standalone servers, but using Spark to run them on a distributed cluster had eluded us for a while, so this was perfect timing for us and Deb had the perfect use case, he said.
A Nerdy Knot Untied A quick test of the software immediately speeded up many parts of Tylor's work up to 5x with no code changes, but a few pieces still lagged.
Ismaily called in a team of data scientists at NVIDIA to examine the guts of the code. They quickly determined a few tasks with particularly gnarly data structures were still running on CPUs. They wrote code to handle those jobs and inserted it into Spark's software interface for RAPIDS, the open library for running data analytics on GPUs.
Tylor ran another test, and boom, it all went on the GPUs in a distributed Spark cluster and the speedup was remarkable - Deb's running the whole program on a four-node cluster right now, said Tikekar.
The Cloudera and NVIDIA integration will empower us to use data-driven insights to power mission-critical use cases, said Joe Ansaldi, technical branch chief of the research and applied analytics and statistics division at the IRS and Tikekar's boss.
We're currently implementing this integration, and already seeing over 20x speed improvements at half the cost for our data engineering and data science workflows, he added.
Spark 3.0 + GPUs = New Horizons The work promises several payoffs the IRS team is already exploring.
With a Spark cluster of GPU-powered servers, the group can accelerate all its current jobs and run others previously thought impractical. And those jobs can tackle big datasets the team has at its disposal.
Before Spark 3.0, this was not possible, but now we're upping the ante with GPUs and we can dream of solving problems that were once impossible, said Tikekar.
Charting a Course to AI The team plans to apply what it learned with its success in data preparation, the so-called extract/transform/load (ETL) work of data analytics. Its next big step is accelerating full-blown AI inference jobs.
The partnership with Cloudera and NVIDIA helped us harness GPUs in clusters. When such advances come along, it takes a while to realize their power and develop apps that can use them, so Deb is really charting a new course for us - she's definitely the hero of the story, Tikekar said.
Specifically, the team aims to provide this distributed Spark-GPU infrastructure to analysts. Together, they will build large deep learning neural networks to tackle natural language processing and other analytics jobs currently impossible on a single server.
Many Apps for Machine Learning It's the kind of transformation many enterprises are seeking today with machine learning.
My personal feeling is that machine learning brings an incredible potential to make things that were difficult to achieve possible, said Tikekar, a Ph.D. in computer science who spent a decade teaching at Southern Oregon University before joining the IRS more than 13 years ago.
For example, today we scan in forms and then apply optical character recognition to read pieces of them, but with AI we can do a much better job of reading forms and finding patterns that can help find ID theft or reduce waste - a lot of applications can benefit from AI in numerous ways, he added.
To learn more about accelerating Cloudera's CDP 7.1.6 with NVIDIA GPUs, watch a GTC talk (free to view with registration) from October 2020, when the two companies announced their partnership.
And view Cloudera's demo below of a 44x speed increase on a data science workload using NVIDIA GPUs and RAPIDS compared to CPUs.
Most recent headlines
06/10/2025
France T l visions, France's leading broadcaster, has received the 2025 EBU ...
04/09/2025
Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...
15/06/2025
July 2025 in Dublin, Berlin, Amsterdam & London
Photo: Thea Martre
Music Production for Women (MPW) have announced that they will be running a series of fo...
15/06/2025
Composer/producer launches free virtual instruments
Sulcata Sound is the latest venture of Jason Graves, a two-time British Academy Award-winnning composer,...
14/06/2025
NEW YORK Pluto TV and the All Womens Sports Network have launched a free ad-supported streaming TV (FAST) AWSN channel in the U.S., Canada, the U.K. and the Nor...
14/06/2025
NEW YORK and CINCINNATI E.W. Scripps has announced a new, multiyear agreement with the WNBA that will continue Ions regular-season coverage of the league on Fri...
14/06/2025
WASHINGTON The National Association of Broadcasters highlighted the hidden importance of spectrum in the production of major sporting events and described wha...
14/06/2025
WASHINGTON Sunsetting ATSC 1.0, expanding business opportunities for NextGen Broadcast and increasing international adoption of the ATSC 3.0 standard were top o...
14/06/2025
SAN FRANCISCO Samba TV and Acxiom have announced that they will dramatically expand their longstanding relationship....
14/06/2025
July 2025 in Dublin, Berlin, Amsterdam & London
Photo: Thea Martre
Music Production for Women (MPW) have announced that they will be running a series of fo...
14/06/2025
San Francisco State University's School of Cinema Uses Blackmagic Design
Brie Clayton June 13, 2025
0 Comments
More than 40 Blackmagic Design came...
14/06/2025
Boris FX Mocha Pro Adds New AI Tools To Tackle VFX Tasks Fast
Jessie Electa Petrov June 13, 2025
0 Comments
The 2025.5 release helps artists work more...
14/06/2025
AJA Debuts DRM2-Plus Mini-Converter Frame at InfoComm 2025
Brie Clayton June 13, 2025
0 Comments
Next-gen frame addresses diverse rackmount needs wit...
13/06/2025
(L-R) Lindsay Utz, Michelle Walshe, and The Right Honourable Dame Jacinda Ardern attend the 2025 Sundance Film Festival premiere of Prime Minister at Eccles T...
13/06/2025
Photo credit: Atsushi Nishijima
If you're a true lover of rom-coms, chances...
13/06/2025
Pure Drama and Fierce Rivalries set to dominate the world's most iconic spor...
13/06/2025
Johannesburg, 12 June 2025 - The National Film and Video Foundation (NFVF), an a...
13/06/2025
ABILENE. Texas A severe storm knocked down the tower and severely damaged the news studio and main facility of Sinclair-owned KTXS here on Sunday, June 8....
13/06/2025
Berklee's Music Business/Management Department Recognized by the Music Biz A...
13/06/2025
WASHINGTON The ATSC, the Broadcast Standards Association, honored veteran technologist Aldo Cugnini and Clarence Hau, Senior Vice President of Standards, Policy...
13/06/2025
(Editor's note: The 2025 UFL Championship Game between the D.C. Defenders and Michigan Panthers kicks off Saturday, June 14, at 8 p.m. Eastern. The game wil...
13/06/2025
New iPad/iPhone synth App announced
Following on from last year's release of Gradient Synth - which reached #6 on the App Store's Paid Music charts ...
13/06/2025
LONDON Warner Bros. Discovery has announced that HBO Max will launch direct-to-consumer in multiple new countries this July as the streamer becomes available in...
13/06/2025
AI voice transcription and captioning platform Verbit has added a new feature to its Captivate ASR solution the ability to identify specific features in automat...
13/06/2025
WASHINGTON Federal Communications Commission member Anna Gomez has wrapped up two weeks in California visiting broadcasters, television studio executives, enter...
13/06/2025
WASHINGTON The U.S. House of Representatives voted mostly along party lines to approve a rescission package that would cancel $9.4 billion in previously approve...
13/06/2025
At InfoComm 2025, AJA Video Systems announced DRM2-Plus, an intuitive, high-capacity 3RU frame that can neatly house up to 24 AJA Mini-Converters. Tailored to s...
13/06/2025
Cinema advertising leader to leverage AOS and suite of AI-enabled solutions to optimize forecasting, yield management, and streamlined ad sales and operations a...
13/06/2025
Manfrotto has launched the ONE Hybrid Tripod, a new support system designed specifically for professional content creators working with mirrorless cameras acros...
13/06/2025
Leading video software provider, Synamedia, today announced that its Media Edge Gateway (MEG), an ATSC 3.0 software-based IRD, now supports Device Security requ...
13/06/2025
LiveU, the global leader in live IP-video contribution, production and distribution solutions, is deepening its commitment to the German-speaking market with th...
13/06/2025
Chaos, the leader in architectural visualisation software, today announces Chaos Corona 13, giving archviz designers new ways to add eye-catching style and flai...
13/06/2025
PALI's Nena Music Video Shot with Blackmagic Design
Brie Clayton June 12, 2025
0 Comments
Blackmagic Cinema Camera 6K and DaVinci Resolve Studio b...
13/06/2025
OddBeast Powers Up iRobot's Newest Roombas with Suite of CGI Launch Assets
Brie Clayton June 12, 2025
0 Comments
The motion design and production ...
13/06/2025
On Chick Coreas Birthday, a Newly Uncovered Archival Release The Visitors, composed by Corea and performed by vibraphonist Gary Burton and pianist Kirill Gers...
13/06/2025
In fulfilment of a recommendation by the Government's Expert Advisory Commit...
13/06/2025
SVG Sit-Down: Backblaze's Gleb Budman Talks Products, Partnerships, and the ...
13/06/2025
SVG Sit-Down: DAZN's Walker Jacobs Calls Streaming the FIFA Club World Cup ...
13/06/2025
New Sponsor Spotlight: Vecima Networks' Paul Strickland on How Improving QoE...
13/06/2025
Pitch Perspective: Where's Next for Specialty Cameras in Soccer? Leaders from Sky Austria and ACS discuss the possibilities of camera placement pitchside B...
13/06/2025
Premiership Rugby Final 2025: Vintage clash between Bath and Leicester gets full...
13/06/2025
Premiership Rugby Final 2025: TNT Sports gears up for Bath vs Leicester battle w...
13/06/2025
NCAA Men's College World Series: ESPN Adds Two-Point SupraCam, Invests in Ne...
13/06/2025
New FSWX signal and spectrum analyzer with novel architecture overcomes limits o...
13/06/2025
Apple today announced the addition of iPad to Self Service Repair, providing iPad owners with access to repair manuals, genuine Apple parts, Apple Diagnostics t...
13/06/2025
CUPERTINO, CALIFORNIA Apple today previewed iOS 26, a major update that brings a beautiful new design, intelligent experiences, and improvements to the apps use...
13/06/2025
At Apple's Worldwide Developers Conference (WWDC), Apple unveiled Apple Games, an all-new destination designed to help players jump back into the games they...
13/06/2025
Industrial AI isn't slowing down. Germany is ready.
Following London Tech Week and GTC Paris at VivaTech, NVIDIA founder and CEO Jensen Huang's Europea...
12/06/2025
In 2018, Spotify launched Heart & Soul, a mental health initiative developed to ...
12/06/2025
50 Years Strong: SBS and NITV Supercharge NAIDOC Week 2025 in a joint 50th celeb...