
Of GTC's 900+ sessions, the most wildly popular was a conversation hosted by NVIDIA founder and CEO Jensen Huang with seven of the authors of the legendary research paper that introduced the aptly named transformer - a neural network architecture that went on to change the deep learning landscape and enable today's era of generative AI.
Everything that we're enjoying today can be traced back to that moment, Huang said to a packed room with hundreds of attendees, who heard him speak with the authors of Attention Is All You Need.
Sharing the stage for the first time, the research luminaries reflected on the factors that led to their original paper, which has been cited more than 100,000 times since it was first published and presented at the NeurIPS AI conference. They also discussed their latest projects and offered insights into future directions for the field of generative AI.
While they started as Google researchers, the collaborators are now spread across the industry, most as founders of their own AI companies.
We have a whole industry that is grateful for the work that you guys did, Huang said.
From L to R: Lukasz Kaiser, Noam Shazeer, Aidan Gomez, Jensen Huang, Llion Jones, Jakob Uszkoreit, Ashish Vaswani and Illia Polosukhin. Origins of the Transformer Model The research team initially sought to overcome the limitations of recurrent neural networks, or RNNs, which were then the state of the art for processing language data.
Noam Shazeer, cofounder and CEO of Character.AI, compared RNNs to the steam engine and transformers to the improved efficiency of internal combustion.
We could have done the industrial revolution on the steam engine, but it would just have been a pain, he said. Things went way, way better with internal combustion.
Now we're just waiting for the fusion, quipped Illia Polosukhin, cofounder of blockchain company NEAR Protocol.
The paper's title came from a realization that attention mechanisms - an element of neural networks that enable them to determine the relationship between different parts of input data - were the most critical component of their model's performance.
We had very recently started throwing bits of the model away, just to see how much worse it would get. And to our surprise it started getting better, said Llion Jones, cofounder and chief technology officer at Sakana AI.
Having a name as general as transformers spoke to the team's ambitions to build AI models that could process and transform every data type - including text, images, audio, tensors and biological data.
That North Star, it was there on day zero, and so it's been really exciting and gratifying to watch that come to fruition, said Aidan Gomez, cofounder and CEO of Cohere. We're actually seeing it happen now.
Packed house at the San Jose Convention Center. Envisioning the Road Ahead Adaptive computation, where a model adjusts how much computing power is used based on the complexity of a given problem, is a key factor the researchers see improving in future AI models.
It's really about spending the right amount of effort and ultimately energy on a given problem, said Jakob Uszkoreit, cofounder and CEO of biological software company Inceptive. You don't want to spend too much on a problem that's easy or too little on a problem that's hard.
A math problem like two plus two, for example, shouldn't be run through a trillion-parameter transformer model - it should run on a basic calculator, the group agreed.
They're also looking forward to the next generation of AI models.
I think the world needs something better than the transformer, said Gomez. I think all of us here hope it gets succeeded by something that will carry us to a new plateau of performance.
You don't want to miss these next 10 years, Huang said. Unbelievable new capabilities will be invented.
The conversation concluded with Huang presenting each researcher with a framed cover plate of the NVIDIA DGX-1 AI supercomputer, signed with the message, You transformed the world.
Jensen presents lead author Ashish Vaswani with a signed DGX-1 cover. There's still time to catch the session replay by registering for a virtual GTC pass - it's free.
To discover the latest in generative AI, watch Huang's GTC keynote address:
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
04/08/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
04/07/2026
April 7 2026, 19:00 (PDT) Detective Conan: Fallen Angel of the Highway Opens in...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
21/04/2026
Layercake Deepens Bitmovin Integration to Power End-to-End Media Orchestration w...
21/04/2026
Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse...
21/04/2026
On-Premise AI Suite from Studio Network Solutions Debuts at NAB Show 2026
Melanie Ciotti April 21, 2026
0 Comments
Unlimited processing, no cloud depe...
21/04/2026
London, 21 April 2026 IBC today announced the appointment of Tim Banham as its first Chief Commercial Officer (CCO), a newly created role that reflects the or...
21/04/2026
Motion Design Tools - April 2026
Roland Kahlenberg April 21, 2026
0 Comments
Within 2 days, Maxon and Canva announced pro-level motion design apps - A...
21/04/2026
Chaos and Zero Density to Showcase Real-Time Ray Tracing for Virtual Studios and...
21/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
21/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
21/04/2026
Cinema 4D brings professional 3D workflows to iPad. The return of Autograph now free for individual users. ZBrush expands to Windows on Arm. See it all at NAB...
21/04/2026
Software version 1.6 extends enterprise functionality to place Buttons at the heart of media operations at any scale
Bitfocus, the Norwegian software develope...
21/04/2026
Cobalt Digital Announces Launch of blueCORE at NAB Show 2026: Compact Powerhouse Signal Processors for SDI and ST 2110 Workflows
Compact, multi-function stan...
21/04/2026
Applications open for 2026 AISF and Screen Australia Writer/Director Virtual Ses...
21/04/2026
Summer is nearly here and Super Garden is returning to our screens to spark some gardening inspiration. The new series kicks off on Thursday 23 April at 7pm on ...
20/04/2026
At the 2026 NAB Show, Sony is showcasing a broad slate of innovations across liv...
20/04/2026
At the 2026 NAB Show, Canon is doubling down on its commitment to live sports pr...
20/04/2026
Fujifilm is sharpening its focus on core broadcast production with a new wave of...
20/04/2026
This upcoming summer in North America is going to be a busy one. The 2026 FIFA M...
20/04/2026
Glookast (Booth W1661) announced a series of product updates at NAB Show 2026, c...
20/04/2026
Matrox Video and Amagi announced a collaboration to integrate the Matrox ORIGIN ...
20/04/2026
Riedel Communications (Booth C4908) announced that the Asociaci n del F tbol Arg...
20/04/2026
Ikegami (Booth C3819) announced the VFE-P07D monocular OLED viewfinder at NAB Sh...
20/04/2026
International Association of MediaTech (IAMT), formerly known as IABM, announced...
20/04/2026
Harmonic (Booth W2831) announced that DIRECTV is updating its US direct-to-home (DTH) video platform using Harmonic's VOS Media Software.
The deployment is...
20/04/2026
Wasabi Technologies announced that it has acquired the Lyve Cloud business from Seagate Technology. As part of the agreement, Seagate received equity in Wasabi ...
20/04/2026
EVS (Booth N1841) has launched Choreon, a robotics controller for media producti...
20/04/2026
The NAB Show is in full swing, and the SVG and SVG Europe editorial teams are chasing down the hottest stories from all over the Las Vegas Convention Center. He...
20/04/2026
Skyline Communications announced the availability of its DataMiner xOps platform...
20/04/2026
Studio Network Solutions (Booth N1129) introduced a set of new products at NAB S...
20/04/2026
Dell Technologies is showcasing its Dell AI Data Platform with NVIDIA at NAB Sho...
20/04/2026
Blackmagic Design has announced Fairlight Live, a software-based live audio mixer with SMPTE 2110 support and spatial audio mixing. A public beta is available n...
20/04/2026
At the 2026 NAB Show in Las Vegas, Imagine Communications VP of Sales, Sports an...
20/04/2026
At the 2026 NAB Show in Las Vegas, LiveU Senior Director of Sales, Sports Philli...
20/04/2026
A song that perfectly captures a moment is magic. But when you uncover the story behind it, who made it, what inspired it, and the meaning woven into the lyrics...
20/04/2026
Ultra-compact 32-bit recorder set for launch
Deity Microphones will soon be launching a new 32-bit six-track recorder that's been designed with producti...
20/04/2026
Uncoming lightweight shotgun mic announced
Production-sound experts Lectrosonics have recently announced the upcoming launch of a new lightweight shotgun mi...
20/04/2026
New 20-minute documentary explores iconic preamp
In 2025, Focusrite commissioned a new short-form documentary with filmmaker Chris Mayes-Wright - the direct...
20/04/2026
Turn quick sketches into real drum grooves
Sampleson have been experimenting with assitive production tools recently, and their latest creation aims to make...
20/04/2026
Rohde & Schwarz rolls out its full ARDRONIS counter UAS suite in a demonstration...
20/04/2026
L3Harris delivers integrated communications, navigation and C4ISR capabilities that empower the U.S. Coast Guard to protect Americas maritime interests and resp...
20/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
20/04/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...