
Of GTC's 900+ sessions, the most wildly popular was a conversation hosted by NVIDIA founder and CEO Jensen Huang with seven of the authors of the legendary research paper that introduced the aptly named transformer - a neural network architecture that went on to change the deep learning landscape and enable today's era of generative AI.
Everything that we're enjoying today can be traced back to that moment, Huang said to a packed room with hundreds of attendees, who heard him speak with the authors of Attention Is All You Need.
Sharing the stage for the first time, the research luminaries reflected on the factors that led to their original paper, which has been cited more than 100,000 times since it was first published and presented at the NeurIPS AI conference. They also discussed their latest projects and offered insights into future directions for the field of generative AI.
While they started as Google researchers, the collaborators are now spread across the industry, most as founders of their own AI companies.
We have a whole industry that is grateful for the work that you guys did, Huang said.
From L to R: Lukasz Kaiser, Noam Shazeer, Aidan Gomez, Jensen Huang, Llion Jones, Jakob Uszkoreit, Ashish Vaswani and Illia Polosukhin. Origins of the Transformer Model The research team initially sought to overcome the limitations of recurrent neural networks, or RNNs, which were then the state of the art for processing language data.
Noam Shazeer, cofounder and CEO of Character.AI, compared RNNs to the steam engine and transformers to the improved efficiency of internal combustion.
We could have done the industrial revolution on the steam engine, but it would just have been a pain, he said. Things went way, way better with internal combustion.
Now we're just waiting for the fusion, quipped Illia Polosukhin, cofounder of blockchain company NEAR Protocol.
The paper's title came from a realization that attention mechanisms - an element of neural networks that enable them to determine the relationship between different parts of input data - were the most critical component of their model's performance.
We had very recently started throwing bits of the model away, just to see how much worse it would get. And to our surprise it started getting better, said Llion Jones, cofounder and chief technology officer at Sakana AI.
Having a name as general as transformers spoke to the team's ambitions to build AI models that could process and transform every data type - including text, images, audio, tensors and biological data.
That North Star, it was there on day zero, and so it's been really exciting and gratifying to watch that come to fruition, said Aidan Gomez, cofounder and CEO of Cohere. We're actually seeing it happen now.
Packed house at the San Jose Convention Center. Envisioning the Road Ahead Adaptive computation, where a model adjusts how much computing power is used based on the complexity of a given problem, is a key factor the researchers see improving in future AI models.
It's really about spending the right amount of effort and ultimately energy on a given problem, said Jakob Uszkoreit, cofounder and CEO of biological software company Inceptive. You don't want to spend too much on a problem that's easy or too little on a problem that's hard.
A math problem like two plus two, for example, shouldn't be run through a trillion-parameter transformer model - it should run on a basic calculator, the group agreed.
They're also looking forward to the next generation of AI models.
I think the world needs something better than the transformer, said Gomez. I think all of us here hope it gets succeeded by something that will carry us to a new plateau of performance.
You don't want to miss these next 10 years, Huang said. Unbelievable new capabilities will be invented.
The conversation concluded with Huang presenting each researcher with a framed cover plate of the NVIDIA DGX-1 AI supercomputer, signed with the message, You transformed the world.
Jensen presents lead author Ashish Vaswani with a signed DGX-1 cover. There's still time to catch the session replay by registering for a virtual GTC pass - it's free.
To discover the latest in generative AI, watch Huang's GTC keynote address:
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
14/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
14/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
14/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
14/02/2026
Boston Conservatory Orchestra Helps Peter and Leonardo Dugan Complete Their Dre...
13/02/2026
Olympic Broadcasting Services (OBS) has provided an update on its adoption of the cloud as it continues on its journey to fully migrate to IT-based systems by 2...
13/02/2026
France T l visions has successfully launched France 2 UHD featuring Dolby Vision...
13/02/2026
Partnering with Worldwide Olympic Partner TCL, OBS deploys connected Athlete Mom...
13/02/2026
The men's figure skating long-form program is tonight, and it promises to be an exciting night for fans in the stands, fans at home, and even the production...
13/02/2026
With new partnership between the league and NBC, workflows distinguish more between live, broadcast sound
There'll be a lot new for the 75th NBA All-Star W...
13/02/2026
In-venue and creative video staffers at the professional and collegiate level have one major thing in common: the intensity and attention to detail ramps up dur...
13/02/2026
Teradek announces the launch of RF-X Auto Switcher, a revolutionary appliance designed to deliver flawless, uncompromised signal integrity for the world's m...
13/02/2026
Globecast and Synamedia announces that Pitch International (Pitch), the leading London-based sports marketing agency, has gone live with cloud-based distributi...
13/02/2026
Ratings Roundup is a rundown of recent rating news and is derived from press rel...
13/02/2026
Far from the action in the snow and on the ice, the team controls the production...
13/02/2026
The Daytona 500 is called The Super Bowl of Racing for a reason. Whether it's the culmination to five days of action on the track, the sheer size and scop...
13/02/2026
For the Milano Cortina Games, Olympic Broadcasting Services (OBS) is delivering more than 6,500 hours of content, with more than 900 hours of live action, sprea...
13/02/2026
After 24-year absence, NBC Sports returns to NBA All-Star Weekend with unique ca...
13/02/2026
By Jessica Herndon
We may have just wrapped an unforgettable 2026 Sundance Film...
13/02/2026
By Jessica Herndon
One of the most exciting things about the Sundance Film Fest...
13/02/2026
This Wednesday in Los Angeles, Spotify brought together a group of podcast creat...
13/02/2026
Yesterday, Spotify and LoveShackFancy hosted a Galentine's and Gents Lunch a...
13/02/2026
The upgrade to a Project 25 network provides state agencies communicating on the Statewide Law Enforcement Radio System flexibility to tailor the network to the...
13/02/2026
Riedel Communications has officially opened a new office in Kuala Lumpur, Malaysia, marking a strategic expansion of its global Customer Success and IT software...
13/02/2026
Two of ES Broadcast Hire's longest-serving employees recently celebrated a decade working for the company.
Annie Breislin, Operations Manager, and Charles ...
13/02/2026
Disguise, the award-winning technology company powering global experiences, today unveils a new 8,000-square-foot office and Experience Center in Atlanta, creat...
13/02/2026
At BSC Expo 2026, Mavis announced full support for the Accsoon SeeMo series of iOS camera adapters across Mavis Camera and Mavis Monitor apps. This new integrat...
13/02/2026
Executing technically ambitious live streams, virtual productions, and immersive media today requires talent, creativity, and the right supporting technology. L...
13/02/2026
Michal Miskin-Amir, Jonathan Stanton and Bobby Bond to lead technical advances amid surge in demand for LTN's IP video transport services as satellite capac...
13/02/2026
Grass Valley, the pioneering media and entertainment technology innovator, has won a competitive NATO-wide tender to provide the new camera system for NATO'...
13/02/2026
Wireless IP intercom underpins agile, multi-location live production workflows
Digital Azul, the independent production powerhouse specialising in complex liv...
13/02/2026
Actus Digital, a LiveU company, will unveil major new enhancements to its Actus X Intelligent Monitoring Platform at NAB Show (LiveU booth N1740), reinforcing i...
13/02/2026
Globecast, a worldwide leader in broadcast services, and leading video software provider, Synamedia, today announced that Pitch International (Pitch), the leadi...
13/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
13/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
13/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
13/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
13/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
13/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
13/02/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...
13/02/2026
What can I watch on UKTV this week?What can I stream on U this week?
This guide highlights romantic dramas for Valentine's Day, alternative relationship t...
13/02/2026
New RT series tells stranger-than-fiction stories of Irish con artists
Swindlers airs Wednesday 18 February, 9.35pm on RT One and RT Player
Swindlers, a...
12/02/2026
Chyron unveils PRIME 5.3, the latest software release of the company's powerful engine for live production graphics. PRIME 5.3 delivers the first official i...
12/02/2026
The vendor's VP of Product Management explains how quality assurance, monito...
12/02/2026
LTN announces the appointment of three experienced executives to lead its new Technology organization: Michal Miskin-Amir as EVP and Head of Technology, Jonatha...
12/02/2026
Riedel Communications has officially opened a new office in Kuala Lumpur, Malays...