Sony Pixel Power calrec Sony

You Transformed the World,' NVIDIA CEO Tells Researchers Behind Landmark AI Paper

21/03/2024

Of GTC's 900+ sessions, the most wildly popular was a conversation hosted by NVIDIA founder and CEO Jensen Huang with seven of the authors of the legendary research paper that introduced the aptly named transformer - a neural network architecture that went on to change the deep learning landscape and enable today's era of generative AI.

Everything that we're enjoying today can be traced back to that moment, Huang said to a packed room with hundreds of attendees, who heard him speak with the authors of Attention Is All You Need.

Sharing the stage for the first time, the research luminaries reflected on the factors that led to their original paper, which has been cited more than 100,000 times since it was first published and presented at the NeurIPS AI conference. They also discussed their latest projects and offered insights into future directions for the field of generative AI.

While they started as Google researchers, the collaborators are now spread across the industry, most as founders of their own AI companies.

We have a whole industry that is grateful for the work that you guys did, Huang said.

From L to R: Lukasz Kaiser, Noam Shazeer, Aidan Gomez, Jensen Huang, Llion Jones, Jakob Uszkoreit, Ashish Vaswani and Illia Polosukhin. Origins of the Transformer Model The research team initially sought to overcome the limitations of recurrent neural networks, or RNNs, which were then the state of the art for processing language data.

Noam Shazeer, cofounder and CEO of Character.AI, compared RNNs to the steam engine and transformers to the improved efficiency of internal combustion.

We could have done the industrial revolution on the steam engine, but it would just have been a pain, he said. Things went way, way better with internal combustion.

Now we're just waiting for the fusion, quipped Illia Polosukhin, cofounder of blockchain company NEAR Protocol.

The paper's title came from a realization that attention mechanisms - an element of neural networks that enable them to determine the relationship between different parts of input data - were the most critical component of their model's performance.

We had very recently started throwing bits of the model away, just to see how much worse it would get. And to our surprise it started getting better, said Llion Jones, cofounder and chief technology officer at Sakana AI.

Having a name as general as transformers spoke to the team's ambitions to build AI models that could process and transform every data type - including text, images, audio, tensors and biological data.

That North Star, it was there on day zero, and so it's been really exciting and gratifying to watch that come to fruition, said Aidan Gomez, cofounder and CEO of Cohere. We're actually seeing it happen now.

Packed house at the San Jose Convention Center. Envisioning the Road Ahead Adaptive computation, where a model adjusts how much computing power is used based on the complexity of a given problem, is a key factor the researchers see improving in future AI models.

It's really about spending the right amount of effort and ultimately energy on a given problem, said Jakob Uszkoreit, cofounder and CEO of biological software company Inceptive. You don't want to spend too much on a problem that's easy or too little on a problem that's hard.

A math problem like two plus two, for example, shouldn't be run through a trillion-parameter transformer model - it should run on a basic calculator, the group agreed.

They're also looking forward to the next generation of AI models.

I think the world needs something better than the transformer, said Gomez. I think all of us here hope it gets succeeded by something that will carry us to a new plateau of performance.

You don't want to miss these next 10 years, Huang said. Unbelievable new capabilities will be invented.

The conversation concluded with Huang presenting each researcher with a framed cover plate of the NVIDIA DGX-1 AI supercomputer, signed with the message, You transformed the world.

Jensen presents lead author Ashish Vaswani with a signed DGX-1 cover. There's still time to catch the session replay by registering for a virtual GTC pass - it's free.

To discover the latest in generative AI, watch Huang's GTC keynote address:
LINK: https://blogs.nvidia.com/blog/gtc-2024-transformer-ai-research-panel-j...
See more stories from nvidia

Most recent headlines

05/01/2027

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be demoed at CES 2026

Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...

01/06/2026

Dolby Sets the New Standard for Premium Entertainment at CES 2026

January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026 Throughout the week, Dolby brings to life the latest innovatio...

02/05/2026

Dalet Flex LTS Delivers Smarter Search, Faster Editing, and an AI-Ready Foundation for Modern Media

Dalet, a leading technology and service provider for media-rich organizations, t...

01/05/2026

NBCUniversal's Peacock to Be First Streamer to Integrate Dolby's Full Suite of Premium Picture and Sound Innovations

January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...

01/04/2026

DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION

January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION Douyin Users Can Now Create And Share Videos With Stun...

20/02/2026

Lightware and Cisco interoperability brings AV system suc...

A leading global investment bank, with offices at Two International Finance Centre in Hong Kong, partnered with systems integrators Global Vision Engineering (G...

20/02/2026

Rise AV and Rise Broadcast unite for International Womens...

Rise AV and Rise Broadcast, the global not-for-profit organisations dedicated to improving gender diversity across technical industries, have today announced a ...

20/02/2026

Open Broadcast Systems launches Two Hundred Gigabit Ether...

Open Broadcast Systems, the leader in software-based professional video transport, has added support for 200 Gigabit Ethernet to its range of encoders and decod...

20/02/2026

Signiant Launches Customer Advisory Board to Help Shape t...

Signiant today announced the formation of its Customer Advisory Board (CAB), bringing together a select group of customers to collaborate on product strategy, r...

20/02/2026

PTZOptics launches its Visual Reasoning initiative and pa...

PTZOptics today announced the launch of its Visual Reasoning initiative that makes video more actionable by combining robotic PTZ camera systems, AI, and open i...

20/02/2026

DELTA and Amino Complete Certification of Amigo 7N Androi...

Amino, a global media technology provider delivering devices, software and cloud services that simplify and elevate video delivery, today announced the successf...

20/02/2026

SMPTE Opens Call for Papers for 2026 Media Technology Sum...

SMPTE , the home of media professionals, technologists, and engineers, today announced its call for technical papers for the SMPTE 2026 Media Technology Summit....

20/02/2026

Granicus Standardizes Hybrid Government-Grade Video Infra...

Wowza Media Systems today announced that Granicus, a leading provider of digital engagement solutions for governments, continues to rely on Wowza to power its h...

20/02/2026

CBS Baltimore Launches New AR/VR Studio

Share Copy link Facebook X Linkedin Bluesky Email...

20/02/2026

IAB Tech Lab Opens Public Comment on Live Event Ad Playbook

Share Copy link Facebook X Linkedin Bluesky Email...

20/02/2026

Study: Sports Programming on Major Streamers Up 52% YoY

Share Copy link Facebook X Linkedin Bluesky Email...

20/02/2026

ESPN to launch Women's Sports Sundays

Share Copy link Facebook X Linkedin Bluesky Email...

20/02/2026

Tegna, Seattle Kraken Extend Broadcast Deal

Share Copy link Facebook X Linkedin Bluesky Email...

20/02/2026

Signiant Announces Customer Advisory Board

Share Copy link Facebook X Linkedin Bluesky Email...

20/02/2026

Other Voices returns to RT this Spring with performances from Dermot Kennedy, Amble, Florence Road and more

Other Voices returns to RT this Spring with performances from Dermot Kennedy, A...

19/02/2026

CBC Navigates Multi-Zone Winter Olympics With Bilingual Production, Remote Studios, Custom Content Hubs

The Canadian rightsholder deploys its most complex' Olympics setup an ever,...

19/02/2026

Suite Studios Announces S3 Native File Streaming for Real-Time Media Workflows at Petabyte Scale

Suite Studios, a cloud-native platform that connects creative teams to their med...

19/02/2026

Guitar Center Named Official Music Gear Retailer and AV Integrator for Tennessee Titans' New Nissan Stadium

Guitar Center and the Tennessee Titans announce a first-of-its-kind partnership ...

19/02/2026

DAZN, Matchroom Boxing Sign Five-Year Deal to Extend Long-Standing Partnership

DAZN is reinforcing its leadership in global boxing through a new five-year deal with Matchroom Boxing in the United States and the United Kingdom. The deal ext...

19/02/2026

PTZOptics Launches Visual Reasoning Initiative, Partners with Moondream to Automate Video Decision-Making

PTZOptics announces the launch of its visual reasoning initiative that makes vid...

19/02/2026

The Influencer Games? OBS Bakes Digital-Native Content Creators - and Athletes - Into Milano Cortina

Influencer Positions, AI-driven vertical video, and platform-native creators res...

19/02/2026

Scaling the Infinite Feed: OBS Redefines the Multi-Platform Olympic Experience

Host broadcaster evolves from world-feed producer to global content orchestrator, unlocking hidden moments for every platform...

19/02/2026

Got Drones? How OBS's FPV Strategy Changes the Game for Everyone

Custom drones raise expectations for bringing viewers closer to the action in new ways 2026 is not the first time OBS has used drones or event First Person Vie...

19/02/2026

Xfinity Delivered Super Bowl LX Faster Than Any Other Provider

Comcast's Xfinity announces record-setting performance from its new RealTime4K technology during Super Bowl LX, delivering the game to customers' homes ...

19/02/2026

Cortina Sliding Center Photo Gallery: Women's Monobob

The Women's Monobob turned out to be a historic event: Team USA's Elana Meyers Taylor captured the gold at age 41, making her the oldest Bobsleigh gold ...

19/02/2026

SVG Students To Watch: Philip Doherty, Elon University

The senior from Georgia has found his calling as a technical director and video engineer In the live-sports-video industry, the future is bright. Our series SV...

19/02/2026

Cosm Expands its Leadership Team to Scale Its Sports and Entertainment Division

Cosm announces the appointment of two new leaders - Jeff Hughes (President, Sports and Entertainment) and Rob Laycock (Vice President, Head of Venues Marketing)...

19/02/2026

NBC Olympics Brings Its Signature Studio Look Home' to the IBC in Milan

Two main sets feature Unreal Engine environment, LED architecture, camera-tracked virtual windows; a third, in the TV Tower,' overlooks the Duomo...

19/02/2026

Watch Robyn's Long-Awaited Return to the Stage in Spotify's Exclusive Concert Film

Last November, Spotify hosted an unforgettable night with Robyn, where top fans ...

19/02/2026

Warner Bros. Discovery returns to position one in Poland for first time since May 2025

WBD edges above Polsat in only change in the top 10 content distributor rankings...

19/02/2026

Nielsen data shows Kiwis' love of travel drives 10% lift in ad investment

Airlines and cruise operators ramp up spend as New Zealanders plan, compare and book online Auckland, February 19, 2026 - New Nielsen Ad Intel data shows adver...

19/02/2026

The Gauge: Poland | January 2026

The beginning of the new year brought a clear revival in front of television screens. The winter aura meant that Poles spent an average of 5% more time watching...

19/02/2026

Sports programming on major streaming platforms surges 52% year-over-year, according to Gracenote

New data also shows news content on FAST channels up 58% as streaming catalogs c...

19/02/2026

Sinclair Launches New True Crime Daily Video Podcast

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

Berklee Awards Fenway Neighborhood Improvement Grants to Six Organizations

Berklee Awards Fenway Neighborhood Improvement Grants to Six Organizations A total of $25,000 will be distributed among the Boston-based nonprofits. February...

19/02/2026

Submissions open for Best of Show Awards at NAB 2026

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

SMPTE Issues Call for Summit Papers

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

Broadband Usage Jumps by 9.9% in Q4

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

NBC Sports Details NEPs Extensive Work for Winter Olympic Coverage

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

Fuse Media and Complex to Launch Complex TV

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

Spincast Gets U.S. Patent for AI-Powered Shoppable TV

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

IBCAP Announces $21 Million Lawsuit Against DMTN IPTV

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

NRB Backs ATSC 3.0 Tuner and Must-Carry Requirements

Share Copy link Facebook X Linkedin Bluesky Email...

19/02/2026

Foundry Acquires Griptape to Accelerate AI Integration Ac...

Foundry, the leading developer of creative software for the Media and Entertainment industry, today announced the completion of its acquisition of Griptape, a p...

19/02/2026

Scott D Smith Captures the Chaos and Intimacy of The Bear...

Capturing the raw energy and emotional intensity of FX's hit series The Bear is no small feat, especially when the set itself is as hectic and unpredictab...