Sony Pixel Power calrec Sony

What Is a Transformer Model?

25/03/2022

If you want to ride the next big wave in AI, grab a transformer.

They're not the shape-shifting toy robots on TV or the trash-can-sized tubs on telephone poles.

So, What's a Transformer Model? A transformer model is a neural network that learns context and thus meaning by tracking relationships in sequential data like the words in this sentence.

Transformer models apply an evolving set of mathematical techniques, called attention or self-attention, to detect subtle ways even distant data elements in a series influence and depend on each other.

First described in a 2017 paper from Google, transformers are among the newest and one of the most powerful classes of models invented to date. They're driving a wave of advances in machine learning some have dubbed transformer AI.

Stanford researchers called transformers foundation models in an August 2021 paper because they see them driving a paradigm shift in AI. The sheer scale and scope of foundation models over the last few years have stretched our imagination of what is possible, they wrote.

What Can Transformer Models Do? Transformers are translating text and speech in near real-time, opening meetings and classrooms to diverse and hearing-impaired attendees.

They're helping researchers understand the chains of genes in DNA and amino acids in proteins in ways that can speed drug design.

Transformers, sometimes called foundation models, are already being used with many data sources for a host of applications. Transformers can detect trends and anomalies to prevent fraud, streamline manufacturing, make online recommendations or improve healthcare.

People use transformers every time they search on Google or Microsoft Bing.

The Virtuous Cycle of Transformer AI Any application using sequential text, image or video data is a candidate for transformer models.

That enables these models to ride a virtuous cycle in transformer AI. Created with large datasets, transformers make accurate predictions that drive their wider use, generating more data that can be used to create even better models.

Stanford researchers say transformers mark the next stage of AI's development, what some call the era of transformer AI. Transformers made self-supervised learning possible, and AI jumped to warp speed, said NVIDIA founder and CEO Jensen Huang in his keynote address this week at GTC.

Transformers Replace CNNs, RNNs Transformers are in many cases replacing convolutional and recurrent neural networks (CNNs and RNNs), the most popular types of deep learning models just five years ago.

Indeed, 70 percent of arXiv papers on AI posted in the last two years mention transformers. That's a radical shift from a 2017 IEEE study that reported RNNs and CNNs were the most popular models for pattern recognition.

No Labels, More Performance Before transformers arrived, users had to train neural networks with large, labeled datasets that were costly and time-consuming to produce. By finding patterns between elements mathematically, transformers eliminate that need, making available the trillions of images and petabytes of text data on the web and in corporate databases.

In addition, the math that transformers use lends itself to parallel processing, so these models can run fast.

Transformers now dominate popular performance leaderboards like SuperGLUE, a benchmark developed in 2019 for language-processing systems.

How Transformers Pay Attention Like most neural networks, transformer models are basically large encoder/decoder blocks that process data.

Small but strategic additions to these blocks (shown in the diagram below) make transformers uniquely powerful.

A look under the hood from a presentation by Aidan Gomez, one of eight co-authors of the 2017 paper that defined transformers. Transformers use positional encoders to tag data elements coming in and out of the network. Attention units follow these tags, calculating a kind of algebraic map of how each element relates to the others.

Attention queries are typically executed in parallel by calculating a matrix of equations in what's called multi-headed attention.

With these tools, computers can see the same patterns humans see.

Self-Attention Finds Meaning For example, in the sentence:

She poured water from the pitcher to the cup until it was full.

We know it refers to the cup, while in the sentence:

She poured water from the pitcher to the cup until it was empty.

We know it refers to the pitcher.

Meaning is a result of relationships between things, and self-attention is a general way of learning relationships, said Ashish Vaswani, a former senior staff research scientist at Google Brain who led work on the seminal 2017 paper.

Machine translation was a good vehicle to validate self-attention because you needed short- and long-distance relationships among words, said Vaswani.

Now we see self-attention is a powerful, flexible tool for learning, he added.

How Transformers Got Their Name Attention is so key to transformers the Google researchers almost used the term as the name for their 2017 model. Almost.

Attention Net didn't sound very exciting, said Vaswani, who started working with neural nets in 2011.

.Jakob Uszkoreit, a senior software engineer on the team, came up with the name Transformer.

I argued we were transforming representations, but that was just playing semantics, Vaswani said.

The Birth of Transformers In the paper for the 2017 NeurIPS conference, the Google team described their transformer and the accuracy records it set for machine translation.

Thanks to a basket of techniques, they trained their model in just 3.5 days on eight NVIDIA GPUs, a small fraction of the time and cost of training prior models. They trained it on datasets with up to a billion pairs of words.

It was an intense three-month sprint to the paper submissio
LINK: https://blogs.nvidia.com/blog/2022/03/25/what-is-a-transformer-model/...
See more stories from nvidia

Most recent headlines

04/08/2024

Dalet Appoints Santiago Solanas as CEO to Lead Next Era of Growth and Innovation

Dalet, a leading technology and service provider for media-rich organizations, is excited to announce Santiago Solanas as its new Chief Executive Officer (CEO)....

03/06/2024

Dalet and Veritone Reach Agreement to Distribute, Transact and Monetize Media Archives

Dalet, a leading technology and service provider for media-rich organizations, a...

08/05/2024

Clear-Com to Showcase Cutting-Edge Communication Solution...

Clear-Com , a renowned provider of professional real-time communication solutions, is thrilled to announce its participation at InfoComm 2024, the largest profe...

08/05/2024

MRMC Achieves its Second Kings Award for Enterprise

Mark Roberts Motion Control (MRMC), a Nikon company, is one of the 252 organisations recognised with a prestigious King's Award for Enterprise. Announced to...

08/05/2024

EditShare goes beyond storage at MPTS

EditShare, the technology leader that enables storytellers to create and manage collaborative workflows at every stage from storyboard to screen, is exhibiting ...

08/05/2024

Magnifi and Linius Fuse Live and Archive Content to Redef...

Magnifi by VideoVerse an AI-driven video technology company and powerful video-editing SaaS platform is pleased to announce a strategic partnership with Linius ...

08/05/2024

NVIDIA RTX GPU Connects DaVinci Resolve to Power

NVIDIA RTX GPU Connects DaVinci Resolve to Power Brie Clayton May 8, 2024 0 Comments Two of the most important questions that the Creative COW audienc...

08/05/2024

Q&A with Visiting Artist Lei Liang

Q&A with Visiting Artist Lei Liang The acclaimed composer discusses his collaboration with Boston Conservatory, his work as a research artist, and what it mea...

08/05/2024

More Stations in San Antonio Launch NextGen TV Broadcasts

A second group of stations has begun broadcasting NextGen TV signals in San Antonio, Texas....

08/05/2024

NBC Orders More Deal or No Deal Island'

NBC has ordered a second season of unscripted Deal or No Deal Island. Joe Manganiello hosts, and executive produces alongside Howie Mandel. Manganiello will ret...

08/05/2024

Tegna Names Greg Retsinas GM At KGW, Portland, Oregon

Tegna said it named Greg Retsinas as president and general manager of KGW, the NBC affiliate in Portland, Oregon, effective June 3....

08/05/2024

Measurement Company Comscore Posts $5.2 Million Loss in 1st Quarter

Comscore lost money in the first quarter as weak linear ad sales impacted its cross-platform measurement business....

08/05/2024

Funny Business as NBC Orders Happy's Place,' Renews Lopez vs. Lopez'

NBC said it bolstered its comedy slate by renewing Lopez vs. Lopez and ordering Happy's Place, starring Reba McEntire....

08/05/2024

Knuckles' Knocks Fallout' From Top of TVision Power Score Rankings

Paramount Plus's Knuckles knocked Amazon Prime Video's Fallout out of the top spot in TVision's Power Score ranking of the top show on connected TV ...

08/05/2024

Tegna Adds 11 Markets To Lineup for Indiana Fever Games

Tegna, which made a deal to broadcast 17 Indiana Fever games featuring top WNBA draft pick Caitlin Clark, said it made deals that will put the games on stations...

08/05/2024

EchoStar Loses 348,000 Pay TV Subs in 1st Quarter

EchoStar said it lost another 348,000 pay TV subscribers so far this year, closing the first quarter with 8.18 million subscribers. That total includes 6.26 mil...

08/05/2024

Fox Has Net Income of $666 Million in 3rd Quarter

Fox reported a profit for its fiscal third quarter....

08/05/2024

IPG's Kinesso Unit Adds Senior Executives to Lineup

Kinesso, the centralized tech and data unit at media agency IPG Mediabrands, named Tom Amies-Cull as global chief operating officer and Amie Owen as global chie...

08/05/2024

Post production houses adopt Cleanfeed Cinema solution

The solutions in-browser stream focuses on low latency, making it suited to low bandwidth scenarios By Matthew Corrigan Published: May 8, 2024 The solutio...

08/05/2024

TAG unveils subtitling language detection feature

Driven by algorithms, the solution performs a quality analysis informed by language-specific dictionaries By Matthew Corrigan Published: May 8, 2024 Drive...

08/05/2024

Mobilelinks acquires 2 SAT Europe to increase SNG truck fleet

The increase in fleet size will reduce travel distances, aligning with sustainability goals, said Mobilelinks By Matthew Corrigan Published: May 8, 2024 T...

08/05/2024

Arqiva adds Caroline Cardozo and James Lelyveld to technology team

The company said the new arrivals would drive collaboration and technological transformation across key business units By Matthew Corrigan Published: May 8, ...

08/05/2024

Actus Digital Set to Shine at CABSAT and Broadcast Asia

Following a successful NAB Las Vegas 2024 and winning a Best of Show Award, Actus Digital, a leading provider of Intelligent Monitoring Platforms, will bring it...

08/05/2024

TAG Revolutionizes Closed Captions and Subtitles Quality...

TAG Video Systems, a leading force in video monitoring solutions, has developed a new Language Detection feature set to transform how operators ensure quality a...

08/05/2024

Pliant Technologies Unveils New SmartBoom LITE Headset at...

Pliant Technologies, a leading provider of professional wireless intercom solutions,presents its new SmartBoom LITE Headset at InfoComm 2024 (Booth C5116). The ...

08/05/2024

Prism Sound Showcases High-Quality Audio Conversion at MP...

At the Media Production & Technology Show 2024, Prism Sound will showcase high-quality audio conversion products designed to suit the demands of professional us...

08/05/2024

Julian Day Joins FooEngine

Soho stalwart Julian Day has joined FooEngine as Business Development Director after 13 years at ZOO Digital. Julian has been at the heart of the London post pr...

08/05/2024

Cleanfeed Cinema Redefines Audio Post Production Workflow...

Following its successful launch at NAB 2024, Cleanfeed Cinema - the latest remote recording innovation from Emmy Award-winning Cleanfeed - is already helping au...

08/05/2024

DHD to Showcase New Product Line-up at MPTS 2024

DHD's range of digital broadcast equipment, systems and related software will be promoted on stand D22 at the Media Production & Technology Show by UK-regio...

08/05/2024

MSP CloudReso selects Cubbit hyper-resilient DS3 distribu...

Cubbit, the innovator behind Europe's first distributed cloud storage enabler, today announced that CloudReso, a France-based distributor of MSP security so...

08/05/2024

MwareTV boosts Americas presence with Daniel Conde Coto

MwareTV, a prominent cloud-based multi-tenant platform provider, has attracted Daniel Conde Coto to join the company as director, sales operations. This is a si...

08/05/2024

LiveU Demonstrates its Efficient IP-Video Workflows for L...

In a year set to see record-breaking IP-video adoption, with over 70 elections and global sports events, LiveU heads to Broadcast Asia 2024 with a focus on its ...

08/05/2024

nxtedition Showcases a Fully Automated AR Studio Gallery...

Pioneers in microservices-based production environments, nxtedition, will demonstrate the latest advances in storytelling technology at the Media Production & T...

08/05/2024

MPTS 2024 - Leader and PHABRIX to showcase multiple new T...

Test & measurement innovator, Leader Electronics of Europe, has announced that it will again exhibit at The Media Production & Technology Show (MPTS), which tak...

08/05/2024

Livepeer Studio cuts the cost of live streaming and trans...

Livepeer Studio unveils a revolutionary video streaming platform offering an unprecedented combination of quality and cost-efficiency to content creators, media...

08/05/2024

Ikegami to Demonstrate Complete Broadcast Media Productio...

Ikegami Electronics (Europe) will demonstrate a complete broadcast media production system on stand S1-A15 at CABSAT 2024 in Dubai, Tuesday May 21 through Thurs...

08/05/2024

Experience Commerce Bags Digital Agency Mandate for SAVSO...

Experience Commerce (EC), a leading digital agency within the Cheil Network, is pleased to announce that it has won the digital mandate for SAVSOL, the flagship...

08/05/2024

Global Telecom & Pay TV Services Market to Slowdown in 2024

NEEDHAM, Mass. The International Data Corporation is predicting that worldwide spending on telecom services and pay TV services will increase by 1.4% in 2024 to...

08/05/2024

Apple Unveils New iPad Live Multicam Production Studio

CUPERTINO, Calif. In a notable development for news and live video production, Apple has unveiled a number of significant upgrades to its Final Cut Pro software...

08/05/2024

Pliant Technologies to Showcase New Smartboom Lite Headset at InfoComm 2024

Pliant Technologies has announced that it will be presenting its new SmartBoom LITE Headset at InfoComm 2024 (Booth C5116). The latest updates include enhanceme...

08/05/2024

Tubi Launches 'Stubios' to Encourage Aspiring Filmmakers

SAN FRANCISCO Fox's ad-supported streaming service Tubi has launched Stubios, a fan-fueled studio for aspiring filmmakers and their fans that the company sa...

08/05/2024

Five More Stations Launch NextGen TV In San Antonio

SAN ANTONIO, Texas Five more stations have launched NextGen TV service, bringing to nine the number of local TV services on-air with ATSC 3.0....

08/05/2024

Tegna Names Greg Retsinas President & GM of KGW in Portland

TYSONS, Va. Tegna Inc. has announced that Greg Retsinas has been named president and general manager at KGW, Tegna's NBC affiliate serving the Portland area...

08/05/2024

Olivia Colman and John Lithgow Star in Sophie Hyde's New Project JIMPA

08 05 2024 - Media release Olivia Colman and John Lithgow Star in Sophie Hyde's New Project JIMPA Olivia Coleman and John Lithgow in JIMPA. Photo credit: ...

08/05/2024

OpenDrives Joins AWS Partner Network

OpenDrives Joins AWS Partner Network Brie Clayton May 7, 2024 0 Comments Atlas software-defined platform now available via AWS trusted partner network...

08/05/2024

Animate AI Matte Paintings in After Effects

Animate AI Matte Paintings in After Effects Graham Quince May 7, 2024 0 Comments Arguably the best video use for AI generated images is Matte Painting...

08/05/2024

Throw Expression in Adobe After Effects

Throw Expression in Adobe After Effects Andy Ford May 7, 2024 0 Comments The Throw expression is a time-saver in After Effects. It allows you to mov...

08/05/2024

Join FilmLight in Toronto: Baselight 6.1, Nara and more

Join FilmLight in Toronto: Baselight 6.1, Nara and more Brie Clayton May 7, 2024 0 Comments Thursday 30th May, from 2:30PM, Hosted by Alter Ego 488 ...

08/05/2024

Gray TV Reports $75 Million in Net Income for Second Quarter

Gray Television reported a first-quarter profit as advertising revenues rebounded to above pre-COVID-19 levels....

08/05/2024

Meghan Trainor, Bleachers, Chance the Rapper Lined Up for Today' Summer Concert Series

The Today show shared the performers on its summer concert series, which feature...