
Back in 2018, BERT got people talking about how machine learning models were learning to read and speak. Today, large language models, or LLMs, are growing up fast, showing dexterity in all sorts of applications.
They're, for one, speeding drug discovery, thanks to research from the Rostlab at Technical University of Munich, as well as work by a team from Harvard, Yale and New York University and others. In separate efforts, they applied LLMs to interpret the strings of amino acids that make up proteins, advancing our understanding of these building blocks of biology.
It's one of many inroads LLMs are making in healthcare, robotics and other fields.
A Brief History of LLMs
Transformer models - neural networks, defined in 2017, that can learn context in sequential data - got LLMs started.
Researchers behind BERT and other transformer models made 2018 a watershed moment for natural language processing, a report on AI said at the end of that year. "Quite a few experts have claimed that the release of BERT marks a new era in NLP," it added.
Developed by Google, BERT (aka Bidirectional Encoder Representations from Transformers) delivered state-of-the-art scores on benchmarks for NLP. In 2019, Google announced that BERT powers the company's search engine.
Google released BERT as open-source software, spawning a family of follow-ons and setting off a race to build ever larger, more powerful LLMs.
For instance, Meta created an enhanced version called RoBERTa, released as open-source code in July 2019. For training, it used an order of magnitude more data than BERT, the paper said, and leapt ahead on NLP leaderboards. A scrum followed.
Scaling Parameters and Markets
For convenience, score is often kept by the number of an LLM's parameters or weights, measures of the strength of a connection between two nodes in a neural network. BERT had 110 million, RoBERTa had 123 million, then BERT-Large weighed in at 354 million, setting a new record, but not for long.
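To make the idea of counting parameters concrete, here is a minimal sketch in Python that tallies the weights of a small fully connected network. The function name, the toy layer sizes and the choice to include bias terms are illustrative assumptions, not details of any model named above. Real LLMs are built from transformer layers with a more elaborate structure, so this only shows the bookkeeping.

```python
def count_parameters(layer_sizes):
    """Count weights and biases in a fully connected network.

    layer_sizes lists the number of nodes in each layer, input layer first.
    """
    total = 0
    # Pair each layer with the one that follows it.
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        weights = n_in * n_out  # one weight per connection between two nodes
        biases = n_out          # one bias per node in the receiving layer
        total += weights + biases
    return total


# Hypothetical toy network: 4 inputs, 8 hidden nodes, 2 outputs.
toy_network = [4, 8, 2]
print(count_parameters(toy_network))  # prints 58
```

The toy network has 4 x 8 + 8 = 40 parameters in its first layer and 8 x 2 + 2 = 18 in its second, for 58 in total. The same tally, applied to far wider and deeper networks, is how counts reach the hundreds of millions and beyond.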
As LLMs expanded into new applications, their size and computing requirements grew. In 2020, researchers at OpenAI and Johns Hopkins University announced GPT-3, with a whopping 175 billion parameters, trained on a dataset with nearly a trillion words. It scored well on a slew of language tasks and even ciphered three-digit arithmetic.
"Language models have a wide range of beneficial applications for society," the researchers wrote.
Experts Feel 'Blown Away'
Within weeks, people were using GPT-3 to create poems, programs, songs, websites and more. Recently, GPT-3 even wrote an academic paper about itself.
"I just remember being kind of blown away by the things that it could do, for being just a language model," said Percy Liang, a Stanford associate professor of computer science, speaking in a podcast.
GPT-3 helped motivate Stanford to create a center Liang now leads, exploring the implications of what it calls foundation models that can handle a wide variety of tasks well.
Toward Trillions of Parameters
Last year, NVIDIA announced the Megatron 530B LLM that can be trained for new domains and languages. It debuted with tools and services for training language models with trillions of parameters.
"Large language models have proven to be flexible and capable, able to answer deep domain questions without specialized training or supervision," Bryan Catanzaro, vice president of applied deep learning research at NVIDIA, said at that time.
Making it even easier for users to adopt the powerful models, the NVIDIA NeMo LLM service debuted in September at GTC. It's an NVIDIA-managed cloud service to adapt pretrained LLMs to perform specific tasks.
Transformers Transform Drug Discovery
The advances LLMs are making with proteins and chemical structures are also being applied to DNA.
Researchers aim to scale their work with NVIDIA BioNeMo, a software framework and cloud service to generate, predict and understand biomolecular data. Part of the NVIDIA Clara Discovery collection of frameworks, applications and AI models for drug discovery, it supports work in widely used protein, DNA and chemistry data formats.
NVIDIA BioNeMo features multiple pretrained AI models, including the MegaMolBART model, developed by NVIDIA and AstraZeneca.
In their paper on foundation models, Stanford researchers projected many uses for LLMs in healthcare.
LLMs Enhance Computer Vision
Transformers are also reshaping computer vision as powerful LLMs replace traditional convolutional AI models. For example, researchers at Meta AI and Dartmouth designed TimeSformer, an AI model that uses transformers to analyze video with state-of-the-art results.
Experts predict such models could spawn all sorts of new applications in computational photography, education and interactive experiences for mobile users.
In related work earlier this year, two companies released powerful AI models to generate images from text.
OpenAI announced DALL-E 2, a transformer model with 3.5 billion parameters designed to create realistic images from text descriptions. And recently, Stability AI, based in London, launched Stable Diffusion.
Writing Code, Controlling Robots
LLMs also help developers write software. Tabnine - a member of NVIDIA Inception, a program that nurtures cutting-edge startups - claims it's automating up to 30% of the code generated by a million developers.
Taking the next step, researchers are using transformer-based models to teach robots used in manufacturing, construction, autonomous driving and personal assistants.
For example, DeepMind developed Gato, an LLM that taught a robotic arm how to stack blocks. The 1.2-billion parameter model was trained on more than 600 distinct tasks so it could be useful in a variety of modes and environments, whether playing games or animating chatbots.
The Gato LLM can analyze robot actions and images as