
A widely acclaimed large language model for genomic data has demonstrated its ability to generate gene sequences that closely resemble real-world variants of SARS-CoV-2, the virus behind COVID-19.
Called GenSLMs, the model, which last year won the Gordon Bell special prize for high performance computing-based COVID-19 research, was trained on a dataset of nucleotide sequences - the building blocks of DNA and RNA. It was developed by researchers from Argonne National Laboratory, NVIDIA, the University of Chicago and a score of other academic and commercial collaborators.
When the researchers looked back at the nucleotide sequences generated by GenSLMs, they discovered that specific characteristics of the AI-generated sequences closely matched the real-world Eris and Pirola subvariants that have been prevalent this year - even though the AI was only trained on COVID-19 virus genomes from the first year of the pandemic.
Our model's generative process is extremely naive, lacking any specific information or constraints around what a new COVID variant should look like, said Arvind Ramanathan, lead researcher on the project and a computational biologist at Argonne. The AI's ability to predict the kinds of gene mutations present in recent COVID strains - despite having only seen the Alpha and Beta variants during training - is a strong validation of its capabilities.
In addition to generating its own sequences, GenSLMs can also classify and cluster different COVID genome sequences by distinguishing between variants. In a demo available on NGC, NVIDIA's hub for accelerated software, users can explore visualizations of GenSLMs' analysis of the evolutionary patterns of various proteins within the COVID viral genome.
https://blogs.nvidia.com/wp-content/uploads/2023/11/GenSLM.mp4
Reading Between the Lines, Uncovering Evolutionary Patterns A key feature of GenSLMs is its ability to interpret long strings of nucleotides - represented with sequences of the letters A, T, G and C in DNA, or A, U, G and C in RNA - in the same way an LLM trained on English text would interpret a sentence. This capability enables the model to understand the relationship between different areas of the genome, which in coronaviruses consists of around 30,000 nucleotides.
In the NGC demo, users can choose from among eight different COVID variants to understand how the AI model tracks mutations across various proteins of the viral genome. The visualization depicts evolutionary couplings across the viral proteins - highlighting which snippets of the genome are likely to be seen in a given variant.
Understanding how different parts of the genome are co-evolving gives us clues about how the virus may develop new vulnerabilities or new forms of resistance, Ramanathan said. Looking at the model's understanding of which mutations are particularly strong in a variant may help scientists with downstream tasks like determining how a specific strain can evade the human immune system.
https://blogs.nvidia.com/wp-content/uploads/2023/11/SLM.mp4
GenSLMs was trained on more than 110 million prokaryotic genome sequences and fine-tuned with a global dataset of around 1.5 million COVID viral sequences using open-source data from the Bacterial and Viral Bioinformatics Resource Center. In the future, the model could be fine-tuned on the genomes of other viruses or bacteria, enabling new research applications.
To train the model, the researchers used NVIDIA A100 Tensor Core GPU-powered supercomputers, including Argonne's Polaris system, the U.S. Department of Energy's Perlmutter and NVIDIA's Selene.
The GenSLMs research team's Gordon Bell special prize was awarded at last year's SC22 supercomputing conference. At this week's SC23, in Denver, NVIDIA is sharing a new range of groundbreaking work in the field of accelerated computing. View the full schedule and catch the replay of NVIDIA's special address below.
NVIDIA Research comprises hundreds of scientists and engineers worldwide, with teams focused on topics including AI, computer graphics, computer vision, self-driving cars and robotics. Learn more about NVIDIA Research and subscribe to NVIDIA healthcare news.
Main image courtesy of Argonne National Laboratory's Bharat Kale.
This research was supported by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the U.S. DOE Office of Science and the National Nuclear Security Administration. Research was supported by the DOE through the National Virtual Biotechnology Laboratory, a consortium of DOE national laboratories focused on response to COVID-19, with funding from the Coronavirus CARES Act.
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
27/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
27/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
27/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
27/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
27/01/2026
New agreement for uninterrupted UHF connectivity for Australian Defence Force through 2033, With Options Extending to 2041
Luxembourg, January 13, 2026 - Satel...
26/01/2026
Music videos play a huge part in how fans connect with their favorite songs, but...
26/01/2026
NAIDOC launches 2026 theme 50 Years of Deadly , marks major milestone
25 January, 2026
Media releases
The National NAIDOC Committee has today unveiled its...
26/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
26/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
26/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
26/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
26/01/2026
Advanced Systems Group (ASG) today announced an exclusive partnership with Nu Studio, the company behind the first modular, portable studio designed for immersi...
26/01/2026
OpenDrives, the leader in high-end video data management and workflow solutions, today announced an add-on investment to its previous funding rounds, led by IAG...
26/01/2026
SmallHD today announced a major update to its award-winning PageOS software, introducing Fleet Control, Portrait Mode, expanded Camera Control features, and Dow...
26/01/2026
Whether rigging a PTZ on a truss, sliding a Blackmagic camera across a mezzanine railing, or capturing the mayhem from inside a drum, Filipic delivers visuals t...
26/01/2026
26 Jan 2026
VEON Unveils the New Beeline Uzbekistan Network Operations Center, ...
26/01/2026
Bookish is created by and stars Mark Gatiss
Images available HERE
Following th...
26/01/2026
Monday 26 January 2026
New Sky Original Series explores Entertainment Juggernaut, The X Factor
Sky today confirmed it has greenlit a premium, definitive docum...
26/01/2026
Monday 26 January 2026
Sky confirms major investment to transform Livingston ca...
26/01/2026
Back to All News
Netflix Unveils Official Trailer for Salvador
Entertainment
26 January 2026
GlobalSpain
Link copied to clipboard
DISCOVER THE TRAILER
DO...
26/01/2026
Back to All News
Netflix Presents Our 2026 Series and Films and Announces New Projects
Entertainment
26 January 2026
GlobalSpain
Link copied to clipboard
...
26/01/2026
Back to All News
Made in Louisiana: People We Meet on Vacation' Lights Up New Orleans
Emily Bader and Tom Blyth film on Royal Street in New Orleans' ...
26/01/2026
LinkedIn Gives Professionals the Edge with Verified Skills and Tools to Navigate the Job Search Show Proficiency of AI Tools such as Descript, Lovable, Relay.ap...
26/01/2026
Alfalite, Brainstorm, Dejero, Domo Broadcast Systems, FOR-A, KitPlus, Ontario So...
26/01/2026
Tyngsboro, Mass. - January 26, 2026 - Broadcast Pix is excited to exhibit at the Alliance for Community Media West Region Conference and Trade Show, taking plac...
26/01/2026
Software for Stable Data Management and Lasting Business Success in Uncertain Ti...
26/01/2026
RT is today publishing a statistical summary of the Register of External Activities for the third quarter of 2025.
The RT Register of External Activities com...
25/01/2026
Back to All News
Sins of Kujo' Premieres April 2: Teaser Trailer, Art and ...
24/01/2026
Masami Kawai Selected as the 2026 Merata Mita Fellow; Isabella Madrigal and Tsanavi Spoonhunter Named 2026 Graton Fellows During Native Forum Celebration in Par...
24/01/2026
The MC-55A Peregrine aircraft will give the Royal Australian Air Force information superiority and serve as strategic assets for future Australian Defence Force...
24/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
24/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
24/01/2026
Share Share by:
Copy link
Facebook
X
Linkedin
Pinterest
Bluesky
Email...
24/01/2026
RT and Virgin Media Television kick off comprehensive free-to-air coverage of t...
23/01/2026
Staines-upon-Thames, UK, 29 July, 2025 - Yospace, the global leader in Dynamic A...
23/01/2026
WWE's Virtual Production Playbook: How the Professional Wrestling Super Powe...
23/01/2026
Tight set up: Squeezing the PSA's Tournament of Champions into Grand Central...
23/01/2026
Evolving production: The PSA on bringing squash to more viewers at the Tournamen...
23/01/2026
AFC Championship Preview: Behind the Scenes With NFL on CBS' Producer Jim R...
23/01/2026
NFC Championship Preview: FOX Sports Director Rich Russo Talks Technology, Story...
23/01/2026
Spotify's annual Best New Artist celebration honors the rising stars whose talent, creativity, and dedication have propelled them to the music industry'...
23/01/2026
Coalition military forces operating across the vast geography of the Indo-Pacific rely on interoperable, secure data links to share intelligence, surveillance a...
23/01/2026
Artist rendering of L3Harris Technologies' AERIS next generation airborne early warning and control solution....
23/01/2026
The U.S. Air Force AMP Increment II aircraft at L3Harris' facility in Waco, Texas. L3Harris has modernized C-130 avionics since 1985, delivering digital coc...
23/01/2026
Paramount is transforming its operations by unifying the media supply chains of their top brands into a scalable global pipeline.
This transformation enhances ...
23/01/2026
Every delay costs. When a subtitle fails QC, even the smallest issue can mean missed deadlines, extra vendor costs, or frustrated teams. The new Accurate.Video ...