
The University of Florida's academic health center, UF Health, has teamed up with NVIDIA to develop a neural network that generates synthetic clinical data - a powerful resource that researchers can use to train other AI models in healthcare.
Trained on a decade of data representing more than 2 million patients, SynGatorTron is a language model that can create synthetic patient profiles that mimic the health records it's learned from. The 5 billion-parameter model is the largest language generator in healthcare.
Synthetic data isn't actually linked to a real human being, but it has similar characteristics to real patients, said Dr. Duane Mitchell, an assistant vice president for research and director of the UF Clinical and Translational Science Institute. SynGatorTron can, for example, create health records of digital diabetes patients that have features just like a real population.
Using this synthetic data, researchers can create tools, models and tasks without risks or privacy concerns. These can then be used on real data to ask clinical questions, look for associations and even explore patient outcomes.
Working with synthetic data also makes it easier for different research institutions to collaborate and share models. And since the amount of data that can be synthesized is virtually limitless, researchers can use SynGatorTron-generated data to augment small datasets of rare disease patients or minority populations to reduce model bias.
SynGatorTron was developed using the open-source NVIDIA Megatron-LM and NeMo frameworks. It's based on UF Health's GatorTron model, announced last year at NVIDIA GTC. The models were trained on HiPerGator-AI, the university's in-house NVIDIA DGX SuperPOD system, which ranks among the world's top 30 supercomputers.
GatorTron-S, a BERT-style transformer model trained on synthetic data generated by SynGatorTron, will be available for developers next month on the NGC software hub.
SynGatorTron Opens Gate to Robust Training Data To a doctor, an AI-generated doctor's note can appear impractical at first glance - it doesn't represent a real patient and won't read as logical to an expert eye. So a clinician can't make a direct analysis or diagnosis from it. But to an untrained AI, real and synthetic clinical data are both highly valuable.
SynGatorTron's generative capability is a great enabler of natural language processing for medicine, said Dr. Mona Flores, global head of medical AI at NVIDIA. Synthesizing different types of clinical records will democratize the ability to create all sorts of applications dependent on such data by addressing data sparsity and privacy.
Once it's available, research institutions outside UF Health could fine-tune the pretrained SynGatorTron model with their own localized data and apply it to their AI projects. For example, if a given condition or a patient population is underrepresented in a health system's clinical data, SynGatorTron can be prompted to generate additional data with characteristics of that disease or population.
These AI-generated records could then be used to supplement and balance out real healthcare datasets used to train other neural networks, so that they better represent the population.
Since synthetic training datasets mimic real medical notes without being associated with specific patients, they can also be more readily shared across research institutions without raising privacy concerns.
When you have the ability to mimic population characteristics without being tethered to real patients, it opens the imagination to see if we can generate realistic datasets that allow us to answer questions we couldn't otherwise, due to constraints on access to data or limited information on patients of interest, Mitchell said.
One potential application is in clinical trials, which often divide patients into treatment and control groups to measure the effectiveness of a new medication. An application derived from SynGatorTron-generated data could parse through real records and create a digital twin of patient records. These records could then be used as the control group in a clinical trial, instead of having a control group derived by giving real patients a placebo treatment.
Researchers developing a deep learning model to study a rare disease, or the effects of a treatment on a specific population, could also use SynGatorTron for data augmentation, generating more training data to supplement the limited amount of real medical records available.
Healthcare at GTC Register free for GTC, running online March 21-24, to discover the latest in AI and healthcare. Hear from SynGatorTron collaborators in the session A Next-Generation Clinical Language Model, taking place March 23 at 7 a.m. Pacific.
Watch the replay of NVIDIA founder and CEO Jensen Huang's keynote address below:
Most recent headlines
05/01/2027
Worlds first 802.15.4ab-UWB chip verified by Calterah and Rohde & Schwarz to be ...
01/06/2026
January 6 2026, 05:30 (PST) Dolby Sets the New Standard for Premium Entertainment at CES 2026
Throughout the week, Dolby brings to life the latest innovatio...
02/05/2026
Dalet, a leading technology and service provider for media-rich organizations, t...
01/05/2026
January 5 2026, 18:30 (PST) NBCUniversal's Peacock to Be First Streamer to ...
01/04/2026
January 4 2026, 18:00 (PST) DOLBY AND DOUYIN EMPOWER THE NEXT GENERATON OF CREATORS WITH DOLBY VISION
Douyin Users Can Now Create And Share Videos With Stun...
19/03/2026
Live sports production increases complexity, with dynamic audio levels and an overall philosophy that encourages transient volume spikes
Fourteen years ago, Am...
19/03/2026
Advanced Systems Group, a technology and services provider for media creatives and content owners, announced the appointment of Peter Thordarson to the newly cr...
19/03/2026
For this senior from the Bay Area, the speed and pressure of live sports production play right into her strengths
In the live-sports-video industry, the future...
19/03/2026
Grass Valley has expanded its long-term partnership with University of Pittsburg...
19/03/2026
Audio-Technica has released the ATV-SG1 and ATV-SG1LE On-Camera Shotgun Microphones, designed for use with DSLR, mirrorless SLR, and other cameras.
The ATV-SG1...
19/03/2026
Harmonic (booth W2831) announces updates to its XOS Advanced Media Processor aim...
19/03/2026
DAZN and Top Rank have announced a multi-year partnership that will bring Top Ra...
19/03/2026
IHSE, a provider of KVM systems, has announced a partnership with Cyviz AS, a provider of technology solutions for collaboration and mission-critical operations...
19/03/2026
Net Insight has appointed Larissa G rner-Meeus as Chief Product Officer. She joins the company's executive management team.
G rner-Meeus holds a Dipl-Ing. ...
19/03/2026
Leader Electronics of Europe has appointed Rob Stanley as Regional Sales Manager for the UK and Northern Europe. In the role, he will manage key accounts and ha...
19/03/2026
FIFA has announced that YouTube will be a Preferred Platform for the FIFA World Cup 2026.
Under the agreement, FIFA's Media Partners will be able to publis...
19/03/2026
New features across mobile, connected devices, and automotive platforms undersco...
19/03/2026
PSSI Global Services has appointed Ben Bradshaw as Director of Product and Netwo...
19/03/2026
Cobalt Digital has announced its NAB 2026 product lineup, which includes additio...
19/03/2026
Sportradar has released a new report, Innovation in Sports Media: The Next Era of Sports Viewing, examining how the sports viewing experience in the U.S. is evo...
19/03/2026
Matrox Video has been awarded a three-year framework agreement to supply its Con...
19/03/2026
CBS Sports' Jason Cohen and TNT Sports' Chris Brown lead the charge on n...
19/03/2026
A1 Dave Grundtvig and his team deploy plenty of mics to capture the sounds and energy from the stands as well the court
March Madness is a tournament in which ...
19/03/2026
In 2021, we launched EQUAL, a program designed to address an industry reality that persists: Women artists, songwriters, and producers too often face fewer oppo...
19/03/2026
Latest EZKeys 2 expansion arrives
Toontrack's staggering collection of EZKeys 2 expansions has grown once again, and the latest instalment delivers a on...
19/03/2026
New generative AI plug-in due in May 2026
Roland have announced the upcoming launch of a new generative AI tool created in collaboration with Sony Computer ...
19/03/2026
Nick Williams updates users on insolvency process
Nick Williams, the CEO of Native Instruments, has released the following official statement regarding thei...
19/03/2026
Iconic Swedish mic manufacturer back in action
Legendary Swedish microphone manufacturer Milab have announced that production is now fully underway, and mic...
19/03/2026
Acclaimed saturation unit goes virtual
Freqport's Freqtube FT1 (reviewed here in SOS February 2023) offers a convenient way to integrate real valve-base...
19/03/2026
The discontinuation of loss-making business activities as part of the restructur...
19/03/2026
Silicon Valley satire The Audacity premieres 15 April on SBS and SBS On Demand
19 March, 2026
Media releases
From one of the writer/producers of Succession...
19/03/2026
SBS brings communities together at Bondi Pavilion for Harmony Week multilingual ...
19/03/2026
Clarification from SBS regarding Western Sydney expansion
19 March, 2026
Media releases
From an SBS spokesperson:
SBS wishes to clarify some media coverag...
19/03/2026
Test & measurement innovator, Leader Electronics of Europe, is pleased to announce the appointment of Rob Stanley as Regional Sales Manager - UK & Northern Euro...
19/03/2026
The recently announced joint venture between Accedo One and Magine Pro has been officially launched as Leyra. The new company will combine the two complementary...
19/03/2026
Budapest, Hungary, March 2026 - Demand for traditional matrix switching remains strong across live events, rental and staging markets. With a reputation for rel...
19/03/2026
DPA Microphones adds to its CORE microphone selection with the 4097 CORE Micro Shotgun, which delivers a new level of clarity, headroom and sonic transparency...
19/03/2026
Starfish Technologies will present the latest releases of its TS Splicer (Win) and TS Splicer (K8) at NAB Show 2026, together with a new Monitoring Dashboard de...
19/03/2026
Bitmovin, a leading provider of video streaming solutions, has announced that TrueVisions NOW, a leading streaming platform in Thailand, and part of the TrueVis...
19/03/2026
Harmonic (NASDAQ: HLIT) today announced significant enhancements to its XOS Advanced Media Processor that lower the cost of broadcast distribution while enablin...
19/03/2026
Cobalt Digital, the leading designer and manufacturer of award-winning signal processing products, and a founding partner in the openGear initiative has announ...
19/03/2026
Magewell a developer of innovative, high-performance video I/O and IP workflow solutions will be at the 2026 NAB Show on booth C6113. In addition to several...
19/03/2026
Triveni Digital, a trusted leader in ATSC 1.0 and ATSC 3.0 service delivery, data broadcasting and quality assurance solutions, today announced it will showcase...
19/03/2026
2026 NAB Show Exhibitor Preview
April 19 22
Las Vegas
Booth N.1328
Summary:
At the 2026 NAB Show in Las Vegas, Imagine Communications will showcase the lat...
19/03/2026
Broadcast playout leader highlights innovation and industry collaboration
Pebble, the leading automation, content management and integrated channel specialist...
19/03/2026
Tuxera, a leading provider of quality-assured file systems and networking technologies, is highlighting remarkable advances in performance at NAB Show (booth N1...
19/03/2026
Intinor will showcase several new developments to the Direkt platform at NAB 2026, including enhanced SRT monitoring, expanded HDR transport capabilities and su...
19/03/2026
Broadcasters are rebuilding live operations workflows around IP delivery as satellite and dedicated fibre distribution decline, according to new research from C...
19/03/2026
Berklee Popular Music Institute's 24th Showcase Channels '80s Prom Nosta...
19/03/2026
Share
Copy link
Facebook
X
Linkedin
Bluesky
Email...