Sony Pixel Power calrec Sony

What Is Synthetic Data?

08/06/2021

Data is the new oil in today's age of AI, but only a lucky few are sitting on a gusher. So, many are making their own fuel, one that's both inexpensive and effective. It's called synthetic data.

What Is Synthetic Data? Synthetic data is annotated information that computer simulations or algorithms generate as an alternative to real-world data.

Put another way, synthetic data is created in digital worlds rather than collected from or measured in the real world.

It may be artificial, but synthetic data reflects real-world data, mathematically or statistically. Research demonstrates it can be as good or even better for training an AI model than data based on actual objects, events or people.

Users can generate synthetic data for autonomous vehicles using Python inside NVIDIA Omniverse. That's why developers of deep neural networks increasingly use synthetic data to train their models. Indeed, a 2019 survey of the field calls use of synthetic data one of the most promising general techniques on the rise in modern deep learning, especially computer vision that relies on unstructured data like images and video.

The 156-page report by Sergey I. Nikolenko of the Steklov Institute of Mathematics in St. Petersburg, Russia, cites 719 papers on synthetic data. Nikolenko concludes synthetic data is essential for further development of deep learning [and] many more potential use cases still remain to be discovered.

The rise of synthetic data comes as AI pioneer Andrew Ng is calling for a broad shift to a more data-centric approach to machine learning. He's rallying support for a benchmark or competition on data quality which many claim represents 80 percent of the work in AI.

Most benchmarks provide a fixed set of data and invite researchers to iterate on the code perhaps it's time to hold the code fixed and invite researchers to improve the data, he wrote in his newsletter, The Batch.

Augmented and Anonymized Versus Synthetic Data Most developers are already familiar with data augmentation, a technique that involves adding new data to an existing real-world dataset. For example, they might rotate or brighten an existing image to create a new one.

Given concerns and government policies about privacy, removing personal information from a dataset is an increasingly common practice. This is called data anonymization, and it's especially popular for text, a kind of structured data used in industries like finance and healthcare.

Augmented and anonymized data are not typically considered synthetic data. However, it's possible to create synthetic data using these techniques. For example, developers could blend two images of real-world cars to create a new synthetic image with two cars.

Why Is Synthetic Data So Important? Developers need large, carefully labeled datasets to train neural networks. More diverse training data generally makes for more accurate AI models.

The problem is gathering and labeling datasets that may contain a few thousand to tens of millions of elements is time consuming and often prohibitively expensive.

Enter synthetic data. A single image that could cost $6 from a labeling service can be artificially generated for six cents, estimates Paul Walborsky, who co-founded one of the first dedicated synthetic data services, AI.Reverie.

Cost savings are just the start. Synthetic data is key in dealing with privacy issues and reducing bias by ensuring you have the data diversity to represent the real world, Walborsky added.

Because synthetic datasets are automatically labeled and can deliberately include rare but crucial corner cases, it's sometimes better than real-world data.

What's the History of Synthetic Data? Synthetic data has been around in one form or another for decades. It's in computer games like flight simulators and scientific simulations of everything from atoms to galaxies.

Donald B. Rubin, a Harvard statistics professor, was helping branches of the U.S. government sort out issues such as an undercount especially of poor people in a census when he hit upon an idea. He described it in a 1993 paper often cited as the birth of synthetic data.

I used the term synthetic data in that paper referring to multiple simulated datasets, Rubin explained.

Each one looks like it could have been created by the same process that created the actual dataset, but none of the datasets reveal any real data - this has a tremendous advantage when studying personal, confidential datasets, he added.

In the wake of the Big Bang of AI, the ImageNet competition of 2012 when a neural network recognized objects faster than a human could, researchers started hunting in earnest for synthetic data.

Within a couple years, researchers were using rendered images in experiments, and it was paying off well enough that people started investing in products and tools to generate data with their 3D engines and content pipelines, said Gavriel State, a senior director of simulation technology and AI at NVIDIA.

Ford, BMW Generate Synthetic Data Banks, car makers, drones, factories, hospitals, retailers, robots and scientists use synthetic data today.

In a recent podcast, researchers from Ford described how they combine gaming engines and generative adversarial networks (GANs) to create synthetic data for AI training.

To optimize the process of how it makes cars, BMW created a virtual factory using NVIDIA Omniverse, a simulation platform that lets companies collaborate using multiple tools. The data BMW generates helps fine tune how assembly workers and robots work together to build cars efficiently.

Synthetic Data at the Hospital, Bank and Store Healthcare providers in fields such as medical imaging use synthetic data to train AI models while protecting patient privacy. For example, startup Curai trained a diagnostic model on 400,000 simul
LINK: https://blogs.nvidia.com/blog/2021/06/08/what-is-synthetic-data/...
See more stories from nvidia

Most recent headlines

04/08/2024

Dalet Appoints Santiago Solanas as CEO to Lead Next Era of Growth and Innovation

Dalet, a leading technology and service provider for media-rich organizations, is excited to announce Santiago Solanas as its new Chief Executive Officer (CEO)....

03/06/2024

Dalet and Veritone Reach Agreement to Distribute, Transact and Monetize Media Archives

Dalet, a leading technology and service provider for media-rich organizations, a...

30/05/2024

Screen Australia, Australians in Film and VicScreen announce FUTURE VISION

30 05 2024 - Media release Screen Australia, Australians in Film and VicScreen announce FUTURE VISION Lee Sung Jin and Joanna Calo Australians in Film, Scree...

30/05/2024

AI Benefits Come at a Cost

AI Benefits Come at a Cost Brie Clayton May 29, 2024 0 Comments After this, there is no turning back. You take the blue pill the story ends, you wa...

30/05/2024

Morgan Spurlock, Super Size Me' Director, Has Died

Morgan Spurlock, director of Super Size Me, died May 24 in New York. He was 53 and had been battling cancer....

30/05/2024

Departing Longtime Local Anchors Share Their Lessons Learned

A pair of beloved local anchors with extraordinary runs at their stations are finally stepping down. Tom Wills signs off at WJXT Jacksonville May 31, 49 years a...

30/05/2024

Inscape Founder Zeev Neumeier Launches GraySwan To Optimize CTV Ads

Zeev Neumeier, who founded Inscape and was a pioneer in automatic content recognition, is beta-testing his new venture, GraySwan....

30/05/2024

Robert De Niro To Get Service to America Award From NABLF

Robert De Niro will get the 2024 Service to America Leadership Award from the National Association of Broadcasters Leadership Foundation. The award is presented...

30/05/2024

AGT' is Back, With More Golden Buzzers and Youngest Contestant Ever

Season 19 of America's Got Talent is on NBC starting May 28. The new season features more Golden Buzzers, which send a standout act directly to the live sho...

30/05/2024

The Quiz With Balls', Game Show That Pits Brains Against Boldness, Starts on Fox

Jay Pharoah hosts The Quiz with Balls, a game show that Fox says pits brains ag...

30/05/2024

Netflix's Bridgerton' Keeps Top Spot in TVision Power Score Rankings

Netflix's Bridgerton (season three) repeated as the top show in TVision's Power Score ranking of programs on connected TV for the week of May 20....

30/05/2024

It's Not The Ads, It's How They're Delivered, New FreeWheel Study Finds

A new study from FreeWheel, Comcast's ad-tech unit, found that it's not ...

30/05/2024

First Blippi' FAST Channel Launches on Samsung TV Plus

Candle Media's Moonbug Entertainment unit said it launched its first free ad-supported streaming channel featuring its kid show Blippi on Samsung TV Plus....

30/05/2024

Kantar Names Nicole Gileadi Chief Strategy Officer and North America Managing Director

Kantar Media said it named Nicole Gileadi chief strategy officer and managing di...

29/05/2024

ST Engineering iDirect Next-Generation Hub Infrastructure Selected for Indonesia's First Multifunction Satellite

Highly scalable and flexible solution supporting Satria-1 satellite to facilitat...

29/05/2024

L3Harris Empowers Future Canadian Leaders Through the CILA Program

In 2018, Rich Foster, Vice President of L3Harris Canada, envisioned a transformative initiative to address the gender and diversity gap in science, technology, ...

29/05/2024

Charting the Future of the U.S. Navy

The christening of OUSV Vanguard, the U.S. Navys newest Unmanned Surface Vehicle, marks a pivotal moment in Naval technology. Developed through the joint Strate...

29/05/2024

EditShare Boosts Sales Direction With Alumnus Grant Carroll

EditShare Boosts Sales Direction With Alumnus Grant Carroll Long-term leader moves up to head sales in the Americas Boston, MA, May 29, 2024 - EditShare, the...

29/05/2024

Broadcasters Foundation of America Designates June 13 its Annual Giving Day

The Broadcasters Foundation of America has announced its annual Giving Day will take place Thursday, June 13. The campaign's purpose is to raise money to su...

29/05/2024

Grant Carroll Returns to EditShare as its New SVP-Americas

BOSTON EditShare has hired Grant Carroll as its new Senior Vice President for Sales for the Americas....

29/05/2024

Vizrt joins forces with Dalet to enhance newsroom operations

Vizrt joins forces with Dalet to enhance newsroom operations Brie Clayton May 29, 2024 0 Comments The integration between Dalet Galaxy five and Viz Pi...

29/05/2024

Tucson TV Stations Launch NextGen TV Services

TUSCON Six stations have launched NextGen TV, aka ATSC 3.0 broadcasts in the Tucson, Ariz., area....

29/05/2024

France Tlvisions Upgrades to Grass Valley Kaleido-IP Video Multiviewer

MONTREAL Grass Valley is reporting that French National Public TV Broadcaster France T l visions, rebranded as france tv, has selected its next-generation Kalei...

29/05/2024

John Abbot Joins Google Fiber as Its First CFO

MOUNTAIN VIEW, Calif. Google Fiber has announced that John Abbot has recently joined its team as the company's first chief financial officer (CFO)....

29/05/2024

Obsidian Lighting Control ONYX 4.10 Software Now Available

Obsidian Control Systems has introduced ONYX 4.10, the latest iteration of the popular lighting control software for NX consoles and PC systems....

29/05/2024

ZOO Establishes ZOO Italy, Launches Dubbing Studios in Milan

LONDON ZOO Digital, a global provider of localization and media services to the entertainment industry, has launched ZOO dubbing studios in Milan and establishe...

29/05/2024

Vizrt Integrates HTML Graphics System with Dalet News Production System

BERGEN, Norway Vizrt has integrated Viz Pilot Edge, the company's newsroom HTML-based templated graphics system, with the Dalet Galaxy five news production ...

29/05/2024

Elettroformati Audio Post House Installs PMC Monitors In...

Italian audio production company Elettroformati has chosen PMC monitors and an Avid management system for its new Dolby Atmos music mixing studio in Milan. Fo...

29/05/2024

Pixotope Enables Remote Intercontinental Camera Tracking...

Pixotope, the leading software platform for end-to-end real-time virtual production solutions, is breaking new ground by enabling remote real time virtual produ...

29/05/2024

Vizrt joins forces with Dalet to enhance newsroom operati...

Vizrt, the leader in real-time graphics and live production solutions for content creators, today announces that its flagship newsroom HTML-based templated grap...

29/05/2024

Ateme Leads TVRIs Transition to 4K UHD OTT Streaming

Ateme, the global leader in video compression, delivery and streaming solutions with innovation at its core, today announced TVRI s historic transition to 4K UH...

29/05/2024

WRAL-TV's Shrader, Holland Talk Historic Hurricane Forecast on WRAL Daily Download

NOAA (the National Oceanic and Atmospheric Administration) issued a forecast las...

29/05/2024

Riding the Wayve of AV 2.0, Driven by Generative AI

Generative AI is propelling AV 2.0, a new era in autonomous vehicle technology characterized by large, unified, end-to-end AI models capable of managing various...

29/05/2024

VEON appoints UHY LLP as auditors for VEON Group's 2023 PCAOB Audit and shares compliance plan with Nasdaq

29 May 2024 VEON appoints UHY LLP as auditors for VEON Group's 2023 PCAOB A...

29/05/2024

Scripps Spelling Bee Is Its Own Kind Of Sport - and Has Its Own Kind of Broadcast on Ion Television

Scripps Spelling Bee Is Its Own Kind Of Sport - and Has Its Own Kind of Broadcas...

29/05/2024

TikTok's Tim Edwards Talks Long Form Content, Monetization and the Power of Search

TikTok's Tim Edwards Talks Long Form Content, Monetization and the Power of ...

29/05/2024

PWHL Finals: Raycom Sports, Sky Candy Studios Deploy Live Drone Over the Ice for Decisive Game 5 in Boston

PWHL Finals: Raycom Sports, Sky Candy Studios Deploy Live Drone Over the Ice for...

29/05/2024

Introducing R&SGSACSM: The most advanced communications system monitoring solution for armed forces

Introducing R&S GSACSM: The most advanced communications system monitoring solut...

29/05/2024

Netflix and Yash Raj Films Announce Maharaj': A Story of One Man's Courage in Pre-Independence India', Premiering June 14

Back to All News Netflix and Yash Raj Films Announce Maharaj': A Story of ...

29/05/2024

IBM Study: 6 Hard Truths CEOs Must Face - As CEOs rush to adopt generative AI adoption, workforce and culture concerns intensify

LONDON, UK, 29 May 2024 A new study by the IBM (NYSE: IBM) Institute for Busin...

29/05/2024

Arvato Systems wins Gold again at the Service Provider Awards

Arvato Systems wins Gold again at the Service Provider Awards Award in the Managed Cloud Service Provider category Arvato Systems receives Gold as Managed C...

29/05/2024

RT Brings You to the Heart of the Action this June Bank Holiday Weekend

An action-packed weekend of live sport, including the Women's Euro 2025 Qualifier, the GAA Championship, URC Live and the Champions League Final Catch al...

29/05/2024

Tidy Tech: How Two Stanford Students Are Building Robots for Handling Household Chores

Imagine having a robot that could help you clean up after a party - or fold heap...

29/05/2024

Decoding How NVIDIA RTX AI PCs and Workstations Tap the Cloud to Supercharge Generative AI

Editor's note: This post is part of the AI Decoded series, which demystifies...

29/05/2024

Thales' FlytEDGE - the first cloud-based IFE in the world Winner of Crystal Cabin Award

Facebook Twitter LinkedIn The Crystal Cabin Award Association recognized T...

29/05/2024

VIZIO and Dolby Usher in Premium Sound Era For All

29 May 2024, 05:30 (PDT) VIZIO and Dolby Usher in Premium Sound Era For All With Dolby Atmos across its entire 2024 soundbar lineup, VIZIO and Dolby are lea...

29/05/2024

SWR moves to software playout with integrated Pixel Power solution from Rohde & Schwarz

SWR moves to software playout with integrated Pixel Power solution from Rohde & ...

28/05/2024

AI and Disinformation in Taiwan's 2024 Election

This is a summary of the report commissioned by Thomson on AI Disinformation Attacks during Taiwans 2024 Presidential Elections, written by Professor Chen-ling ...

28/05/2024

In A Violent Nature: Festivalgoers Look Through the Eyes of a Murderer

PARK CITY, UTAH - JANUARY 22: Chris Nash attends the 2024 Sundance Film Festival In A Violent Nature premiere at the Library Center Theatre on January 22, 202...

28/05/2024

Aerojet Rocketdyne Expanding Huntsville Operations to Increase Solid Rocket Motor Deliveries

Aerojet Rocketdyne's Advanced Manufacturing Facility opened in 2019. The com...