A synthetic dataset primer for the biobehavioural sciences to promote reproducibility and hypothesis generation

Bibliographic Details
Title: A synthetic dataset primer for the biobehavioural sciences to promote reproducibility and hypothesis generation
Authors: Daniel S Quintana
Source: eLife, Vol 9 (2020)
Publisher Information: eLife Sciences Publications Ltd, 2020.
Publication Year: 2020
Collection: LCC:Medicine
LCC:Science
LCC:Biology (General)
Subject Terms: meta-research, data, statistics, Medicine, Science, Biology (General), QH301-705.5
More Details: Open research data provide considerable scientific, societal, and economic benefits. However, disclosure risks can sometimes limit the sharing of open data, especially in datasets that include sensitive details or information from individuals with rare disorders. This article introduces the concept of synthetic datasets, which is an emerging method originally developed to permit the sharing of confidential census data. Synthetic datasets mimic real datasets by preserving their statistical properties and the relationships between variables. Importantly, this method also reduces disclosure risk to essentially nil as no record in the synthetic dataset represents a real individual. This practical guide with accompanying R script enables biobehavioural researchers to create synthetic datasets and assess their utility via the synthpop R package. By sharing synthetic datasets that mimic original datasets that could not otherwise be made open, researchers can ensure the reproducibility of their results and facilitate data exploration while maintaining participant privacy.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2050-084X
Relation: https://elifesciences.org/articles/53275; https://doaj.org/toc/2050-084X
DOI: 10.7554/eLife.53275
Access URL: https://doaj.org/article/70798b7c6e6f4ab4af1c0a3ca5beb430
Accession Number: edsdoj.70798b7c6e6f4ab4af1c0a3ca5beb430
Database: Directory of Open Access Journals
More Details
ISSN:2050084X
DOI:10.7554/eLife.53275
Published in:eLife
Language:English