Whole genome and exome sequencing reference datasets from a multi-center and cross-platform benchmark study.

Bibliographic Details
Title: Whole genome and exome sequencing reference datasets from a multi-center and cross-platform benchmark study.
Authors: Zhao, Yongmei, Fang, Li Tai, Shen, Tsai-wei, Choudhari, Sulbha, Talsania, Keyur, Chen, Xiongfong, Shetty, Jyoti, Kriga, Yuliya, Tran, Bao, Zhu, Bin, Chen, Zhong, Chen, Wanqiu, Wang, Charles, Jaeger, Erich, Meerzaman, Daoud, Lu, Charles, Idler, Kenneth, Ren, Luyao, Zheng, Yuanting, Shi, Leming
Source: Scientific Data; 11/9/2021, Vol. 8 Issue 1, p1-14, 14p
Subject Terms: WHOLE genome sequencing, GENETIC mutation, SOMATIC mutation, EARLY detection of cancer, INDIVIDUALIZED medicine, GENOMES
Abstract: With the rapid advancement of sequencing technologies, next generation sequencing (NGS) analysis has been widely applied in cancer genomics research. More recently, NGS has been adopted in clinical oncology to advance personalized medicine. Clinical applications of precision oncology require accurate tests that can distinguish tumor-specific mutations from artifacts introduced during NGS processes or data analysis. Therefore, there is an urgent need to develop best practices in cancer mutation detection using NGS and the need for standard reference data sets for systematically measuring accuracy and reproducibility across platforms and methods. Within the SEQC2 consortium context, we established paired tumor-normal reference samples and generated whole-genome (WGS) and whole-exome sequencing (WES) data using sixteen library protocols, seven sequencing platforms at six different centers. We systematically interrogated somatic mutations in the reference samples to identify factors affecting detection reproducibility and accuracy in cancer genomes. These large cross-platform/site WGS and WES datasets using well-characterized reference samples will represent a powerful resource for benchmarking NGS technologies, bioinformatics pipelines, and for the cancer genomics studies. Measurement(s) Somatic Mutation Analysis Technology Type(s) whole genome sequencing • Whole Exome Sequencing Factor Type(s) sequencing platform • sample prepration • library preparation • bioinformatics method Sample Characteristic - Organism Homo sapiens Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.16713655 [ABSTRACT FROM AUTHOR]
Copyright of Scientific Data is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
More Details
ISSN:20524463
DOI:10.1038/s41597-021-01077-5
Published in:Scientific Data
Language:English