Academic Journal
Improvement of large copy number variant detection by whole genome nanopore sequencing
Title: | Improvement of large copy number variant detection by whole genome nanopore sequencing |
---|---|
Authors: | Javier Cuenca-Guardiola, Belén de la Morena-Barrio, Juan L. García, Alba Sanchis-Juan, Javier Corral, Jesualdo T. Fernández-Breis |
Source: | Journal of Advanced Research, Vol 50, Iss , Pp 145-158 (2023) |
Publisher Information: | Elsevier, 2023. |
Publication Year: | 2023 |
Collection: | LCC:Medicine (General) LCC:Science (General) |
Subject Terms: | Nanopore, Structural variant, Third-generation sequencing, SERPINC1, Medicine (General), R5-920, Science (General), Q1-390 |
More Details: | Introduction: Whole-genome sequencing using nanopore technologies can uncover structural variants, which are DNA rearrangements larger than 50 base pairs. Nanopore technologies can also characterize their boundaries with single-base accuracy, owing to the kilobase-long reads that encompass either full variants or their junctions. Other methods, such as next-generation short read sequencing or PCR assays, are limited in their capabilities to detect or characterize structural variants. However, the existing software for nanopore sequencing data analysis still reports incomplete variant sets, which also contain erroneous calls, a considerable obstacle for the molecular diagnosis or accurate genotyping of populations. Methods: We compared multiple factors affecting variant calling, such as reference genome version, aligner (minimap2, NGMLR, and lra) choice, and variant caller combinations (Sniffles, CuteSV, SVIM, and NanoVar), to find the optimal group of tools for calling large (>50 kb) deletions and duplications, using data from seven patients exhibiting gross gene defects on SERPINC1 and from a reference variant set as the control. The goal was to obtain the most complete, yet reasonably specific group of large variants using a single cell of PromethION sequencing, which yielded lower depth coverage than short-read sequencing. We also used a custom method for the statistical analysis of the coverage value to refine the resulting datasets. Results: We found that for large deletions and duplications (>50 kb), the existing software performed worse than for smaller ones, in terms of both sensitivity and specificity, and newer tools had not improved this. Our novel software, disCoverage, could polish variant callers’ results, improving specificity by up to 62% and sensitivity by 15%, the latter requiring other data or samples. Conclusion: We analyzed the current situation of >50-kb copy number variants with nanopore sequencing, which could be improved. The methods presented in this work could help to identify the known deletions and duplications in a set of patients, while also helping to filter out erroneous calls for these variants, which might aid the efforts to characterize a not-yet well-known fraction of genetic variability in the human genome. |
Document Type: | article |
File Description: | electronic resource |
Language: | English |
ISSN: | 2090-1232 |
Relation: | http://www.sciencedirect.com/science/article/pii/S2090123222002417; https://doaj.org/toc/2090-1232 |
DOI: | 10.1016/j.jare.2022.10.012 |
Access URL: | https://doaj.org/article/e0c75bf3cae14b18b5e67066ff4e599c |
Accession Number: | edsdoj.0c75bf3cae14b18b5e67066ff4e599c |
Database: | Directory of Open Access Journals |
FullText | Links: – Type: other Url: https://resolver.ebsco.com:443/public/rma-ftfapi/ejs/direct?AccessToken=4A7681684BD90D3B7AD4&Show=Object Text: Availability: 0 CustomLinks: – Url: https://www.doi.org/10.1016/j.jare.2022.10.012? Name: ScienceDirect (all content)-s8985755 Category: fullText Text: View record from ScienceDirect MouseOverText: View record from ScienceDirect – Url: https://resolver.ebsco.com/c/xy5jbn/result?sid=EBSCO:edsdoj&genre=article&issn=20901232&ISBN=&volume=50&issue=145-158&date=20230801&spage=145&pages=145-158&title=Journal of Advanced Research&atitle=Improvement%20of%20large%20copy%20number%20variant%20detection%20by%20whole%20genome%20nanopore%20sequencing&aulast=Javier%20Cuenca-Guardiola&id=DOI:10.1016/j.jare.2022.10.012 Name: Full Text Finder (for New FTF UI) (s8985755) Category: fullText Text: Find It @ SCU Libraries MouseOverText: Find It @ SCU Libraries – Url: https://doaj.org/article/e0c75bf3cae14b18b5e67066ff4e599c Name: EDS - DOAJ (s8985755) Category: fullText Text: View record from DOAJ MouseOverText: View record from DOAJ |
---|---|
Header | DbId: edsdoj DbLabel: Directory of Open Access Journals An: edsdoj.0c75bf3cae14b18b5e67066ff4e599c RelevancyScore: 1000 AccessLevel: 3 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 1000.24273681641 |
IllustrationInfo | |
Items | – Name: Title Label: Title Group: Ti Data: Improvement of large copy number variant detection by whole genome nanopore sequencing – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Javier+Cuenca-Guardiola%22">Javier Cuenca-Guardiola</searchLink><br /><searchLink fieldCode="AR" term="%22Belén+de+la+Morena-Barrio%22">Belén de la Morena-Barrio</searchLink><br /><searchLink fieldCode="AR" term="%22Juan+L%2E+García%22">Juan L. García</searchLink><br /><searchLink fieldCode="AR" term="%22Alba+Sanchis-Juan%22">Alba Sanchis-Juan</searchLink><br /><searchLink fieldCode="AR" term="%22Javier+Corral%22">Javier Corral</searchLink><br /><searchLink fieldCode="AR" term="%22Jesualdo+T%2E+Fernández-Breis%22">Jesualdo T. Fernández-Breis</searchLink> – Name: TitleSource Label: Source Group: Src Data: Journal of Advanced Research, Vol 50, Iss , Pp 145-158 (2023) – Name: Publisher Label: Publisher Information Group: PubInfo Data: Elsevier, 2023. – Name: DatePubCY Label: Publication Year Group: Date Data: 2023 – Name: Subset Label: Collection Group: HoldingsInfo Data: LCC:Medicine (General)<br />LCC:Science (General) – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22Nanopore%22">Nanopore</searchLink><br /><searchLink fieldCode="DE" term="%22Structural+variant%22">Structural variant</searchLink><br /><searchLink fieldCode="DE" term="%22Third-generation+sequencing%22">Third-generation sequencing</searchLink><br /><searchLink fieldCode="DE" term="%22SERPINC1%22">SERPINC1</searchLink><br /><searchLink fieldCode="DE" term="%22Medicine+%28General%29%22">Medicine (General)</searchLink><br /><searchLink fieldCode="DE" term="%22R5-920%22">R5-920</searchLink><br /><searchLink fieldCode="DE" term="%22Science+%28General%29%22">Science (General)</searchLink><br /><searchLink fieldCode="DE" term="%22Q1-390%22">Q1-390</searchLink> – Name: Abstract Label: Description Group: Ab Data: Introduction: Whole-genome sequencing using nanopore technologies can uncover structural variants, which are DNA rearrangements larger than 50 base pairs. Nanopore technologies can also characterize their boundaries with single-base accuracy, owing to the kilobase-long reads that encompass either full variants or their junctions. Other methods, such as next-generation short read sequencing or PCR assays, are limited in their capabilities to detect or characterize structural variants. However, the existing software for nanopore sequencing data analysis still reports incomplete variant sets, which also contain erroneous calls, a considerable obstacle for the molecular diagnosis or accurate genotyping of populations. Methods: We compared multiple factors affecting variant calling, such as reference genome version, aligner (minimap2, NGMLR, and lra) choice, and variant caller combinations (Sniffles, CuteSV, SVIM, and NanoVar), to find the optimal group of tools for calling large (>50 kb) deletions and duplications, using data from seven patients exhibiting gross gene defects on SERPINC1 and from a reference variant set as the control. The goal was to obtain the most complete, yet reasonably specific group of large variants using a single cell of PromethION sequencing, which yielded lower depth coverage than short-read sequencing. We also used a custom method for the statistical analysis of the coverage value to refine the resulting datasets. Results: We found that for large deletions and duplications (>50 kb), the existing software performed worse than for smaller ones, in terms of both sensitivity and specificity, and newer tools had not improved this. Our novel software, disCoverage, could polish variant callers’ results, improving specificity by up to 62% and sensitivity by 15%, the latter requiring other data or samples. Conclusion: We analyzed the current situation of >50-kb copy number variants with nanopore sequencing, which could be improved. The methods presented in this work could help to identify the known deletions and duplications in a set of patients, while also helping to filter out erroneous calls for these variants, which might aid the efforts to characterize a not-yet well-known fraction of genetic variability in the human genome. – Name: TypeDocument Label: Document Type Group: TypDoc Data: article – Name: Format Label: File Description Group: SrcInfo Data: electronic resource – Name: Language Label: Language Group: Lang Data: English – Name: ISSN Label: ISSN Group: ISSN Data: 2090-1232 – Name: NoteTitleSource Label: Relation Group: SrcInfo Data: http://www.sciencedirect.com/science/article/pii/S2090123222002417; https://doaj.org/toc/2090-1232 – Name: DOI Label: DOI Group: ID Data: 10.1016/j.jare.2022.10.012 – Name: URL Label: Access URL Group: URL Data: <link linkTarget="URL" linkTerm="https://doaj.org/article/e0c75bf3cae14b18b5e67066ff4e599c" linkWindow="_blank">https://doaj.org/article/e0c75bf3cae14b18b5e67066ff4e599c</link> – Name: AN Label: Accession Number Group: ID Data: edsdoj.0c75bf3cae14b18b5e67066ff4e599c |
PLink | https://login.libproxy.scu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsdoj&AN=edsdoj.0c75bf3cae14b18b5e67066ff4e599c |
RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1016/j.jare.2022.10.012 Languages: – Text: English PhysicalDescription: Pagination: PageCount: 14 StartPage: 145 Subjects: – SubjectFull: Nanopore Type: general – SubjectFull: Structural variant Type: general – SubjectFull: Third-generation sequencing Type: general – SubjectFull: SERPINC1 Type: general – SubjectFull: Medicine (General) Type: general – SubjectFull: R5-920 Type: general – SubjectFull: Science (General) Type: general – SubjectFull: Q1-390 Type: general Titles: – TitleFull: Improvement of large copy number variant detection by whole genome nanopore sequencing Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Javier Cuenca-Guardiola – PersonEntity: Name: NameFull: Belén de la Morena-Barrio – PersonEntity: Name: NameFull: Juan L. García – PersonEntity: Name: NameFull: Alba Sanchis-Juan – PersonEntity: Name: NameFull: Javier Corral – PersonEntity: Name: NameFull: Jesualdo T. Fernández-Breis IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 08 Type: published Y: 2023 Identifiers: – Type: issn-print Value: 20901232 Numbering: – Type: volume Value: 50 – Type: issue Value: 145-158 Titles: – TitleFull: Journal of Advanced Research Type: main |
ResultId | 1 |