Improvement of large copy number variant detection by whole genome nanopore sequencing

Bibliographic Details
Title: Improvement of large copy number variant detection by whole genome nanopore sequencing
Authors: Javier Cuenca-Guardiola, Belén de la Morena-Barrio, Juan L. García, Alba Sanchis-Juan, Javier Corral, Jesualdo T. Fernández-Breis
Source: Journal of Advanced Research, Vol 50, Pp 145-158 (2023)
Publisher Information: Elsevier, 2023.
Publication Year: 2023
Collection: LCC:Medicine (General); LCC:Science (General)
Subject Terms: Nanopore, Structural variant, Third-generation sequencing, SERPINC1, Medicine (General), R5-920, Science (General), Q1-390
More Details: Introduction: Whole-genome sequencing using nanopore technologies can uncover structural variants, which are DNA rearrangements larger than 50 base pairs. Nanopore technologies can also characterize their boundaries with single-base accuracy, owing to the kilobase-long reads that encompass either full variants or their junctions. Other methods, such as next-generation short-read sequencing or PCR assays, are limited in their ability to detect or characterize structural variants. However, the existing software for nanopore sequencing data analysis still reports incomplete variant sets that also contain erroneous calls, a considerable obstacle for molecular diagnosis or accurate genotyping of populations.
Methods: We compared multiple factors affecting variant calling, such as reference genome version, choice of aligner (minimap2, NGMLR, and lra), and variant caller combinations (Sniffles, CuteSV, SVIM, and NanoVar), to find the optimal group of tools for calling large (>50 kb) deletions and duplications, using data from seven patients exhibiting gross gene defects in SERPINC1 and from a reference variant set as the control. The goal was to obtain the most complete, yet reasonably specific, group of large variants using a single PromethION flow cell, which yielded a lower depth of coverage than short-read sequencing. We also used a custom method for the statistical analysis of the coverage value to refine the resulting datasets.
Results: We found that for large deletions and duplications (>50 kb), the existing software performed worse than for smaller ones, in terms of both sensitivity and specificity, and newer tools had not improved this. Our novel software, disCoverage, could polish variant callers’ results, improving specificity by up to 62% and sensitivity by 15%, although the sensitivity gain requires other data or samples.
Conclusion: We analyzed the current state of calling >50-kb copy number variants with nanopore sequencing and found that it could be improved. The methods presented in this work could help to identify the known deletions and duplications in a set of patients, while also helping to filter out erroneous calls for these variants, which might aid the efforts to characterize a still poorly understood fraction of genetic variability in the human genome.
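Illustrative note: the abstract mentions refining variant calls through a statistical analysis of coverage but does not describe how disCoverage does it. The Python sketch below is therefore not the authors' method. It only shows the general idea of checking a candidate deletion or duplication against read depth, assuming depth has already been summarized as a list of per-bin values. The function names (`depth_ratio`, `supports_call`) and the thresholds (0.75 and 1.25, midway between a neutral ratio of 1.0 and the roughly 0.5 or 1.5 expected for a heterozygous deletion or duplication) are hypothetical choices made for illustration.

```python
from statistics import mean


def depth_ratio(depth, start, end, flank=None):
    """Return mean depth inside [start, end) divided by mean depth of the flanks.

    `depth` is a list of per-bin read depths (an assumed input format).
    `flank` is the number of bins taken on each side; it defaults to the
    length of the candidate region.
    """
    length = end - start
    flank = length if flank is None else flank
    inside = depth[start:end]
    left = depth[max(0, start - flank):start]
    right = depth[end:end + flank]
    outside = left + right
    if not inside or not outside:
        raise ValueError("region or flanks fall outside the depth list")
    background = mean(outside)
    if background == 0:
        raise ValueError("no coverage in the flanking bins")
    return mean(inside) / background


def supports_call(svtype, ratio, del_max=0.75, dup_min=1.25):
    """Say whether a depth ratio is consistent with a DEL or DUP call.

    The cut-offs are illustrative assumptions, not values from the paper.
    """
    if svtype == "DEL":
        return ratio <= del_max
    if svtype == "DUP":
        return ratio >= dup_min
    raise ValueError(f"unsupported variant type: {svtype}")


# Toy example: ten bins at half depth, flanked by bins at normal depth.
toy_depth = [20] * 10 + [10] * 10 + [20] * 10
toy_ratio = depth_ratio(toy_depth, 10, 20)
keep_deletion = supports_call("DEL", toy_ratio)
```

In this toy case the candidate region has half the depth of its flanks, so a deletion call would be kept and a duplication call over the same interval would be discarded. A real analysis would also need to handle coverage noise, mappability, and sample-specific baselines, none of which this sketch attempts.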
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2090-1232
Relation: http://www.sciencedirect.com/science/article/pii/S2090123222002417; https://doaj.org/toc/2090-1232
DOI: 10.1016/j.jare.2022.10.012
Access URL: https://doaj.org/article/e0c75bf3cae14b18b5e67066ff4e599c
Accession Number: edsdoj.0c75bf3cae14b18b5e67066ff4e599c
Database: Directory of Open Access Journals
PubType: Academic Journal
Publication Date: 2023-08-01
Page Count: 14