Resolving complex duplication variants in autism spectrum disorder using long-read genome sequencing

  1. Lars Feuk5
  1. 1Department of Molecular Medicine and Surgery, Karolinska Institutet, 171 76 Stockholm, Sweden;
  2. 2Department of Clinical Genetics and Genomics, Karolinska University Hospital, 171 77 Stockholm, Sweden;
  3. 3The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada;
  4. 4Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario M5G 1X8, Canada;
  5. 5Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, 751 85 Uppsala, Sweden;
  6. 6Department of Pediatrics and The Saban Research Institute, Children's Hospital Los Angeles, Keck School of Medicine of University of Southern California, Los Angeles, California 90033, USA;
  7. 7Discipline of Genetics, Faculty of Medicine, Memorial University of Newfoundland, St John's, Newfoundland and Labrador A1B 3V6, Canada;
  8. 8Department of Molecular Genetics, McLaughlin Centre, University of Toronto, Toronto, Ontario M5S 1A8, Canada
  • Corresponding author: lars.feuk{at}igp.uu.se
  • Abstract

    Rare or de novo structural variation, primarily in the form of copy number variants, is detected in 5%–10% of autism spectrum disorder (ASD) families. While complex structural variants involving duplications can generally be detected using microarray or short-read genome sequencing (GS), these methods frequently fail to characterize breakpoints at nucleotide resolution, requiring additional molecular methods for validation and fine-mapping. Here, we use Oxford Nanopore Technologies PromethION long-read GS to characterize complex genomic rearrangements (CGRs) involving large duplications that segregate with ASD in five families. In total, we investigated 13 CGR carriers and were able to resolve all breakpoint junctions at nucleotide resolution. While all breakpoints were identified, the precise genomic architecture of one rearrangement remained unresolved with three different potential structures. The findings in two families include potential fusion genes formed through duplication rearrangements, involving IL1RAPL1–DMD and SUPT16H–CHD8. In two of the families originating from the same geographical region, an identical rearrangement involving ANK2 was identified, which likely represents a founder variant. In addition, we analyze methylation status directly from the long-read data, allowing us to assess the activity of rearranged genes and regulatory regions. Investigation of methylation across the CGRs reveals aberrant methylation status in carriers across a rearrangement affecting the CREBBP locus. In aggregate, our results demonstrate the utility of nanopore sequencing to pinpoint CGRs associated with ASD in five unrelated families, and highlight the importance of a gene-centric description of disease-associated complex chromosomal rearrangements.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.279263.124.

    • Freely available online through the Genome Research Open Access option.

    • Received March 4, 2024.
    • Accepted September 27, 2024.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    Articles citing this article

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server