High-coverage nanopore sequencing of samples from the 1000 Genomes Project to build a comprehensive catalog of human genetic variation

Jonas A Gustafson; Sophia B Gibson; Nikhita Damaraju; Miranda PG Zalusky; Kendra Hoekzema; David Twesigomwe; Lei Yang; Anthony A Snead; Phillip A Richmond; Wouter De Coster; Nathan D Olson; Andrea Guarracino; Qiuhui Li; Angela L Miller; Joy Goffena; Zachery B Anderson; Sophie HR Storz; Sydney A Ward; Maisha Sinha; Claudia Gonzaga-Jauregui; Wayne E Clarke; Anna O Basile; Andre Corvelo; Catherine E Reeves; Adrienne Helland; Rajeeva Lochan Musunuri; Mahler Revsine; Karynne E Patterson; Cate Paschal; Christina Zakarian; Sara Goodwin; Tanner D Jensen; Esther Robb; 1000 Genomes ONT Sequencing Consortium; University of Washington Center for Rare Disease Research (UW-CRDR); Genomics Research to Elucidate the Genetics of Rare Diseases (GREGoR) Consortium; W. Richard McCombie; Fritz J Sedlazeck; Justin M Zook; Stephen B Montgomery; Erik Garrison; Mikhail Kolmogorov; Michael C Schatz; Richard N McLaughlin, Jr.; Harriet Dashnow; Michael C Zody; Matthew Loose; Miten Jain; Evan E Eichler; Danny E. Miller

doi:10.1101/gr.279273.124

Resource

High-coverage nanopore sequencing of samples from the 1000 Genomes Project to build a comprehensive catalog of human genetic variation

Jonas A Gustafson ¹
Sophia B Gibson ¹
Nikhita Damaraju ²
Miranda PG Zalusky ¹
Kendra Hoekzema ¹
David Twesigomwe ³
Lei Yang ⁴
Anthony A Snead ⁵
Phillip A Richmond ⁶
Wouter De Coster ⁷
Nathan D Olson ⁸
Andrea Guarracino ⁹
Qiuhui Li ¹⁰
Angela L Miller ¹
Joy Goffena ¹
Zachery B Anderson ¹
Sophie HR Storz ¹
Sydney A Ward ¹
Maisha Sinha ¹
Claudia Gonzaga-Jauregui ¹¹
Wayne E Clarke ¹²
Anna O Basile ¹³
Andre Corvelo ¹³
Catherine E Reeves ¹³
Adrienne Helland ¹³
Rajeeva Lochan Musunuri ¹³
Mahler Revsine ¹⁰
Karynne E Patterson ¹
Cate Paschal ¹⁴
Christina Zakarian ¹
Sara Goodwin ¹⁵
Tanner D Jensen ¹⁶
Esther Robb ¹⁶
1000 Genomes ONT Sequencing Consortium
University of Washington Center for Rare Disease Research (UW-CRDR)
Genomics Research to Elucidate the Genetics of Rare Diseases (GREGoR) Consortium
W. Richard McCombie ¹⁵
Fritz J Sedlazeck ¹⁸
Justin M Zook ⁸
Stephen B Montgomery ¹⁶
Erik Garrison ¹⁹
Mikhail Kolmogorov ²⁰
Michael C Schatz ¹⁰
Richard N McLaughlin Jr. ²¹
Harriet Dashnow ²²
Michael C Zody ¹³
... [+41 authors] ...
Matthew Loose ²³
Miten Jain ²⁴
Evan E Eichler ²⁵
Danny E. Miller ²⁶
Show affiliations

- ¹ University of Washington;
- ² Institute for Public Health Genetics, University of Washington;
- ³ Sydney Brenner Institute for Molecular Bioscience, University of the Witwatersrand;
- ⁴ Pacific Northwest Research Institute;
- ⁵ New York University;
- ⁶ Alamya Health;
- ⁷ VIB Center for Molecular Neurology, University of Antwerp;
- ⁸ National Institute of Standards and Technology;
- ⁹ University of Tennessee Health Science Center, Human Technopole;
- ¹⁰ Johns Hopkins University;
- ¹¹ International Laboratory for Human Genome Research, Laboratorio Internacional de Investigacion sobre el Genoma Humano, Universidad Nacional Autonoma de Mexico;
- ¹² New York Genome Center, Outlier Informatics Inc.;
- ¹³ New York Genome Center;
- ¹⁴ Seattle Children's Hospital, University of Washington;
- ¹⁵ Cold Spring Harbor Laboratory;
- ¹⁶ Stanford University;
- ¹⁷ -;
- ¹⁸ Baylor College of Medicine, Rice University;
- ¹⁹ University of Tennessee Health Science Center;
- ²⁰ Cancer Data Science Laboratory, National Cancer Institute, NIH;
- ²¹ University of Washington, Pacific Northwest Research Institute;
- ²² University of Utah, University of Colorado School of Medicine;
- ²³ Deep Seq, University of Nottingham;
- ²⁴ Northeastern University;
- ²⁵ Brotman Baty Institute for Precision Medicine, Howard Hughes Medical Institute, University of Washington;
- ²⁶ Seattle Children's Hospital

Published October 2, 2024. https://doi.org/10.1101/gr.279273.124

Download PDF Cite Article Permissions

Current Issue:

June 2026, Vol. 36, No. 6

Focus view

Abstract

Fewer than half of individuals with a suspected Mendelian or monogenic condition receive a precise molecular diagnosis after comprehensive clinical genetic testing. Improvements in data quality and costs have heightened interest in using long-read sequencing (LRS) to streamline clinical genomic testing, but the absence of control datasets for variant filtering and prioritization has made tertiary analysis of LRS data challenging. To address this, the 1000 Genomes Project ONT Sequencing Consortium aims to generate LRS data from at least 800 of the 1000 Genomes Project samples. Our goal is to use LRS to identify a broader spectrum of variation so we may improve our understanding of normal patterns of human variation. Here, we present data from analysis of the first 100 samples, representing all 5 superpopulations and 19 subpopulations. These samples, sequenced to an average depth of coverage of 37x and sequence read N50 of 54 kbp, have high concordance with previous studies for identifying single nucleotide and indel variants outside of homopolymer regions. Using multiple structural variant (SV) callers, we identify an average of 24,543 high-confidence SVs per genome, including shared and private SVs likely to disrupt gene function as well as pathogenic expansions within disease-associated repeats that were not detected using short reads. Evaluation of methylation signatures revealed expected patterns at known imprinted loci, samples with skewed X-inactivation patterns, and novel differentially methylated regions. All raw sequencing data, processed data, and summary statistics are publicly available, providing a valuable resource for the clinical genetics community to discover pathogenic SVs.

Article contents

Article (Back to top)
- Abstract

Announcement(s)

Resource

High-coverage nanopore sequencing of samples from the 1000 Genomes Project to build a comprehensive catalog of human genetic variation

Cite this article

Share

Current Issue:

Abstract

Article contents

Announcement(s)