Phased nanopore assembly with Shasta and modular graph phasing with GFAse

  1. Benedict Paten1
  1. 1UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA;
  2. 2Department of Bioengineering, Department of Physics, Northeastern University, Boston, Massachusetts 02120, USA;
  3. 3Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA;
  4. 4The Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, New York 10027, USA;
  5. 5Howard Hughes Medical Institute, Columbia University, New York, New York 10032, USA;
  6. 6Oxford Nanopore Technologies Incorporated, New York, New York 10013, USA;
  7. 7Chan Zuckerberg Initiative Foundation, Redwood City, California 94063, USA;
  8. 8Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20894, USA;
  9. 9Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA;
  10. 10Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
  • Corresponding authors: rlorigro{at}ucsc.edu, pacarnev{at}ucsc.edu, bpaten{at}ucsc.edu
  • Abstract

    Reference-free genome phasing is vital for understanding allele inheritance and the impact of single-molecule DNA variation on phenotypes. To achieve thorough phasing across homozygous or repetitive regions of the genome, long-read sequencing technologies are often used to perform phased de novo assembly. As a step toward reducing the cost and complexity of this type of analysis, we describe new methods for accurately phasing Oxford Nanopore Technologies (ONT) sequence data with the Shasta genome assembler and a modular tool for extending phasing to the chromosome scale called GFAse. We test using new variants of ONT PromethION sequencing, including those using proximity ligation, and show that newer, higher accuracy ONT reads substantially improve assembly quality.

    Footnotes

    • Received July 19, 2023.
    • Accepted March 19, 2024.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents

    Preprint Server