Third-generation sequencing revises the molecular karyotype for Toxoplasma gondii and identifies emerging copy number variants in sexual recombinants

  1. Jon P. Boyle1
  1. 1Department of Biological Sciences, Dietrich School of Arts and Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, USA;
  2. 2Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA;
  3. 3La Jolla Institute for Immunology, La Jolla, California 92037, USA;
  4. 4University of Texas-Southwestern, Dallas, Texas 75390, USA;
  5. 5Department of Molecular, Cell and Systems Biology, College of Agricultural and Life Sciences, University of California-Riverside, Riverside, California 92521, USA;
  6. 6School of Medicine, University of California-San Diego, La Jolla, California 92093, USA
  • Corresponding author: boylej{at}pitt.edu
  • Abstract

    Toxoplasma gondii is a useful model for intracellular parasitism given its ease of culture in the laboratory and genomic resources. However, as for many other eukaryotes, the T. gondii genome contains hundreds of sequence gaps owing to repetitive and/or unclonable sequences that disrupt the assembly process. Here, we use the Oxford Nanopore Minion platform to generate near-complete de novo genome assemblies for multiple strains of T. gondii and its near relative, N. caninum. We significantly improved T. gondii genome contiguity (average N50 of ∼6.6 Mb) and added ∼2 Mb of newly assembled sequence. For all of the T. gondii strains that we sequenced (RH, ME49, CTG, II×III progeny clones CL13, S27, S21, S26, and D3X1), the largest contig ranged in size between 11.9 and 12.1 Mb in size, which is larger than any previously reported T. gondii chromosome, and found to be due to a consistent fusion of Chromosomes VIIb and VIII. These data were validated by mapping existing T. gondii ME49 Hi-C data to our assembly, providing parallel lines of evidence that the T. gondii karyotype consists of 13, rather than 14, chromosomes. By using this technology, we also resolved hundreds of tandem repeats of varying lengths, including in well-known host-targeting effector loci like rhoptry protein 5 (ROP5) and ROP38. Finally, when we compared T. gondii with N. caninum, we found that although the 13-chromosome karyotype was conserved, extensive, previously unappreciated chromosome-scale rearrangements had occurred in T. gondii and N. caninum since their most recent common ancestry.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.262816.120.

    • Freely available online through the Genome Research Open Access option.

    • Received February 26, 2020.
    • Accepted February 3, 2021.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    Articles citing this article

    Related Articles

    OPEN ACCESS ARTICLE

    Preprint Server