Table 1.

Genome assembly statistics for all genomes used in this study

SpeciesStrainSubtypeSourceCitationScaffolds (#)Size (Mb)N50 (kb)GC content (% GC)BUSCO contentaProtein-coding genes
Blastocystis sp.JDR1HumanThis study11417.555054.5%61.2%7891
Blastocystis sp.NandII1HumanThis study5416.093954.7%61.2%7157
Blastocystis sp.NMH3HumanThis study3614.658251.3%58.0%6571
Blastocystis sp.DL3HumanThis study5614.050651.6%55.7%6154
Blastocystis sp.BT14HumanThis study3415.388239.4%57.6%7421
Blastocystis sp.Rus-BN/ARussian tortoiseThis study3333.3132323.1%43.2%8547
Blastocystis sp.Rus-SN/ARussian tortoiseThis study25826.150521.8%42.0%8501
Blastocystis sp.HermannsN/AHermann's tortoiseThis study22927.128921.5%42.4%8349
Blastocystis sp.WR14Laboratory Wistar ratNo publication130112.93039.5%56.5%5707
Blastocystis sp.NandII1HumanGentekaki et al. 201758016.57955.0%61.1%6544
Blastocystis sp.Singapore7HumanDenoeud et al. 20115418.8901b45.0%52.2%6020
Proteromonas lacertaeLAN/ASand lizardZáhonová et al. 2023c144752.29327.0%62.7%23,189
Cafeteria burkhardaeBVIN/AEnvironmentalHackl et al. 202016936.346570.0%54.9%8591

[i] aBUSCO version 5.6.1 with eukaryote_odb10 lineage, metaeuk genome mode.

[ii] bGapped scaffold N50.

[iii] cCitation for genome only. Annotations not publicly available and generated for comparative genomics in this work.