Seamless, rapid, and accurate analyses of outbreak genomic data using split k-mer analysis

Figure 2.

Average recall of SKA2 in simulations across increasing sequence divergence between a pair of sequences (πn or SNPs per site). Lines show recall using different split k-mer lengths k. (Left) Recall when allowing ambiguous bases, showing typical divergence thresholds used to define species, strain, and lineage boundaries. (Right) Recall when requiring exact matches of the middle base, with inset showing recall over the within-lineage range.

This Article

  1. Genome Res. 34: 1661-1673
