
k-mer content as a function of genome length across organisms. (A) Number of different k-mers observed as a function of genome size for k-mer lengths of 15 bp, 16 bp, and 17 bp. (B) Number of expected versus the number of observed k-mers for each reference genome for k-mer lengths of 15 bp, 16 bp, and 17 bp. Viral, archaeal, bacterial, and eukaryotic genomes are colored pink, blue, yellow, and green, respectively, across the A and B figure panels. (C) GC content percentage of nucleic prime sequences. (D) Average number of GpC and CpG occurrences per prime. Error bars in D represent SD. (E) Number of quasi-primes detected in each reference genome.











