Table 4.

Sensitivity and Specificity of Single Perfect Amino Acid K-mer Matches as a Search Criterion

(A) Columns are for K sizes of 3–7. Rows represent various percentage identities between the homologous sequences. The table entries show the fraction of homologies detected as calculated from equation 3 assuming a homologous region of 33 amino acids. (B) K represents the size of the perfect match. F shows how many perfect matches of this size are expected to occur by chance according to equation 4 in a translated genome of 3 billion bases using a query of 167 amino acids (corresponding to 500 bases).

`BLAT`—The `BLAST`-Like Alignment Tool

BLAT—The BLAST-Like Alignment Tool