Table 9.

Negative Correlation of KL distances and False-Negative Prediction Error Rates Produced by GeneMark in Short Fragment Identification

Model (order 3) B. subtilis E. coli M. genitalium M. jannaschii M. pneumoniae
False-negative error rate
AT0.1860.2230.1710.1180.165
T0.0980.0830.0930.0440.078
HT0.0540.0450.0670.0170.052
KL distance (coding vs. noncoding model)
AT0.06480.06190.08370.09990.0883
T0.09300.11020.08930.12990.1149
HT0.13180.16870.12890.19090.1587
R =−0.961−0.931−0.770−0.896−0.905

[i] For five genomes the error rates are shown along with the KL distance between corresponding coding and noncoding models of the third order. The correlation coefficients (bottom row) for A. fulgidus, H. influenzae, H. pylori, M. thermoautotrophicum, andSynechocystis, averaged with regard to A-, B-, and C-type models, were equal to −0.97, −0.91, −0.96, −0.96, and −0.99, respectively.