The KA/KS Ratio Test for Assessing the Protein-Coding Potential of Genomic Regions: An Empirical and Simulation Study

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 1.
Figure 1.

(A) Distribution of exon lengths. Exons were stratified into six length classes. For example, the 100-bp class contains exons with lengths ranging from 75 to 125 bp. The white area of each bar represents the number of exons that showK A/K S significantly smaller than 1 (passing the test), whereas the shaded area corresponds to the number of exons that have K A/K S not statistically different from 1 (failing the test). The numbers above each bar indicates the ratio of the number of exons in the shaded area to the number of exons in the white area. For example, the group with the mean length 50 bp contains 81 exons; 34 of them did not pass theK A/K S test. (B) Relationship between human–mouse sequence divergence and the number of false negatives (exons that fail the test). Bars represent the proportions of exons in each of the five divergence classes. Points on the curve indicate the proportion of false negatives within each identity class. For example, ∼35% of exons in our dataset belong to a class in which divergence ranges from 10% to 15%. Within this class ,∼4% exons did not pass theK A/K S test.

This Article

  1. Genome Res. 12: 198-202

Preprint Server