A Genome-Wide Survey of Human Pseudogenes

Table 2.

Cross-Validation for the Discrimination Between Genes and Pseudogenes from KA/KS Benchmark Distributions


Known test fractions

Training set

Estimated fractions of pseudogenes
Functional
Pseudogene
Functional
Pseudogene
Averagea
SDb
1000 0 659 1703 16.1 15
900 100 759 1603 109.6 24.8
800 200 859 1503 205.4 20.5
700 300 959 1403 310.2 23.7
600 400 1059 1303 401.9 19.7
500 500 1159 1203 500.8 21.3
400 600 1259 1103 600.5 24.2
300 700 1359 1003 694.7 24.1
200 800 1459 903 793.2 26.1
100 900 1559 803 893.2 26.4
0
1000
1659
703
983.9
20
  • a Average estimation of the 100 iterations

  • b Standard Deviation from the complete test set

This Article

  1. Genome Res. 13: 2559-2567

Preprint Server