Table 2.

Assessment of the Annotation Set

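| | Total annotation set | | | | Coding genes only | | | | ORF of coding genes | | | | Pseudogenes | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | Sn | Sp | G | E | Sn | Sp | G | E | Sn | Sp | G | E | Sn | Sp | G | E |
| cDNA | | | | | | | | | | | | | | | | |
| dbEST | 0.77 | 0.42 | 0.97 | 0.92 | 0.80 | 0.32 | 0.98 | 0.92 | 0.85 | 0.17 | 0.98 | 0.93 | 0.71 | 0.08 | 0.97 | 0.93 |
| EMBL_vertrna | 0.69 | 0.79 | 0.88 | 0.87 | 0.74 | 0.62 | 0.88 | 0.87 | 0.87 | 0.36 | 0.92 | 0.91 | 0.62 | 0.14 | 0.92 | 0.87 |
| Human | 0.68 | 0.79 | 0.82 | 0.83 | | | | | | | | | | | | |
| Mouse | 0.10 | 0.96 | 0.49 | 0.28 | | | | | | | | | | | | |
| Other | 0.11 | 0.93 | 0.38 | 0.24 | | | | | | | | | | | | |
| Protein databases | 0.55 | 0.31 | 0.91 | 0.89 | 0.58 | 0.24 | 0.93 | 0.90 | 0.94 | 0.19 | 0.98 | 0.94 | 0.55 | 0.06 | 0.91 | 0.87 |
| Comparative | | | | | | | | | | | | | | | | |
| Mus blat | 0.29 | 0.66 | 0.77 | 0.59 | 0.33 | 0.55 | 0.81 | 0.60 | 0.60 | 0.49 | 0.88 | 0.64 | 0.24 | 0.11 | 0.73 | 0.59 |
| Mus exo | 0.26 | 0.57 | 0.79 | 0.54 | 0.28 | 0.45 | 0.80 | 0.53 | 0.49 | 0.40 | 0.88 | 0.56 | 0.25 | 0.11 | 0.79 | 0.60 |
| Exofish | 0.13 | 0.92 | 0.59 | 0.38 | 0.15 | 0.76 | 0.64 | 0.38 | 0.29 | 0.75 | 0.72 | 0.42 | 0.12 | 0.16 | 0.54 | 0.45 |
| Prediction | | | | | | | | | | | | | | | | |
| Genscan | 0.39 | 0.68 | 0.79 | 0.79 | 0.46 | 0.59 | 0.90 | 0.82 | 0.88 | 0.56 | 0.95 | 0.87 | 0.24 | 0.08 | 0.61 | 0.57 |
| fgenesh | 0.35 | 0.76 | 0.70 | 0.75 | 0.42 | 0.67 | 0.82 | 0.79 | 0.82 | 0.63 | 0.89 | 0.85 | 0.19 | 0.08 | 0.48 | 0.51 |
| Multiple | | | | | | | | | | | | | | | | |
| GS+blat[ii] | 0.27 | 0.95 | 0.62 | 0.49 | 0.32 | 0.85 | 0.75 | 0.52 | 0.65 | 0.83 | 0.85 | 0.58 | 0.13 | 0.09 | 0.38 | 0.30 |
| Fish + mouse[iii] | 0.11 | 0.95 | 0.54 | 0.31 | 0.12 | 0.79 | 0.59 | 0.31 | 0.25 | 0.78 | 0.67 | 0.34 | 0.09 | 0.16 | 0.47 | 0.35 |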

[i] Note that for the subdivided data, the specificity calculation includes all matches and so will be reduced relative to the total annotation. Comparisons of specificity are therefore only fair within columns.

[ii] GenScan exons aligned with a blat mouse match.

[iii] Exofish matches that align with a mouse match from either method.

[iv] (Sn) Sensitivity (including all immunoglobulin λ [IGL] segments); (Sp) specificity (including all IGL segments); (G) gene hits (only counting immunoglobulin λ constant [IGLC] segments); (E) exon hits (only counting IGLC segments).
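
The effect described in note [i] can be illustrated with a small sketch. The exact definitions used to compute Sn and Sp are not given in this table, so the code below assumes a simplified reading: sensitivity is the fraction of annotated features hit by at least one match, and specificity is the fraction of all matches that fall on an annotated feature. The function names and the toy feature IDs are hypothetical and are not values from Table 2.

```python
def sensitivity(annotated, matched):
    """Fraction of annotated features hit by at least one match (assumed definition)."""
    return len(annotated & matched) / len(annotated)


def specificity(annotated, matched):
    """Fraction of all matches that fall on an annotated feature (assumed definition)."""
    return len(annotated & matched) / len(matched)


# Hypothetical toy data: each match is identified by the feature it hits.
# "x1" and "x2" stand for matches that hit nothing in the annotation.
coding = {"c1", "c2", "c3", "c4"}
pseudo = {"p1", "p2"}
total = coding | pseudo
matches = {"c1", "c2", "c3", "p1", "x1", "x2"}

# The denominator of specificity is always the full set of matches,
# so restricting the annotation to a subset can only lower the value.
sp_total = specificity(total, matches)    # 4 of 6 matches hit the annotation
sp_coding = specificity(coding, matches)  # 3 of 6 matches hit a coding gene
sp_pseudo = specificity(pseudo, matches)  # 1 of 6 matches hits a pseudogene

# Sensitivity uses the subset itself as the denominator, so it is not
# systematically reduced by the subdivision.
sn_total = sensitivity(total, matches)
sn_coding = sensitivity(coding, matches)
sn_pseudo = sensitivity(pseudo, matches)
```

Under these assumptions the specificity for each subset is lower than for the total annotation even though the matches themselves are unchanged, which is why specificity values are only comparable within a column.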