Table 1.

Results on the Test Sets

Test set Nucleotide level Exon level
SN SP AC SN SP (SN+SP)/2 ME WE
The ROSETTA set
 ROSETTA0.9350.9780.9490.8330.8290.8310.0480.047
 SGP-10.9400.9600.9400.7000.7600.7300.1200.040
 SLAM0.9510.9810.9600.7830.7550.7690.0380.057
 TWINSCAN.p0.9600.9410.9400.8550.8240.8400.0450.081
 TWINSCAN0.9840.8890.9230.8390.7670.8030.0340.118
 GENSCAN0.9750.9080.9290.8170.7700.7930.0570.107
HoxA
 SLAM0.8520.8960.8640.7270.5330.6300.0000.333
 TWINSCAN.p0.9760.8290.8960.7730.5310.6520.0000.312
 TWINSCAN0.9490.5110.7040.5910.1730.3820.0000.707
 SGP-20.6400.6370.6190.4090.1730.2910.0910.596
 GENSCAN0.9320.6870.7960.5450.2350.3900.0000.569
Elastin
 SLAM0.8760.9810.9260.8020.8590.8310.1210.059
 TWINSCAN.p0.9420.9500.9450.8790.8890.8840.0660.056
 TWINSCAN0.9330.8770.9030.8350.8260.8310.1100.120
 SGP-20.7550.9980.8730.5930.9000.2910.3520.017
 GENSCAN0.9470.7660.8520.8350.7310.7830.1210.231

[i] The measures of sensitivity SN = TP/TP + FN and specificity SP = TP/TP + FP (where TP = true positives, TN = true negatives, FP = false positives and FN = false negatives) are shown at both the nucleotide and exon level. ME is entirely missed exons, WE is wrong exons, and the approximate correlation AC = 1/2 (TP/TP + FN + TP/TP + FP + TN/TN + FP + TN/TN + FN) − 1 summarizes the overall nucleotide sensitivity and specificity by one number. Within each of the three data sets the methods are divided into three classes: those operating on a syntenic DNA pair, those operating on a human sequence using as evidence matches against a database of mouse sequences, and a single-organism gene finder (GENSCAN).