Table 2.

Summary of the Rule Model

A. Annotations, rules, and classifications
Annotated genes
 Within the 23 broad classes of GO biological process: 273
Gene probes
 Associated with the 273 genes within the 23 broad biological process classes: 284
Training examples
 Annotations associated with the genes in the 23 broad biological process classes: 549
 Coannotations[i] associated with the genes in the 23 broad biological process classes: 444
Rules
 Generated from the training examples: 18064
Estimated quality of classifications of unknown genes (cross-validation estimates)
 Sensitivity: 84%
 Specificity: 91%
 Fraction of classifications that are correct: 49%
Classifications for unknown (uncharacterized) genes
 Classifications were obtained for 211 of the 213 unknown genes: 548
Reclassifications for training examples: 728
 True positive classifications: 519
 True positive coclassifications[ii]: 356
 False positive classifications: 219
 False negative (missing) classifications: 30
For 272 of the 273 training examples at least one correct reclassification was obtained
B. Number of biological processes annotated or classified per gene
Number of biological processes per gene | Annotations for training example genes | Reclassifications for training example genes | Classifications for unknown genes
1 | 105 | 30 | 27
2 | 100 | 93 | 84
3 | 41 | 96 | 59
≥4 | 27 | 54 | 41

[i] Pairs of two different biological processes annotated to the genes in the data set.

[ii] Classification of two different biological processes to one gene.
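
The reclassification counts in part A can be turned into rates, and the columns of part B can be checked against the gene totals. The following is a minimal sketch and not part of the original analysis: the function and variable names are hypothetical, and the rates it computes describe reclassification of the training examples, so they are not comparable to the cross-validation estimates listed in the table. It uses the individual true positive and false positive counts rather than the listed total of 728.

```python
# Counts copied from part A (reclassifications for training examples).
TRUE_POSITIVES = 519
FALSE_POSITIVES = 219
FALSE_NEGATIVES = 30


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of known annotations that were recovered: TP / (TP + FN)."""
    return tp / (tp + fn)


def precision(tp: int, fp: int) -> float:
    """Fraction of reclassifications that were correct: TP / (TP + FP)."""
    return tp / (tp + fp)


# Counts copied from part B; list positions are 1, 2, 3 and >=4
# biological processes per gene. The dictionary keys are illustrative names.
PROCESSES_PER_GENE = {
    "annotations_training": [105, 100, 41, 27],
    "reclassifications_training": [30, 93, 96, 54],
    "classifications_unknown": [27, 84, 59, 41],
}

# Number of genes covered by each column of part B.
column_totals = {name: sum(counts) for name, counts in PROCESSES_PER_GENE.items()}

if __name__ == "__main__":
    print(f"training sensitivity: {sensitivity(TRUE_POSITIVES, FALSE_NEGATIVES):.1%}")
    print(f"training precision:   {precision(TRUE_POSITIVES, FALSE_POSITIVES):.1%}")
    for name, total in column_totals.items():
        print(f"{name}: {total} genes")
```

Under these assumptions, true positives plus false negatives equal the 549 training annotations, the two training columns of part B each sum to the 273 annotated genes, and the unknown-gene column sums to the 211 genes that received a classification.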