Summary of the Rule Model
| A. Annotations, rules, and classifications | |
| Annotated genes | |
| Within the 23 broad classes of GO biological process | 273 |
| Gene probes | |
| Associated with the 273 genes within the 23 broad biological process classes | 284 |
| Training examples | |
| Annotations associated with the genes in the 23 broad biological process classes | 549 |
| Coannotations[i]associated with the genes in the 23 broad biological process classes | 444 |
| Rules | |
| Generated from the training examples | 18064 |
| Estimated quality of classifications of unknown genes (cross-validation estimates) | |
| Sensitivity | 84% |
| Specificity | 91% |
| Fraction of classifications that are correct | 49% |
| Classifications for unknown (uncharacterized) genes | |
| Classifications were obtained for 211 of the 213 unknown genes | 548 |
| Reclassifications for training examples | 728 |
| True positive classifications | 519 |
| True positive coclassifications[ii] | 356 |
| False positive classifications | 219 |
| False negative (missing) classifications | 30 |
| For 272 of the 273 training examples at least one correct reclassification was obtained |
| B. Number of biological processes annotated or classified per gene | |||
| Number of biological processes per gene | Annotations for training example genes | Reclassifications for training example genes | Classifications for unknown genes |
| 1 | 105 | 30 | 27 |
| 2 | 100 | 93 | 84 |
| 3 | 41 | 96 | 59 |
| ≥4 | 27 | 54 | 41 |
[i] Pairs of two different biological processes annotated to the genes in the data set.
[ii] Classification of two different biological processes to one gene.