Table 8.

Pairs of Biological Processes With High Frequency of Coclassifications to the Same Gene

Pairs of processes A Genes with coclassification B Genes with coclassification and coannotation C Genes with coclassification, but no coannotation D Genes with coannotation but no coclassification E Genes without coclassification and coannotation F P-value
DNA metabolism–cell cycle25121302197.36E-14
Transcription–oncogenesis2251722207.08E-05
Cell organization and biogenesis–cell cycle1813532233.58E-15
Transcription–cell proliferation1810822241.59E-11
Transcription–cell cycle1871122241.13E-07
Cell motility–defense (immune) response1611522269.00E-14
DNA metabolism–transcription159612286.72E-12
Oncogenesis–cell proliferation145932271.47E-05
Defense (immune) response–cell proliferation137642275.52E-08
Transcription–intracellular signaling cascade1321132282.40E-02
Cell motility–cell adhesion1210212314.21E-15
Protein metabolism and modification–stress response125702321.15E-07
Protein metabolism and modification–cell organization and biogenesis116502331.68E-09
Cell motility–cell proliferation116522314.53E-08
Cell proliferation–intracellular signaling cascade114712321.12E-05
Transcription–lipid metabolism1111002334.51E-02

[i] Pairs of biological processes that were classified to the same gene for at least 11 different genes are shown.

[ii] The dependence between the coannotations and the coclassifications was tested with Fisher's exact test (see, e.g., Everitt 1992). A 2 × 2 contingency table was constructed for each process pair, and values in this table appear in columns B–F. The number of genes without a coannotation and a coclassification for a pair was computed by subtracting numbers in the other three columns from the total number of genes with at least 2 annotations or classifications. TheP-value appears in column F. All but two pairs (transcription–intracellular signaling cascade and transcription–lipid metabolism) were significant at the 0.0001 level.