Table 3.

Comparison between DIAN and MGI Ontological Assignments

Results from the comparative approach are shown. A number of intrinsic problems were identified from this approach, such that type I and type II variances described here are for comparative purposes only and cannot be interpreted strictly as type I and II errors.
Table 3A. Comparing Assignments Made to the Cellular Role Ontology
Concept DIAN node number Highest level matching GO modes Present in Variation Sensitivity Selectivity
DIAN and GO DIAN only GO only Type I Type II
Chromosome structure1.1GO:0007001;GO:00063237440.2670.2670.6360.636
Transcription factors1.4GO:00037005954380.3580.2520.6080.522
DNA duplication3.2GO:0006260;GO:000396410220.1430.1430.8330.833
Cell-cell adhesion5.2GO:00071553515140.2340.2190.7140.700
Transcription factors9.1.1.1GO:00081351010.0000.5000.5001.000
Microtubule6.2GO:00070179020.0000.1820.8181.000
DNA repair8.1GO:000628114290.0800.3600.6090.875
Programmed cell death8.2GO:0006915147230.1590.5230.3780.667
Channel and transporter4.6GO:0006810;GO:0005216478270.0980.3290.6350.855
Amino acid metabolism9.2GO:00065194170.0830.4070.4760.625
Stress response8.4GO:000695056550.0910.8330.0830.455
Nucleotide metabolism9.4GO:0006140:00062050270.2220.7780.0000.000
GO:0006143
Cofactor metabolism9.5GO:00067314030.0000.4290.5711.000
Total DIAN and GO: 229
TotalDIAN only: 113
Total GO only: 214
Total: 556
Average type I: 0.203
Average type II: 0.385
Sensitivity: 0.517
Selectivity: 0.670
DIAN assignments made to a group of well-characterized, nonredundant mouse sequences were compared to assignments made by the MGI to the GO Process and Function ontologies. GO modes corresponding to DIAN nodes are listed, along with the abbreviated essential concept from theDIAN Role ontology. For brevity, only the highest level GO nodes are listed. The number of sequences whose assignment is shared to both sets of ontologies is indicated (DIAN and GO), as well as the number of sequence assignments which differed (DIANonly, GO only). These numbers are used to calculate Type I and II variation using the following equations: Type I variation =DIANonly/(DIAN and GO +DIAN only + GO only); Type II variation = GO only/(DIAN and GO +DIAN only + GO only); Sensitivity =DIAN and GO/(DIAN and GO + GO only); Selectivity =DIAN and GO/(DIAN and GO +DIAN only). Sensitivity is defined as the ability of the DIAN algorithm to make what are believed to be all possible correct assignments. Selectivity is defined as the ability of the DIANalgorithm to not make what is believed to be an incorrect assignment.
Table 3B. Comparing Assignments Made to the Protein Function Ontology
Concept DIAN node number Highest level matching GO modes Present in Variation Sensitivity Selectivity
DIAN and GO DIAN only GO only Type I Type II
Hormones and active peptides10GO:0005179;GO:00051037380.1670.4440.4670.700
GO:0005104;GO:0005105
GO:0005106;GO:0005109
GO:0005110;GO:0005111
GO:0005112;GO:0005113
GO:0005114;GO:0005115
GO:0005116;GO:0005117
GO:0005118;GO:0005119
GO:0005120;GO:0005121
GO:0005122;GO:0005123
GO:0005124;GO:0005177
GO:0005178;GO:0005186
Inhibitors12GO:0004857;GO:000818912390.1250.3750.5710.800
GO:0005074;GO:0005092
GO:0008200;GO:0005517
DNA or RNA associated proteins3GO:0003676;GO:000373525521260.0700.0860.9070.924
GO:0004748;GO:0003910
GO:0003911;GO:0004518
GO:0003899;GO:0008534
GO:0008263;GO:0003907
GO:0003905;GO:0003906
GO:0003904
GO:0004844;GO:0003908
Protein secretion and chaperones13GO:0003754;GO:000856511320.1880.1250.8460.786
GO:0006605
Electron transport proteins5GO:00054890760.5380.4620.0000.000
Other tranport proteins6GO:00052156217190.1730.1940.7650.785
Structural proteins7GO:00051983123400.2450.4260.4370.574
Receptors8GO:00048726743150.3440.1200.8170.609
Cytokines and growth factors9GO:0008083;GO:00051253519100.2970.1560.7780.648
GO:0008009
TotalDIAN and GO: 480
TotalDIAN only: 139
Total GO only: 135
Total: 754
Average type I: 0.184
Average type II: 0.179
Sensitivity: 0.780
Selectivity: 0.775

[i] A group of well-characterized, nonredundant mouse sequences were assigned to the Protein Function ontology by theDIAN domain-based mapping algorithm. These assignments were compared to assignments made to the GO Process and Function ontology by the MGI.