DIAN: A Novel Algorithm for Genome Ontological Classification

Table 1.

Example Queries that Cannot be Resolved Accurately by Conventional Querying Systems

Each case lists the source record (identifier and annotation), followed by the DIAN system annotation in each classification scheme.

Case 1: Novel gene with no predicted function
  Source record
    Identifier: UniGene Hs. 104305
    Annotation: None
  DIAN system annotation
    EGAD cellular role: • cell/organism defense • homeostasis • apoptosis • cell division • apoptosis
    Protein function: None
    Enzyme classification: None
    DoubleTwist biological role: • Non-immune cell defense • Apoptosis
    Structure (SCOP): • All alpha proteins • DEATH domain • DEATH domain • DEATH domain

Case 2: Hypothetical gene with predicted function
  Source record
    Identifier: SWISS-PROT P39959
    Annotation: Putative Zinc Protein
  DIAN system annotation
    EGAD cellular role: • gene/protein expression • RNA synthesis • transcription factors
    Protein function: • DNA or RNA associated proteins
    Enzyme classification: None
    DoubleTwist biological role: • Genome structure and Gene expression • Transcription factors
    Structure (SCOP): • Small proteins • Classic zinc finger, C2H2 • Classic zinc finger, C2H2 • Classic zinc finger, C2H2

Case 3: Known gene with novel predicted function
  Source record
    Identifier: UniGene Hs. 2157
    Annotation: Wiskott-Aldrich syndrome protein; OMIM: 30100; SWISS-PROT: P42768
  DIAN system annotation
    EGAD cellular role: • cell division • apoptosis
    Protein function: • Enzymes • Transferase • Post-translational modifications
    Enzyme classification: • Transferases
    DoubleTwist biological role: • Non-immune cell defense • Apoptosis
    Structure (SCOP): • All beta proteins • PH domain-like • PH domain-like • Enabled/VASP homology 1 domain (EVH1 domain)

  • Three illustrative cases of records that cannot be returned by conventional keyword-based querying systems but that were returned by DIAN are described here.

This Article

  1. Genome Res. 11: 1766-1779
