Table 4.

Comparison of Various Prediction Methods on Selected Domain Groups

Domain type (number in Swiss-Prot 38)[i] Number of errors[ii]
  ANN (present method) PROSITE PRINTS PFAM
EGF (340)31210817
Fibronectin type 3 (241)2n.a.277
Trypsin (273)02600
ANK-repeat (119) 0 n.a.[iii] n.a. 9
WD-repeat (247)725158
Cytochrome C (78) 8 4 5 3

[i] The total number of sequences that contain the given domain type in Swiss-Prot 38. The release numbers of PROSITE, PRINTS, and PFAM are those used by Swiss-Prot 38.

[ii] Errors for PROSITE, PRINTS, and PFAM were determined from the Swiss-Prot annotation: a sequence annotated as having a given domain (e.g., an EGF repeat) but lacking a cross-reference to a given pattern database was considered not detected by the corresponding method. These numbers therefore count only false negatives, whereas the errors of ANN include both false positives and false negatives.

[iii] n.a. = Not available (i.e., the domain type is not included in the corresponding method).
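
The false-negative count described in note [ii] can be sketched as follows. This is a minimal illustration only, assuming each Swiss-Prot entry has already been reduced to the set of domain types named in its annotation and, per domain type, the set of pattern databases it cross-references. The record layout, the function name `count_false_negatives`, and the sample entries are hypothetical and are not taken from the original study.

```python
from typing import Dict, List


def count_false_negatives(entries: List[Dict], domain: str, database: str) -> int:
    """Count entries annotated with `domain` that lack a cross-reference
    to `database` for that domain (i.e., false negatives only)."""
    missed = 0
    for entry in entries:
        # Only entries whose annotation names the domain are considered.
        if domain not in entry["domains"]:
            continue
        # No cross-reference to the database means "not detected".
        if database not in entry["xrefs"].get(domain, set()):
            missed += 1
    return missed


# Made-up entries for illustration (hypothetical layout, not real data):
#   "domains": domain types named in the entry's annotation
#   "xrefs":   domain type -> databases cross-referenced for it
entries = [
    {"domains": {"EGF"}, "xrefs": {"EGF": {"PROSITE", "PFAM"}}},
    {"domains": {"EGF"}, "xrefs": {"EGF": {"PFAM"}}},
    {"domains": {"Trypsin"}, "xrefs": {}},
]

egf_missed_by_prosite = count_false_negatives(entries, "EGF", "PROSITE")
```

Because the count starts from annotated entries only, it cannot register false positives, which is why these figures are not directly comparable with the ANN errors.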