Table 2.

Comparison of Sequence Preprocessing Methods in Various Domain Groups

  Training set Test set Total
Group tp fp fn tn tp fp fn tn tp fp fn tn
EGF-like domainS281131034988134131117488415222152480
R291503291454013643670467
Fibronectin III DomainS21412335054100121317521325112552577
R237105071130022835020734
Sushi domain (SCR repeat)S10501351865950175821650052938
R106007759003216500109
ANK repeatS98043519055522175391513652778
R9913565401221552277
ABC transportersS530003476223900174077690052169
R5303090239204376910137
WD repeatS184363509977400175292613652668
R1862416876019226225260

[i] S = whole sequence vs. R = regions.