Table 5.

Comparison Results with BLAST

GFScan TBLASTN BLASTN
E<E m E<1e− 4 E<10 E<10
A. NGIC Family (E m = 9e − 6)[i]
Known member3737373737
Location found3845485933
Known location found2929292928
Potential candidates[ii] 88885
False positives[iii] 1 8 11 22 0
Known location missed88889
B. CA Family (E m = 9e − 10)
Known member1414141414
Location found1919233816
Known location found1212121211
Potential candidates[ii] 6 6 6 6 5
False positives[iii] 115200
Known location missed22223
C. DH-Domain Family (E m = 1c − 8)
Known member88888
Location found91116445
Known location found77775
Potential candidates[ii] 1 1 1 1 0
False positives[iii] 138360
Known location missed11113
D. ETS-Domain Family (E m = 1c − 10)
Known member1919191919
Location found2634375815
Known location found1818181814
Potential candidates[ii] 8 8 8 8 1
False positives[iii] 0811320
Known location missed11115

[i] E m: The minimum E-value used to find all known members by TBLASTN.

[ii] Genomic location that is not related to known members. The translated protein could match regular expression pattern of the gene family in the PROSITE database.

[iii] Genomic location where no gene family member locates (see detail in Methods).