Visualizing Sequence Similarity of Protein Families

Table 1.

Clustering Quality Results for the 452 Nontrivial Families




(i, c)

i

c

r
(100,100) 340 (75%) 274 (61%) 229 (51%) 332 (73%)
(90,90)
369 (82%)
290 (64%)
256 (57%)
362 (80%)
  • The columns represent different clustering schemes — column labeled i refers to clustering using percentage identity alone, column labeled c refers to clustering using percentage cover alone, etc. The first row lists families that are (100,100) clusterable, whereas the second includes families that are at least (90,90) clusterable.

This Article

  1. Genome Res. 14: 1160-1169

Preprint Server