Visualizing Sequence Similarity of Protein Families

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 2
Figure 2

A similarity graph G of eight proteins is shown at left. The weights on the edges show the percentage identity and cover of the best match between the pairs of proteins. When clustering with threshold (30, 20), G30, 20 is created from G by removing edges ce, cf, and dg. G30,20 contains, three connected components that form the clusters C1, C2 C3 shown at right.

This Article

  1. Genome Res. 14: 1160-1169

Preprint Server