A novel method for multiple alignment of sequences with repeated and shuffled elements

Figure 5.

(A) Dot matrix representation of similarities between the Q9BI25 and ABL1_HUMAN protein sequences as revealed by BLAST (Altschul et al. 1997). The two diagonals, of lengths 274 and 86, represent two domains: Pkinase (gray) and SH2 (black). (B) The corresponding ABA graph. Each multiple edge has a label of the form l(m), where l is the length of the sequences represented by that edge and m is the multiplicity of the edge. Each single edge is labeled simply as l (length) for brevity. Source/sink vertices are labeled A and Q for protein sequences ABL1_HUMAN and Q9BI25, respectively. Other vertices are numbered. The gray path through the graph corresponds to Q9BI25, and the black path corresponds to ABL1_HUMAN. The Pkinase domain corresponds to the edge (1 → 2) of length 274, and the SH2 domain corresponds to the edge (3 → 4) of length 86.
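
The edge-labeling convention in panel B can be illustrated with a minimal Python sketch. This is not the authors' implementation: the Edge class and its label method are hypothetical names introduced here. The multiplicity of 2 for the two domain edges is an assumption based on both sequences sharing those domains, and the single edge with length 100 is invented purely to show the single-edge case.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Edge:
    """Hypothetical ABA graph edge: a stretch of sequence between two vertices."""
    source: str
    target: str
    length: int            # l: length of the sequences represented by the edge
    multiplicity: int = 1  # m: number of sequence paths traversing the edge

    def label(self) -> str:
        # Multiple edges are labeled l(m); single edges are labeled l only.
        if self.multiplicity > 1:
            return f"{self.length}({self.multiplicity})"
        return str(self.length)


# Lengths and endpoints from the caption; multiplicity 2 is an assumption
# (each domain is shared by both protein sequences).
pkinase = Edge("1", "2", length=274, multiplicity=2)
sh2 = Edge("3", "4", length=86, multiplicity=2)

# Invented single edge (endpoints and length are illustrative only).
single = Edge("x", "y", length=100)

for edge in (pkinase, sh2, single):
    print(f"({edge.source} -> {edge.target}): {edge.label()}")
```

Keeping the multiplicity on the edge itself, rather than storing parallel edges, mirrors the compact labels used in the figure.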

This Article

  1. Genome Res. 14: 2336-2346
