Figure 4.

Diagram illustrating the compression and scoring algorithm. The object on the left is a relatively simple (2:2) UniGene object, denoted U1, U2:u1, u2. UniGene U1, defined by sequences S11, S12, S13, and S14, and UniGene U2, defined by sequence S21, have potential similarity to u1 (s11, s12, s13) and u2 (s21, s22). Sequence alignments among the constituent sequences are represented by the two-ended arrows between the vertical sequence bars. Alignment relationships are summarized on the upper right, and calculated scores for the four possible clustered links are shown on the lower right. The C-score is calculated as a ratio of the number of observed clustered links among aligned sequences. The A-score is calculated using all possible links between any aligned sequences within a UniGene object.
