Martin C. Frith

A simple method for finding related sequences by adding probabilities of alternative alignments

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 6.

E-values for identical alignments found by Algorithm 1 (horizontal axis) and Algorithm 2 (vertical axis). Each point is one alignment. The diagonal gray lines indicate equal E-values. These E-values were calculated with m = total length of all reference sequences (e.g., all A. aeolicus proteins) and n = total length of all query sequences (e.g., all P. fumarii proteins). For DNA, one query strand was searched against both reference strands, so m was multiplied by two. (A) Human repeat DNA; (B) proteins; (C) U2 DNA fragments; (D) U2 DNA fragments with fitted match/mismatch/gap probabilities.

This Article

Published in Advance August 16, 2024, doi: 10.1101/gr.279464.124 Genome Res. 2024. 34: 1165-1173

AbstractFree
Full TextFree
Full Text (PDF)
Supplemental Material

A simple method for finding related sequences by adding probabilities of alternative alignments

This Article

Preprint Server

Current Issue

In This Issue