Genome-wide identification of conserved regulatory function in diverged sequences

Figure 2.

Alignment scores as a function of the number of TFBSs in the target window. Trend lines are shown for correct (solid green line) and incorrect (solid red line) alignments; the black dotted line perfectly separates correct from incorrect alignments. (Inset) The receiver operating characteristic (ROC) curve for the linear SVM classifier separating orthologous from control alignments. The curve profiles performance as the true-positive rate (TPR), the fraction of all orthologous sequences that are correctly identified, against the false-positive rate (FPR), the fraction of all control sequences that are incorrectly identified as orthologs. Gray dotted lines show the standard deviation. The red dotted line displays the ROC curve for a random classifier. The solid red lines indicate the selected operating point (FPR = 0.5).
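
The TPR and FPR definitions in the caption can be illustrated with a minimal sketch that sweeps a score threshold and records one (FPR, TPR) point per threshold. This is not the authors' pipeline: the function names `roc_points` and `auc`, the scores, and the labels (1 = orthologous, 0 = control) are all hypothetical and chosen only to make the arithmetic easy to follow.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) points, one per distinct threshold, highest threshold first.

    scores: classifier scores (hypothetical values, higher = more ortholog-like)
    labels: 1 for an orthologous alignment, 0 for a control alignment
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one orthologous and one control example")

    points = [(0.0, 0.0)]  # threshold above every score: nothing is called an ortholog
    for threshold in sorted(set(scores), reverse=True):
        # Everything scoring at or above the threshold is called an ortholog.
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Made-up example: three orthologous and three control alignments.
example_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]
example_labels = [1, 1, 0, 1, 0, 0]

example_points = roc_points(example_scores, example_labels)
example_auc = auc(example_points)
print(example_points)
print(example_auc)
```

A random classifier would trace the diagonal from (0, 0) to (1, 1), with an area of 0.5. Choosing an operating point amounts to picking the threshold whose FPR matches the tolerated false-positive rate.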

This Article

  1. Genome Res. 21: 1139-1149