Identifying genes within pathways in unannotated genomes with PaGeSearch

Figure 1.

Process of PaGeSearch. (A) Genes in the specified pathway are identified from the Reactome database, and their sequences are downloaded from the Ensembl database for use as the PaGeSearch query. (B) PaGeSearch begins by finding seed regions through a sequence similarity search, then predicts genes within these regions. The final results are filtered by a neural network model that evaluates sequence similarity, gene prediction metrics, and protein alignment statistics.
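
As a rough illustration of the seed-region step in panel B, the sketch below groups nearby similarity-search hits on the same contig into padded candidate regions that a gene predictor could then be run on. It is not PaGeSearch's implementation: the hit format (contig, start, end), the function name, and the `max_gap` and `flank` parameters are all assumptions made for this example.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical hit format: (contig, start, end), 0-based, half-open.
Hit = Tuple[str, int, int]


def merge_hits_into_seed_regions(
    hits: List[Hit], max_gap: int = 1000, flank: int = 500
) -> List[Hit]:
    """Merge hits on the same contig that lie within max_gap of each other,
    then pad each merged region by flank bases on both sides.

    Assumed simplification: region ends are not clipped to contig length,
    because contig lengths are not part of the hypothetical input.
    """
    by_contig: Dict[str, List[Tuple[int, int]]] = defaultdict(list)
    for contig, start, end in hits:
        by_contig[contig].append((start, end))

    regions: List[Hit] = []
    for contig in sorted(by_contig):
        intervals = sorted(by_contig[contig])
        cur_start, cur_end = intervals[0]
        for start, end in intervals[1:]:
            if start - cur_end <= max_gap:
                # Close enough to the current region: extend it.
                cur_end = max(cur_end, end)
            else:
                # Too far away: emit the current region and start a new one.
                regions.append((contig, max(0, cur_start - flank), cur_end + flank))
                cur_start, cur_end = start, end
        regions.append((contig, max(0, cur_start - flank), cur_end + flank))
    return regions


example_hits = [
    ("chr1", 100, 200),
    ("chr1", 700, 900),
    ("chr1", 5000, 5200),
    ("chr2", 50, 80),
]
seed_regions = merge_hits_into_seed_regions(example_hits)
```

With the default parameters, the first two chr1 hits fall within `max_gap` of each other and are merged into one region, while the distant third hit and the chr2 hit each form their own region.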

This Article

  1. Genome Res. 34: 784-795
