The Ensembl Analysis Pipeline

Table 1.

Principal Analyses of the'Raw Compute' for the Human Genome With the Size of Input Sequence Used


Analysis

Input sequence size
CpG island prediction chromosome
RepeatMasker contig
Dust (low-complexity repeats) chromosome
TRF (tandem repeats) contig
Eponine (transcription start site prediction) 1-Mb slice
Genscan contig
e-PCR (STS markers) 1-Mb slice
tRNAscan contig
BLAST vs. Swall* contig
BLAST vs. Unigene* contig
BLAST vs. EMBL Vertebrate RNA*
contig
  • * BLAST analyses are only run on the peptides predicted by Genscan and not on the full genomic sequence. This is done to speed up the analysis

This Article

  1. Genome Res. 14: 934-941

Preprint Server