Correlations between various tracks and experimentally verified gene annotations
| Track | Covers | Yield Tx | Yield Co | Enrich Tx | Enrich Co |
| Human EST | 5.83% | 83.7% | 82.8% | 14.4 | 14.2 |
| Spliced EST | 1.11% | 59.2% | 72.3% | 53.3 | 65.1 |
| Mouse Blat | 3.62% | 60.4% | 82.3% | 16.7 | 22.7 |
| Other mRNA | 0.77% | 49.3% | 67.4% | 64.0 | 87.5 |
| Other EST | 0.95% | 53.0% | 69.6% | 55.8 | 73.3 |
| Exofish | 0.40% | 23.7% | 36.6% | 59.3 | 91.5 |
| Genscan | 1.65% | 57.0% | 86.0% | 34.5 | 52.1 |
| RefSeq Tx | 0.79% | 100% | 100% | 126.6 | 126.6 |
| RefSeq Co | 0.50% | 63.3% | 100% | 126.6 | 200.0 |
| A—Whole genome using RefSeq Annotations | |||||
| Track | Covers | Yield Tx | Yield Co | Enrich Tx | Enrich Co |
| Human EST | 8.40% | 78.1% | 74.7% | 9.3 | 8.9 |
| Spliced EST | 1.80% | 43.9% | 55.9% | 24.4 | 31.0 |
| Mouse Blat | 2.89% | 44.3% | 65.3% | 15.3 | 22.6 |
| Other mRNA | 1.05% | 27.8% | 41.3% | 26.5 | 39.3 |
| Other EST | 1.37% | 37.6% | 53.1% | 27.4 | 38.8 |
| Exofish | 0.61% | 16.3% | 27.1% | 26.7 | 44.4 |
| Genscan | 3.00% | 47.7% | 76.4% | 15.9 | 25.5 |
| Sanger Tx | 2.80% | 100% | 100% | 35.7 | 35.7 |
| Sanger Co | 1.60% | 57.1% | 100% | 35.7 | 62.5 |
| B—Chromosome 22 using Sanger Centre Annotations | |||||
-
The Covers column shows the percentage of the genome (A) or chromosome 22 (B) covered by a particular track. The Yield Tx column describes the percentage of bases in the annotated gene transcripts (from known genes in RefSeq in A and the Sanger Centre annotated genes in B) covered by the track, while the Yield Co column describes the percentage of the annotated protein coding regions covered. The Enrich Tx and Enrich Co columns show how many times enriched the track is for transcribed and coding regions compared to the genome as a whole. The yield columns correspond directly to sensitivity of the feature for detecting genes. Because the annotations, particularly the whole-genome annotations, are incomplete, it is not possible to do traditional specificity calculations. However, the enrichment columns allow one to compare the relative specificity of the tracks. The rows for the tracks RefSeq Tx (transcribed regions in RefSeq), RefSeq Co (coding regions in RefSeq), Sanger Tx (transcribed regions for Sanger annotated genes), and Sanger Co (coding regions from Sanger annotated genes) are included to show the maximum possible yields and enrichments for transcript and coding tracks.











