Using Database Matches with HMMGene for Automated Gene Detection in Drosophila

Table 4.

Label Probabilities for Database Matches of the Six Types Considered

Type of annotation Coding Intron in coding Intron in UTR Intergenic 5′ UTR 3′ UTR
Coding 0.300 0.150 0.050 0.050 0.050 0.050
cDNA 0.320 0.007 0.007 0.007 0.320 0.320
Intron 0.088 0.130 0.130 0.088 0.088 0.088
EST 0.120 0.107 0.107 0.107 0.120 0.120
Repeat 0.000 0.125 0.125 0.125 0.125 0.125
Intergenic 0.013 0.013 0.013 0.990 0.013 0.013
  • There are three different labels for introns in coding regions (corresponding to the three possible phases) and two intron labels for UTRs, those in 5′ UTRs, and those in 3′ UTRs. Therefore, the rows sum to 1 (e.g., for coding 0.3 + 3 × 0.15 + 2 × 0.05 + 0.05 + 0.05 + 0.05 = 1).

This Article

  1. Genome Res. 10: 523-528

Preprint Server