Table 4.
Label Probabilities for Database Matches of the Six Types Considered
| Type of annotation | Coding | Intron in coding | Intron in UTR | Intergenic | 5′ UTR | 3′ UTR |
| Coding | 0.300 | 0.150 | 0.050 | 0.050 | 0.050 | 0.050 |
| cDNA | 0.320 | 0.007 | 0.007 | 0.007 | 0.320 | 0.320 |
| Intron | 0.088 | 0.130 | 0.130 | 0.088 | 0.088 | 0.088 |
| EST | 0.120 | 0.107 | 0.107 | 0.107 | 0.120 | 0.120 |
| Repeat | 0.000 | 0.125 | 0.125 | 0.125 | 0.125 | 0.125 |
| Intergenic | 0.013 | 0.013 | 0.013 | 0.990 | 0.013 | 0.013 |
-
There are three different labels for introns in coding regions (corresponding to the three possible phases) and two intron labels for UTRs, those in 5′ UTRs, and those in 3′ UTRs. Therefore, the rows sum to 1 (e.g., for coding 0.3 + 3 × 0.15 + 2 × 0.05 + 0.05 + 0.05 + 0.05 = 1).











