Table 4.

Label Probabilities for Database Matches of the Six Types Considered

Type of annotation Coding Intron in coding Intron in UTR Intergenic 5′ UTR 3′ UTR
Coding0.3000.1500.0500.0500.0500.050
cDNA0.3200.0070.0070.0070.3200.320
Intron0.0880.1300.1300.0880.0880.088
EST0.1200.1070.1070.1070.1200.120
Repeat0.0000.1250.1250.1250.1250.125
Intergenic0.0130.0130.0130.9900.0130.013

[i] There are three different labels for introns in coding regions (corresponding to the three possible phases) and two intron labels for UTRs, those in 5′ UTRs, and those in 3′ UTRs. Therefore, the rows sum to 1 (e.g., for coding 0.3 + 3 × 0.15 + 2 × 0.05 + 0.05 + 0.05 + 0.05 = 1).