Table 3.

Sequence Analysis of Putative Promoters

All Total % Total Strict TATA Less strict TATA
obs exp obs exp
25%–45% GC146214.210231218.8014501460.90
45%–65% GC631861.515771442.2045354078.50
65%–85% GC249624.321963.9640246
102761002819272566255785
Experimental Total % Total Strict TATA Less strict TATA
obs exp obs exp
25%–45% GC138.6910.71213
45%–65% GC11978.32629.57581
65%–85% GC2013.220.7552.9
1521003740.959296.9

[i] We grouped the putative promoters in the entire dataset of 10,276 and, separately, the 152 experimental promoters into three classes: 25%–45%, 45%–65%, and 65%–85% GC. The number of promoters in each class are shown. Based on the nucleotide frequency within each group, we also calculated the expected number of promoter fragments in which a strict TATA-box (TATA[T/A][T/A]) or a less-strict TATA-box (TA[T/A][T/A][T/A][T/A]) would appear at least once by chance. Then we calculated the number of promoter fragments in both our experimental and total dataset in which these elements appeared at least once. The shaded boxes indicate those cases in which the observed and expected frequencies of TATA elements are significantly different from one another (P < 0.05).