
Test for pentamers over-represented in a position-dependent manner. (A) Early G1: ACGCG. The height of the white bar represents the number of genes in each interval in each data set. The purple shading indicates the number of upstream regions in that interval that contain one or more copy of the sequence element. The number of upstream regions expected to contain that sequence element, if the element were evenly distributed among all intervals in all data sets, is marked with the pink box. The blue line indicates the contribution that each interval makes to the χ2 score. (B) The pentamer ACGCG is over-represented in late G1 upstream regions in the interval −104 to −202 nucleotides upstream of the ATG start codon—the observed number of elements (purple) is greater than the expected number (pink). (C) It is somewhat over-represented in the same interval in S. The total χ2 score for ACGCG is 239.8. (D) G1: ACGCG; (E) M: ACGCG.











