A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection

Figure 3.

Sliding-window calculation of pp in genomic sequences surrounding IDA. The pp values were determined in 75-bp windows with 3-bp steps for A. thaliana chromosome sequences; the values shown are for a region containing the small protein gene IDA and its flanking sequences. The diagram on top indicates the locations of exons (white box, untranslated regions; black box, CDS), introns (bent lines), transcriptional starts (small arrows), and intergenic sequences (thick gray lines). The six plots below the annotation diagram show the pp calculated in each of the six reading frames (forward, +; reverse, −). The dotted line indicates pp = 0.2239, the threshold used to call whether a 75-bp window is likely a CDS. The shaded areas highlight the overlap between the IDA CDS and regions with a high pp. The arrow indicates the correct frame for the IDA CDS.
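
The windowing scheme described in the caption (75-bp windows, 3-bp steps, six reading frames, a cutoff of 0.2239) can be illustrated with a minimal Python sketch. The caption does not say how pp itself is computed, so the scoring function is passed in by the caller; the `gc_fraction` function below is only a hypothetical stand-in and is not the authors' measure. Treating a window as a likely CDS when its score is at or above the cutoff is also an assumption, as are all function names.

```python
from typing import Callable, Iterator, List, Tuple

WINDOW = 75         # window length in bp (from the caption)
STEP = 3            # step size in bp (from the caption)
THRESHOLD = 0.2239  # pp cutoff (from the caption)

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def windows(seq: str, offset: int, window: int = WINDOW,
            step: int = STEP) -> Iterator[Tuple[int, str]]:
    """Yield (start, subsequence) for every full window in one frame.

    `offset` (0, 1 or 2) selects the reading frame; because the step is
    3 bp, every window from one offset stays in the same frame.
    """
    for start in range(offset, len(seq) - window + 1, step):
        yield start, seq[start:start + window]


def scan_six_frames(seq: str, score: Callable[[str], float],
                    threshold: float = THRESHOLD
                    ) -> List[Tuple[str, int, int, float, bool]]:
    """Score every window in all six frames.

    Returns tuples of (strand, frame, start, value, above_threshold).
    For the "-" strand, `start` is a coordinate on the
    reverse-complemented sequence, not on the input sequence.
    Assumption: a window counts as a likely CDS when value >= threshold.
    """
    results = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            for start, sub in windows(s, offset):
                value = score(sub)
                results.append(
                    (strand, offset + 1, start, value, value >= threshold))
    return results


def gc_fraction(sub: str) -> float:
    """Hypothetical placeholder score; NOT the pp used in the paper."""
    sub = sub.upper()
    return (sub.count("G") + sub.count("C")) / len(sub)


if __name__ == "__main__":
    toy = "ACG" * 27  # 81-bp toy sequence, not real genomic data
    for row in scan_six_frames(toy, gc_fraction):
        print(row)
```

Passing the scoring function as an argument keeps the windowing logic separate from the statistic, so the same scan could be reused with whatever pp calculation is actually applied.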

This Article

  1. Genome Res. 17: 632-640
