Zhi John Lu; Kevin Y. Yip; Guilin Wang; Chong Shou; LaDeana W. Hillier; Ekta Khurana; Ashish Agarwal; Raymond Auerbach; Joel Rozowsky; Chao Cheng; Masaomi Kato; David M. Miller; Frank Slack; Michael Snyder; Robert H. Waterston; Valerie Reinke; Mark B. Gerstein

Prediction and characterization of noncoding RNAs in C. elegans by integrating conservation, secondary structure, and high-throughput sequencing and array data

Figure 1.

Distributions of nine genomic feature values. The distributions of values of the nine features are shown for the gold-standard set (for the definition of the gold-standard set, see Supplemental Methods) of the four types of genomic elements: known ncRNAs, coding sequences (CDSs), untranslated regions (UTRs), and intergenic regions. The values of each expression feature are the maximum of the corresponding values from all the expression data sets of the same type. (A) Box plots of individual features (normalized values). (B) Two-dimensional scatter-plot of the maximum small RNA-seq signal against the maximum poly-A+ RNA-seq signal. (C) Two-dimensional scatter-plot of the maximum poly-A+ RNA tiling array signal against the predicted secondary structure conservation. Expression values in B and C are the log-transformed normalized read counts (DCPM, depth of coverage per million reads).
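The per-element expression value described in the caption (the maximum over all data sets of the same type, on a log-transformed DCPM scale) can be illustrated with a minimal sketch. The caption does not specify the exact DCPM formula, the log base, or any pseudocount, so the definitions below (mean per-base depth divided by millions of mapped reads, base-2 logarithm, pseudocount of 1) are assumptions, and the function names and input numbers are hypothetical.

```python
import math


def dcpm(mean_depth, total_mapped_reads):
    """Depth of coverage per million mapped reads (assumed definition)."""
    return mean_depth / (total_mapped_reads / 1_000_000)


def max_log_signal(depths, library_sizes, pseudocount=1.0):
    """Maximum log-transformed DCPM across data sets of one type.

    The log base (2) and the pseudocount are assumptions, not taken
    from the caption.
    """
    return max(
        math.log2(dcpm(depth, size) + pseudocount)
        for depth, size in zip(depths, library_sizes)
    )


# Hypothetical element measured in three small RNA-seq data sets.
depths = [12.0, 3.5, 40.0]                          # mean per-base depth
library_sizes = [4_000_000, 2_000_000, 10_000_000]  # mapped reads per data set
signal = max_log_signal(depths, library_sizes)
```

Taking the maximum rather than the mean keeps an element that is strongly expressed in only one stage or condition from being diluted by data sets in which it is silent.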

Published in Advance December 22, 2010, doi: 10.1101/gr.110189.110 Genome Res. 2011. 21: 276-285
