Genome wide analysis of 3′ UTR sequence elements and proteins regulating mRNA stability during maternal-to-zygotic transition in zebrafish

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 5.
Figure 5.

Modeling the effect of sequence on mRNA stability in vivo. (A) Scheme of the procedure for building the random forest model on RESA-targeted profiles. Data generated from window-sliding across the RESA profiles are used to train a random forest model. (B) Example of predicted (red) and RESA-targeted experimental (black) profiles for epb41l5 gene. (C) Model performance per window using fivefold cross-validation; model achieved 0.68 Pearson correlation between predicted stability and measured according to RESA-targeted library. (D, left) Top selected motifs according to the random forest model trained on RESA-targeted library (motifs in green text represent the miR-430 target sites). Columns represent WT, LNA430, and α-amanitin treatment. Color intensity represents the RESA fold-change difference between windows that do or do not contain each motif. Blue represents stabilizing motifs, and red represents destabilizing motifs (all motifs have P below 4.6 × 10−56 (Mann–Whitney U test followed by Bonferroni multiple testing adjustment). (Right) Heatmap representing motif enrichment in the top 50 hexamers enriched in iCLIP experiments. (E) Same as D with random forest model trained on the RESA targeted with poly(A) selection library. (F,G) U-rich (F) and ARE (G) motifs’ enrichment between 2 and 6 hpf within total and/or poly(A) selected RNA-seq. In heatmap, overrepresentation (yellow) and underrepresentation (blue) patterns are shown. Also shown are the mutual information values, Z-scores associated with a randomization-based statistical test, and robustness scores from a threefold jackknifing test. (H) RESA-targeted profile from the trip10a locus. Genetic deletion of a sequence spanning regulated region (black line) results in stabilization of the trip10a transcript as assessed by in situ hybridization (n = 20/20; I). (J) Random forest model validation. Barplot comparing experimental (RNA level by high-throughput sequencing) and predicted average stability of trip10a decay peak and four mutated sites (Spearman's correlation coefficient r = 0.60).

This Article

  1. Genome Res. 29: 1100-1114

Preprint Server