Genome-wide detection of novel regulatory RNAs in E. coli

  1. Howard Ochman2,3
  1. 1 Yale university, New haven CT;
  2. 2 Yale University, New haven, CT
  1. * Corresponding author; email: howard.ochman{at}yale.edu

Abstract

The intergenic regions in bacterial genomes can contain regulatory leader sequences and small RNAs (sRNAs), which both serve to modulate gene expression. Computational analyses have predicted the presence of hundreds of these non-coding regulatory RNAs in Escherichia coli; however, only about 80 have been experimentally validated. By applying a deep-sequencing approach, we detected and quantified the vast majority of the previously validated regulatory elements, and identified 10 new sRNAs and 9 new regulatory leader sequences in the intergenic regions of E. coli. Half of the newly discovered sRNAs displayed enhanced stability in the presence of the RNA-binding protein Hfq, which is vital to the function of many of the known E. coli sRNAs. Whereas previous methods have often relied on phylogenetic conservation to identify regulatory leader sequences, only five of the newly discovered E. coli leader sequences were present in the genomes of other enteric species. For those newly identified regulatory elements having orthologs in Salmonella, evolutionary analyses showed that these regions encoded new non-coding elements rather than small, unannotated protein-coding transcripts. In addition to discovering new non-coding regulatory elements, we validated 53 sRNAs that were previously predicted but never detected and showed that the presence, within intergenic regions, of σ70 promoters and sequences with compensatory mutations that maintain stable RNA secondary structures across related species are good predictors of novel sRNAs.

  • Received December 17, 2010.
  • Accepted June 8, 2011.
ACCEPTED MANUSCRIPT

Preprint Server