Gene Length and Proximity to Neighbors Affect Genome-Wide Expression Levels

  1. Francesca Chiaromonte1,
  2. Webb Miller2, and
  3. Eric E. Bouhassira,3,4
  1. 1 Department of Statistics and Department of Health Evaluation Sciences, Pennsylvania State University, University Park, Pennsylvania 16802, USA
  2. 2 Department of Computer Science and Engineering and Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA
  3. 3 Department of Medicine, Division of Hematology and Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York 10461, USA

Abstract

Steady-state levels of mRNA in cells theoretically depend on the rate and efficiency of transcription and posttranscriptional processing, on mRNA stability, on transcriptional interference from other genes, and on poorly defined long-range chromatin effects.Although each of these cellular processes has been studied in detail for a few genes, it is not possible to predict expression levels by simply examining gene sequences.In this report, we have used a bioinformatics approach to identify critical factors that influence expression levels. To simplify the problem, we have limited our analysis to the collection of genes expressed in all tissues, because such genes provide a unique opportunity to distinguish the role of general genomic features that constrain gene expression from the effect of tissue-specific factors.Using correlation and regression techniques, we have investigated the dependence between expression level and morphological parameters (distance to neighbors, gene, mRNA or 3′-UTR length, number of exons, etc.) that can be directly related to transcription, posttranscriptional processing, mRNA stability, or transcriptional interference.We found that, on a genome-wide scale, highly expressed genes are significantly farther from their closest neighboring genes, are smaller, contain a moderate number of exons, and produce shorter mRNAs with shorter 3′-UTRs.This confirms that transcriptional and posttranscriptional processes are highly interrelated and implies that transcriptional interference plays a role in determining steady-state levels of mRNA in cells.

Footnotes

  • [Supplemental material is available online at www.genome.org. The data sets and details on data preparation and preprocessing can be found at http://bio.cse.psu.edu/dist/bouhassira/. The complete list of genes used in this study is available at http://bio.cse.psu.edu/.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1169203. Article published online before print in November 2003.

  • 4 Corresponding author. E-MAIL bouhassi{at}aecom.yu.edu; FAX (718) 824-3153.

    • Accepted September 3, 2003.
    • Received January 14, 2003.

Articles citing this article

| Table of Contents

Preprint Server