Research

A quantitative framework for characterizing the evolutionary history of mammalian gene expression

    • 1Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA;
    • 2Division of Health Science and Technology, MIT, Cambridge, Massachusetts 02139, USA;
    • 3Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, Massachusetts 02115, USA;
    • 4Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA;
    • 5Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, 752 36 Uppsala, Sweden;
    • 6Earlham Institute, Norwich NR4 7UZ, United Kingdom;
    • 7Department of Biological and Medical Sciences, University of East Anglia, Norwich NR4 7TJ, United Kingdom;
    • 8Department of Biology and Koch Institute, MIT, Cambridge, Massachusetts 02142, USA;
    • 9Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA
Published December 14, 2018. Vol 29 Issue 1, pp. 53-63. https://doi.org/10.1101/gr.237636.118
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 4
Current Issue:

Abstract

The evolutionary history of a gene helps predict its function and relationship to phenotypic traits. Although sequence conservation is commonly used to decipher gene function and assess medical relevance, methods for functional inference from comparative expression data are lacking. Here, we use RNA-seq across seven tissues from 17 mammalian species to show that expression evolution across mammals is accurately modeled by the Ornstein–Uhlenbeck process, a commonly proposed model of continuous trait evolution. We apply this model to identify expression pathways under neutral, stabilizing, and directional selection. We further demonstrate novel applications of this model to quantify the extent of stabilizing selection on a gene's expression, parameterize the distribution of each gene's optimal expression level, and detect deleterious expression levels in expression data from individual patients. Our work provides a statistical framework for interpreting expression data across species and in disease.

Loading
Loading
Back to top