Base-specific mutational intolerance near splice sites clarifies the role of nonessential splice nucleotides

  1. Mark J. Daly1,2,4,5
  1. 1Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts 02114, USA;
  2. 2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA;
  3. 3Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, Massachusetts 02115, USA;
  4. 4Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA;
  5. 5Institute for Molecular Medicine Finland (FIMM), 00290 Helsinki, Finland
  • Corresponding author: mjdaly{at}atgu.mgh.harvard.edu
  • Abstract

    Variation in RNA splicing (i.e., alternative splicing) plays an important role in many diseases. Variants near 5′ and 3′ splice sites often affect splicing, but the effects of these variants on splicing and disease have not been fully characterized beyond the two “essential” splice nucleotides flanking each exon. Here we provide quantitative measurements of tolerance to mutational disruptions by position and reference allele–alternative allele combinations. We show that certain reference alleles are particularly sensitive to mutations, regardless of the alternative alleles into which they are mutated. Using public RNA-seq data, we demonstrate that individuals carrying such variants have significantly lower levels of the correctly spliced transcript, compared to individuals without them, and confirm that these specific substitutions are highly enriched for known Mendelian mutations. Our results propose a more refined definition of the “splice region” and offer a new way to prioritize and provide functional interpretation of variants identified in diagnostic sequencing and association studies.

    Footnotes

    • Received November 1, 2017.
    • Accepted May 31, 2018.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    Articles citing this article

    Preprint Server