Pathogenicity and selective constraint on variation near splice sites

  1. The Deciphering Developmental Disorders Study16
  1. 1 Wellcome Sanger Institute;
  2. 2 Manchester Centre for Genomic Medicine;
  3. 3 Sheffield Clinical Genetics Service;
  4. 4 Liverpool Women's Hospital Foundation Trust;
  5. 5 West of Scotland Regional Genetics Service;
  6. 6 South East Thames Regional Genetics Centre;
  7. 7 Wessex Clinical Genetics Service;
  8. 8 South West Thames Regional Genetics Centre;
  9. 9 Temple Street Children's Hospital;
  10. 10 Northern Ireland Regional Genetics Centre;
  11. 11 North West Thames Regional Genetics Service;
  12. 12 Northern Ireland Regional Genetics Service;
  13. 13 University of Edinburgh;
  14. 14 University of Exeter;
  15. 15 East Anglian Medical Genetics Service;
  16. 16 -
  • * Corresponding author; email: meh{at}sanger.ac.uk
  • Abstract

    Mutations which perturb normal pre-mRNA splicing are significant contributors to human disease. We used exome sequencing data from 7,833 probands with developmental disorders (DD) and their unaffected parents, as well as >60,000 aggregated exomes from the Exome Aggregation Consortium, to investigate selection around the splice site, and quantify the contribution of splicing mutations to DDs. Patterns of purifying selection, a deficit of variants in highly constrained genes in healthy subjects and excess de novo mutations in patients highlighted particular positions within and around the consensus splice site of greater functional relevance. Using mutational burden analyses in this large cohort of proband-parent trios, we could estimate in an unbiased manner the relative contributions of mutations at canonical dinucleotides (73%) and flanking non-canonical positions (27%), and calculated the positive predictive value of pathogenicity for different classes of mutations. We identified 18 patients with likely diagnostic de novo mutations in dominant DD-associated genes at non-canonical positions in splice sites. We estimate 35-40% of pathogenic variants in non-canonical splice site positions are missing from public databases.

    • Received April 13, 2018.
    • Accepted December 20, 2018.

    This manuscript is Open Access.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International license), as described at http://creativecommons.org/licenses/by/4.0/.

    Articles citing this article

    OPEN ACCESS ARTICLE
    ACCEPTED MANUSCRIPT

    This Article

    1. Genome Res. gr.238444.118 Published by Cold Spring Harbor Laboratory Press

    Article Category

    ORCID

    Share

    Preprint Server