Kinetics of Xist-induced gene silencing can be predicted from combinations of epigenetic and genomic features

  1. Annalisa Marsico1,5,7,11
  1. 1Otto Warburg Laboratories, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany;
  2. 2Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA;
  3. 3Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, UPMC Paris-Sorbonne, 75005 Paris, France;
  4. 4Institut Curie, PSL Research University, Mines Paris Tech, INSERM U900, 75005 Paris, France;
  5. 5Department of Mathematics and Informatics, Free University of Berlin, 14195 Berlin, Germany
  1. 6 These authors are co-first authors and contributed equally to this work.

  2. 7 These authors are co-senior authors and contributed equally to this work.

  • Present addresses: 8Annoroad Gene Technology Co., Ltd., 100176 Beijing, China; 9Department of Genetics, University Medical Centre Groningen, University of Groningen, 9700 RB Groningen, The Netherlands; 10Institut Cochin, Inserm U1016, CNRS UMR8104, Université Paris Descartes, Sorbonne Paris Cité, 74014 Paris, France; 11Institute for Computational Biology (ICB), Helmholtz Zentrum München, 85764 Oberschleißheim, Germany; 12European Molecular Biology Laboratory (EMBL), Directors’ research unit, 69117 Heidelberg, Germany

  • Corresponding authors: Edith.Heard{at}embl.org, annalisa.marsico{at}helmholtz-muenchen.de, edda.schulz{at}molgen.mpg.de, johnlis{at}cornell.edu
  • Abstract

    To initiate X-Chromosome inactivation (XCI), the long noncoding RNA Xist mediates chromosome-wide gene silencing of one X Chromosome in female mammals to equalize gene dosage between the sexes. The efficiency of gene silencing is highly variable across genes, with some genes even escaping XCI in somatic cells. A gene's susceptibility to Xist-mediated silencing appears to be determined by a complex interplay of epigenetic and genomic features; however, the underlying rules remain poorly understood. We have quantified chromosome-wide gene silencing kinetics at the level of the nascent transcriptome using allele-specific Precision nuclear Run-On sequencing (PRO-seq). We have developed a Random Forest machine-learning model that can predict the measured silencing dynamics based on a large set of epigenetic and genomic features and tested its predictive power experimentally. The genomic distance to the Xist locus, followed by gene density and distance to LINE elements, are the prime determinants of the speed of gene silencing. Moreover, we find two distinct gene classes associated with different silencing pathways: a class that requires Xist-repeat A for silencing, which is known to activate the SPEN pathway, and a second class in which genes are premarked by Polycomb complexes and tend to rely on the B repeat in Xist for silencing, known to recruit Polycomb complexes during XCI. Moreover, a series of features associated with active transcriptional elongation and chromatin 3D structure are enriched at rapidly silenced genes. Our machine-learning approach can thus uncover the complex combinatorial rules underlying gene silencing during X inactivation.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.245027.118.

    • Freely available online through the Genome Research Open Access option.

    • Received October 11, 2018.
    • Accepted May 28, 2019.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    Articles citing this article

    OPEN ACCESS ARTICLE

    Preprint Server