Uncovering cis-regulatory sequence requirements for context-specific transcription factor binding

  1. Alexander Stark1,5
  1. 1Research Institute of Molecular Pathology (IMP), 1030 Vienna, Austria;
  2. 2Gregor Mendel Institute of Molecular Plant Biology (GMI), Austrian Academy of Sciences, 1030 Vienna, Austria;
  3. 3Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, 1030 Vienna, Austria
    1. 4 These authors contributed equally to this work.

    Abstract

    The regulation of gene expression is mediated at the transcriptional level by enhancer regions that are bound by sequence-specific transcription factors (TFs). Recent studies have shown that the in vivo binding sites of single TFs differ between developmental or cellular contexts. How this context-specific binding is encoded in the cis-regulatory DNA sequence has, however, remained unclear. We computationally dissect context-specific TF binding sites in Drosophila, Caenorhabditis elegans, mouse, and human and find distinct combinations of sequence motifs for partner factors, which are predictive and reveal specific motif requirements of individual binding sites. We predict that TF binding in the early Drosophila embryo depends on motifs for the early zygotic TFs Vielfaltig (also known as Zelda) and Tramtrack. We validate experimentally that the activity of Twist-bound enhancers and Twist binding itself depend on Vielfaltig motifs, suggesting that Vielfaltig is more generally important for early transcription. Our finding that the motif content can predict context-specific binding and that the predictions work across different Drosophila species suggests that characteristic motif combinations are shared between sites, revealing context-specific motif codes (cis-regulatory signatures), which appear to be conserved during evolution. Taken together, this study establishes a novel approach to derive predictive cis-regulatory motif requirements for individual TF binding sites and enhancers. Importantly, the method is generally applicable across different cell types and organisms to elucidate cis-regulatory sequence determinants and the corresponding trans-acting factors from the increasing number of tissue- and cell-type-specific TF binding studies.

    Footnotes

    • 5 Corresponding author

      E-mail stark{at}starklab.org

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.132811.111.

      Freely available online through the Genome Research Open Access option.

    • Received September 30, 2011.
    • Accepted April 24, 2012.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    Related Article

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server