Unraveling transcription regulatory networks by protein–DNA and protein–protein interaction mapping

Albertha J.M. Walhout

doi:10.1101/gr.5321506

Program in Gene Function and Expression and Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA

Abstract

Metazoan genomes contain thousands of protein-coding and noncoding RNA genes, most of which are differentially expressed, i.e., at different locations or at different times during development, function, or pathology of the organism. Differential gene expression is achieved in part by the action of regulatory transcription factors (TFs) that bind to cis-regulatory elements that are often located in or near their target genes. Each TF likely regulates many targets in the context of intricate transcription regulatory networks. Up to 10% of a genome may encode TFs, but only a handful of these have been studied in detail. Here, I will discuss the different steps involved in the mapping and analysis of transcription regulatory networks, including the identification of network nodes (TFs and their target sequences) and edges (TF–TF dimers and TF–DNA target interactions), integration with other data types, and network properties and emerging principles that provide insights into differential gene expression.

Metazoan genomes contain thousands of protein- and RNA-encoding genes. Some genes are ubiquitously expressed, whereas others are expressed in a tightly controlled manner in only part of the organism, or under particular conditions during development or disease. In order to understand how differential gene expression is controlled at a genome-wide or systems level, it is important to identify all the cis-acting regulatory sequences and trans-acting factors involved, and how and when they interact to affect gene expression.

Differential gene expression can be regulated at the transcriptional and at the post-transcriptional level by three types of trans-acting factors (Fig. 1). Regulatory transcription factors (TFs) can activate or repress transcription by physically interacting with genomic cis-regulatory DNA elements that can be located in gene promoters, or at a greater genomic distance in enhancers, or in introns (Fig. 1A).

Figure 1.

Regulators of gene expression physically interact with their targets. (A) Transcriptional regulation. Regulatory TFs function by binding to proteins and to DNA. Black boxes, exons; blue squares, cis-regulatory DNA elements; red ellipses, TFs; arrow, transcription start site. Curved arrows indicate activation of gene expression and the blunt “arrow” indicates transcriptional repression. AD, transcription activation domain; RD, transcription repression domain; DB, DNA binding domain. (B) Post-transcriptional regulation. RNA binding proteins and microRNAs function by directly interacting with their target mRNAs. Arrow, transcription start site; purple box, microRNA gene; green line, 3′ UTR of target mRNA; purple line, microRNA; yellow circle, RNA binding protein. Upstream regulation of miRNA expression (by the pink TF binding to the light blue element) is indicated and connects transcriptional and post-transcriptional gene regulation.

RNA binding proteins can interact with specific cis-regulatory RNA elements, for instance, that are located in the 3′ untranslated region of an mRNA molecule (Fig. 1B). The binding of RNA binding proteins regulates differential gene expression at the post-transcriptional level, by affecting transcript localization, translation, or degradation (for reviews, see Hieronymus and Silver 2004; Keene and Lager 2005).

microRNAs exclusively repress gene expression by physically interacting, through hybridization, with cis-regulatory elements located in the 3′ untranslated region of their target mRNAs (Fig. 1B). This hybridization results in the inhibition of translation and/or decreased mRNA stability (Ambros 2004; Du and Zamore 2005). Thus, TFs, RNA binding proteins, and microRNAs physically interact with their target genes, either at the DNA or at the mRNA level. Such regulator-target interactions are now being systematically mapped and modeled into regulatory networks. Because information about many genes and TFs is assembled into a single network model, transcription regulatory networks provide insight into the principles and properties that control differential gene expression at a systems level, rather than at the level of individual genes.

What is a regulatory network?

Network models are composed of nodes and edges that describe relationships between nodes. In biological networks, the nodes are bioactive macromolecules such as proteins, DNA, RNA, and metabolites (Barabasi and Oltvai 2004). Two types of regulatory networks can be distinguished: transcription regulatory networks and post-transcription regulatory networks (Fig. 2). Each of these types of networks can be subdivided into physical and functional networks. Physical networks contain protein–protein, protein–DNA, protein–RNA, and/or RNA–RNA interactions (Fig. 2A,B). Functional networks incorporate the consequences of these physical interactions, e.g., activation or repression of gene expression (Fig. 2C,D). Ultimately, transcription and post-transcription regulatory networks need to be combined to obtain a comprehensive picture of all aspects of the regulation of differential gene expression in complex metazoan systems (Fig. 2E).

Figure 2.

Regulatory networks. (A) Protein–DNA and protein–protein interaction network involving regulatory TFs and their target genes. (B) Protein–RNA, protein–protein, and microRNA–RNA interaction network involving RNA binding proteins and their target mRNAs and microRNAs and their target mRNAs. (C) Transcription regulatory networks. The transcriptional consequences of the protein interactions shown in A are included. (D) Post-transcription regulatory networks. The effects of the protein interactions shown in B on target gene expression are included. (E) Combined transcription and post-transcription regulatory networks. Red nodes, TFs; blue nodes, target genes; green node, a gene that is both a target gene and encodes a TF; yellow nodes, RNA binding proteins; purple nodes, microRNAs. Black edges, protein–DNA interactions; dashed edges, protein–RNA interactions; red edges, microRNA–RNA interactions; blue edges, protein–protein interactions. Arrows, activation; blunt “arrow,” repression.
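
The distinction between physical and functional networks can be made concrete with a small data-structure sketch. This is only an illustration: the class name, the `effect` field, and all node names below are assumptions introduced for this example and do not come from any data set cited here.

```python
from dataclasses import dataclass
from typing import Optional

# Physical interaction classes named in the text.
EDGE_TYPES = {"protein-DNA", "protein-protein", "protein-RNA", "RNA-RNA"}


@dataclass(frozen=True)
class Edge:
    source: str                   # regulator (TF, RNA binding protein, microRNA)
    target: str                   # target gene, mRNA, or partner protein
    kind: str                     # one of EDGE_TYPES
    effect: Optional[str] = None  # "activation" or "repression"; None if unknown

    def __post_init__(self):
        if self.kind not in EDGE_TYPES:
            raise ValueError(f"unknown edge type: {self.kind}")


def physical_view(edges):
    """Keep only who contacts whom, dropping functional annotation."""
    return {(e.source, e.target, e.kind) for e in edges}


def functional_view(edges):
    """Keep only edges whose regulatory consequence is known."""
    return {(e.source, e.target, e.effect) for e in edges if e.effect}


# Hypothetical toy network; the names are placeholders.
edges = [
    Edge("TF_A", "gene_1", "protein-DNA", "activation"),
    Edge("TF_A", "TF_B", "protein-protein"),
    Edge("TF_B", "gene_2", "protein-DNA", "repression"),
    Edge("miR_X", "gene_1", "RNA-RNA", "repression"),
]
```

Keeping the functional consequence as an optional attribute of a physical edge means that one edge list can serve both views, and that interactions of unknown consequence are not lost.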

In this review, I will focus on the mapping of transcription regulatory networks. I will discuss the identification of predicted TFs and cis-regulatory sequences, i.e., network nodes, and the protein–protein and protein–DNA interaction mapping approaches that are being used to identify physical interactions between these nodes, i.e., network edges. I will discuss several emerging insights and hypotheses that can be derived from such networks, and the future challenges that lie ahead in this rapidly evolving field.

Identifying network nodes

Transcription regulatory networks contain two types of nodes: regulatory TFs and their target DNA sequences. Many different strategies have been employed to identify both types of nodes, including computational and experimental methods.

Regulatory transcription factors

Regulatory TFs are composed of at least two types of domains: a DNA binding domain, which interacts with the cognate DNA target sequence, and a transcription regulation domain, which activates or represses transcription (Fig. 1A). TFs are grouped into families based on their predicted DNA binding domains. To date, more than 100 different DNA binding domains have been found (Kummerfeld and Teichmann 2006). These domains have been used to computationally predict which genes in a genome of interest encode regulatory TFs. However, computational prediction alone is insufficient to obtain comprehensive and high-quality TF predictions. For instance, we recently obtained a high-quality compendium of Caenorhabditis elegans TFs by a combination of computational prediction and extensive manual curation, which drastically reduced the number of false-positive and false-negative predictions (Reece-Hoyes et al. 2005). It should be feasible to obtain such comprehensive predictions for other organisms, including human, as well. However, even manually curated collections are likely incomplete, as not all DNA binding domains have yet been uncovered. For example, both yeast and C. elegans proteins that bind DNA but do not possess a known DNA binding domain have recently been retrieved (Hall et al. 2004; Deplancke et al. 2006).

TF predictions have led to the observation that, in increasingly complex metazoan organisms, a larger proportion of the genome encodes TFs, compared with relatively simple, unicellular eukaryotes. For instance, the genome of the unicellular yeast Saccharomyces cerevisiae encodes ∼200 predicted TFs (Harbison et al. 2004), which is ∼3% of all protein-coding genes; the relatively simple metazoan nematode C. elegans contains 934 predicted TFs, which is ∼5% of all protein-coding genes (Reece-Hoyes et al. 2005); and more complex eukaryotes such as humans may devote up to 10% of their coding power to regulatory TFs (Levine and Tjian 2003).

TFs interact with different types of DNA sequences, including promoters and cis-regulatory modules, and, within such larger elements, bind to specific cis-regulatory elements or TF binding sites. Considerable efforts are underway to identify each of these elements in order to decipher the “regulatory code” that controls differential gene expression. For instance, the ENCODE (ENCyclopedia of DNA elements) Consortium aims to identify all functional elements in the human genome (ENCODE Project Consortium 2004). So far the efforts of this consortium have focused on 1% of the genome, or 30 Mb of sequence, which contains ∼600 predicted protein-coding genes. In order to gain insight into the regulatory code of a genome, one of the first steps is to comprehensively identify all gene promoters.

Promoters

A gene promoter is defined as the regulatory sequence (a few hundred base pairs) that is located immediately upstream of the transcription start site (for review, see Maston et al. 2006). Eukaryotic protein and microRNA-encoding gene promoters are composed of two parts: a proximal promoter that serves as a recognition sequence for the pre-initiation complex and RNA polymerase II, and a distal promoter that performs a regulatory function by interacting with regulatory TFs. The identification of promoters is relatively straightforward in unicellular eukaryotes such as S. cerevisiae as its genome is compact (Goffeau et al. 1996): It contains very few introns and short intergenic regions (median length shorter than 400 bp). Hence, the intergenic regions contain most cis-regulatory elements and can be used as a proxy for gene promoters when transcription start sites have not been precisely mapped. Since the genomes of higher eukaryotes contain longer intergenic regions with many repeat sequences, and because transcription start sites are often poorly defined, it is more difficult to accurately pinpoint metazoan gene promoters. Moreover, higher eukaryotic genes may frequently be regulated from multiple, alternative promoters.

Several experimental approaches have been developed for transcription start site and, thus, promoter annotation. First, full-length cDNA sequencing has led to the annotation of thousands of transcripts for both the murine and human genome (Imanishi et al. 2004; Carninci et al. 2005). Second, the use of genome-wide tiling arrays has enabled the identification of 5′ and 3′ boundaries of transcripts (Carninci et al. 2005). Third, cap analysis of gene expression (CAGE) has been used to more precisely define transcription start sites in mammalian promoters (Carninci et al. 2006). Fourth, chromatin immunoprecipitations (see below) with anti-TFIID and anti-RNA polymerase II antibodies have been used to identify many active human promoters (Kim et al. 2005a, b). Finally, transient transfection assays identified 387 gene promoters from the ENCODE regions that drive gene expression in at least one of 16 different cell lines (Cooper et al. 2006). Although considerable progress has been made, it is likely that highly sensitive experimental methods need to be developed to identify promoters that are rarely active.

Cis-regulatory modules

Many gene promoters have been identified to date. However, the genome-wide identification of enhancers and silencers in higher eukaryotes has been relatively slow. This is because they can be located at a great genomic distance from the target’s transcription start site(s) and can be found upstream, downstream, or within introns (for review, see Maston et al. 2006).

It has been postulated that functional TF binding sites often occur in clusters and form cis-regulatory modules (Davidson 2001). Recently, this hypothesis has been utilized by several groups for the computational prediction of cis-regulatory modules that may constitute enhancers or silencers (Aerts et al. 2003; Sharan et al. 2003; Gupta and Liu 2005; Blanchette et al. 2006; Hallikas et al. 2006). The methods used by these groups provide powerful tools to search for cis-regulatory modules containing consensus binding motifs for TFs for which the recognition sequence has been mapped. However, to date such information is only available for a limited number of TFs.
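
The clustering hypothesis lends itself to a very simple computational reading, sketched below. This is a minimal illustration and not the algorithm of any of the cited groups: the function name, the window size, and the minimum site count are assumptions chosen for the example.

```python
def find_site_clusters(site_positions, window=500, min_sites=3):
    """Greedy, non-overlapping grouping of predicted TF binding sites.

    Starting from each unassigned site, collect all following sites that lie
    within `window` bp of it; report the group as (first, last, count) when it
    contains at least `min_sites` sites.
    """
    positions = sorted(site_positions)
    clusters = []
    i, n = 0, len(positions)
    while i < n:
        j = i
        # Extend while the next site is within `window` bp of the first site.
        while j + 1 < n and positions[j + 1] - positions[i] <= window:
            j += 1
        count = j - i + 1
        if count >= min_sites:
            clusters.append((positions[i], positions[j], count))
            i = j + 1  # continue after the reported cluster
        else:
            i += 1
    return clusters


# Hypothetical site coordinates (bp) along one genomic region.
example_sites = [100, 180, 260, 5000, 9000, 9100, 9150, 9400]
example_clusters = find_site_clusters(example_sites)
```

With these made-up coordinates, the isolated site at 5000 bp is discarded, which illustrates why methods of this kind cannot recover functional, nonclustered binding sites.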

In addition to using computational tools to predict regulatory sequences, experimental methods can be used for the discovery of cis-regulatory modules (for review, see Elnitski et al. 2006). For instance, the observation that the genome is more accessible to DNaseI when TFs are bound, leading to DNaseI hypersensitive sites, can be used to identify cis-regulatory modules. Until recently, the unbiased, genome-wide mapping of such sites has been hampered by a lack of high-throughput “readout” methods that can be used to map such sites onto genome sequences. Several groups have already made significant progress toward this goal, for instance by combining DNaseI treatment with microarrays or massive parallel sequencing (Dorschner et al. 2004; Crawford et al. 2006a, b; Sabo et al. 2006). Integration with other types of data will be necessary to delineate the function of each DNaseI hypersensitive site and to find the transacting factors that bind to these sites.

TF binding sites and cis-regulatory elements

For a thorough understanding of transcription regulatory networks, it is not only important to find promoters and cis-regulatory modules, but also to precisely map the cis-regulatory elements located within these longer sequences. Individual cis-regulatory elements are short (usually <20 bp) DNA sequences that interact directly with regulatory TFs. Such TF binding sites have traditionally been mapped using a combination of deletion analyses and reporter gene expression (see, e.g., Davidson et al. 2002). However, such methods are not readily adaptable to high-throughput settings.

Recently, several methods have been employed to computationally identify putative cis-regulatory elements (Fig. 3) (for review, see Elnitski et al. 2006). The first method is based on the hypothesis that genes that are coexpressed under a particular condition are subject to control by the same TF(s). The advent of gene expression analysis by microarrays greatly facilitated the identification of coexpressed genes (DeRisi et al. 1997). Using a variety of computational algorithms, the regulatory regions of coexpressed genes can be interrogated for the occurrence of overrepresented DNA sequences that may constitute binding sites for the TF responsible for the coexpression (for information and performance on such algorithms, see Tompa et al. 2005).
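
The coexpression-based approach can be illustrated with a deliberately naive sketch that ranks short sequences (k-mers) by how much more often they occur in the regulatory regions of coexpressed genes than in a background set. The scoring scheme, function names, and toy sequences are assumptions made for illustration; the algorithms assessed by Tompa et al. (2005) use more sophisticated statistical models.

```python
from collections import Counter


def kmer_presence(seqs, k):
    """Count in how many sequences each k-mer occurs at least once."""
    counts = Counter()
    for s in seqs:
        s = s.upper()
        counts.update({s[i:i + k] for i in range(len(s) - k + 1)})
    return counts


def enriched_kmers(coexpressed, background, k=6, pseudo=1.0):
    """Rank k-mers by the ratio of presence frequencies (foreground / background).

    `pseudo` is a pseudocount that avoids division by zero for k-mers that
    are absent from the background set.
    """
    fg = kmer_presence(coexpressed, k)
    bg = kmer_presence(background, k)
    scores = {}
    for kmer, c in fg.items():
        f = (c + pseudo) / (len(coexpressed) + pseudo)
        b = (bg.get(kmer, 0) + pseudo) / (len(background) + pseudo)
        scores[kmer] = f / b
    # Highest score first; ties broken alphabetically for reproducibility.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Made-up promoter fragments sharing one 7-mer.
coexpressed = ["AAATGACTCATT", "CCTGACTCAGG", "GTGACTCAAC"]
background = ["ACGTACGTACGT", "TTTTGGGGCCCC", "AGAGAGAGAGAG"]
ranking = enriched_kmers(coexpressed, background, k=7)
```

In this toy input the only 7-mer present in all three coexpressed sequences ranks first; with real promoters, sequence composition and motif degeneracy make the problem considerably harder.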

Figure 3.

Different approaches that can be used for the identification of cis-regulatory DNA elements are highly complementary and interconnected. Cis-regulatory elements can be identified by interrogating the regulatory regions of coexpressed genes, by phylogenetic footprinting, or by experimentally identifying TF binding sites.

The second method, referred to as phylogenetic footprinting, is based on the conservation of functional cis-regulatory elements in closely related organisms. This method has been used to identify putative elements in yeast (Cliften et al. 2001, 2003; Kellis et al. 2003), Drosophila melanogaster (Glazov et al. 2005), and human genomes (Bejerano et al. 2004; Siepel et al. 2005; Woolfe et al. 2005; Xie et al. 2005). In addition to using computational tools to find putative cis-regulatory elements in complete genome sequences, experimentally mapped consensus TF binding motifs (see below) can also be used to interrogate a genome sequence of interest. However, depending on the length of the motif, many functional and nonfunctional sequences will be identified. Phylogenetic footprinting and coexpression can then be used to determine which motifs have a higher likelihood of being functional in vivo (Fig. 3; Elnitski et al. 2006).
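
Scanning a sequence with a mapped consensus motif and then retaining only conserved matches can be sketched as follows. This is a simplified illustration: it assumes two sequences that have already been aligned to equal length without gaps, it scans one strand only, and the consensus and sequences are invented for the example.

```python
import re

# IUPAC nucleotide codes expressed as regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def consensus_to_regex(consensus):
    """Compile a consensus into a lookahead pattern so overlapping hits are found."""
    body = "".join(IUPAC[c] for c in consensus.upper())
    return re.compile("(?=(" + body + "))")


def scan(seq, consensus):
    """Return 0-based start positions of all matches on the given strand."""
    pattern = consensus_to_regex(consensus)
    return [m.start() for m in pattern.finditer(seq.upper())]


def conserved_hits(ref, other, consensus):
    """Positions where both pre-aligned, gap-free sequences match the consensus."""
    if len(ref) != len(other):
        raise ValueError("sequences must be pre-aligned to equal length without gaps")
    return sorted(set(scan(ref, consensus)) & set(scan(other, consensus)))


# Invented sequences from two hypothetical related species.
ref_seq = "AATGACTCAGGGGTGAGTCATT"
other_seq = "AATGAGTCAGGGGTCCCTCATT"
all_hits = scan(ref_seq, "TGASTCA")
kept_hits = conserved_hits(ref_seq, other_seq, "TGASTCA")
```

Here the reference sequence contains two matches, but only the first is retained after the conservation filter, mirroring how footprinting narrows a list of candidate sites.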

Only a small portion of all TF binding sites that occur in a genome of interest have been identified to date. For instance, by comparative genomics, Xie and colleagues found 174 candidate DNA motifs that likely correspond to numerous TF binding sites in human promoters (Xie et al. 2005). However, these different elements may only represent ∼10% of all TF binding motifs as the human genome may encode more than 2000 TFs (Levine and Tjian 2003), each of which likely binds DNA with different specificity and affinity. On the other hand, it is likely that some TFs from one family may have overlapping binding specificities, and that therefore the number of different TF binding motifs may be considerably less than 2000. In addition, certain TFs may exclusively bind to regulatory elements that are located in transcriptional enhancers or silencers. These TF binding motifs will be missed in studies that focus solely on promoter sequences.

The computational prediction of cis-regulatory modules has relied on the observation that TF binding sites are often clustered. However, the generality of this phenomenon has not been investigated and, thus, it is not clear how many functional, nonclustered TF binding sites occur in the genome. In addition, most researchers have focused on elements that are conserved between related species. Such phylogenetic footprinting likely increases the specificity of motif finding. However, the sensitivity will suffer from only interrogating conserved sequences because many important, species-specific elements are not conserved. The success of phylogenetic footprinting for the identification of functional regulatory elements also depends on the evolutionary distance between the organisms used in the analysis: The use of closely related species may result in relatively low specificity and the use of distantly related species may result in high specificity, but relatively low sensitivity (Ruvinsky and Ruvkun 2003).
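
The trade-off between sensitivity and specificity can be stated explicitly. The sketch below uses the standard definitions, sensitivity = TP / (TP + FN) and specificity = TN / (TN + FP), and assumes, for illustration only, that the set of truly functional elements is known; all element identifiers are placeholders.

```python
def sensitivity_specificity(predicted, functional, candidates):
    """Compute (sensitivity, specificity) for a set of predicted elements.

    `candidates` is the universe of elements examined; `predicted` and
    `functional` are assumed to be subsets of it.
    """
    predicted, functional, candidates = set(predicted), set(functional), set(candidates)
    tp = len(predicted & functional)               # functional and predicted
    fn = len(functional - predicted)               # functional but missed
    fp = len(predicted - functional)               # predicted but nonfunctional
    tn = len(candidates - predicted - functional)  # correctly not predicted
    sens = tp / (tp + fn) if tp + fn else float("nan")
    spec = tn / (tn + fp) if tn + fp else float("nan")
    return sens, spec


# Ten hypothetical candidate elements, four of them functional.
sens, spec = sensitivity_specificity(
    predicted={0, 1, 7}, functional={0, 1, 2, 3}, candidates=range(10)
)
```

In this made-up case half of the functional elements are recovered (sensitivity 0.5) while five of six nonfunctional candidates are correctly rejected.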

Identifying network edges

TF–TF dimers

Many TFs bind their target genes as dimers. For instance, bZIP, bHLH, and nuclear hormone receptor TFs all dimerize. The comprehensive identification of TF dimers requires the use of protein–protein interaction detection methods that can be used in (semi) high-throughput settings. One assay that is particularly suited to identify binary protein–protein interactions is the yeast two-hybrid system (Fields and Song 1989), and multiple putative TF homo- and heterodimers have already been found using this system (Li et al. 2004; Reece-Hoyes et al. 2005; Rual et al. 2005; Stelzl et al. 2005). Putative TF dimers have also been identified by protein arrays. For instance, Newman and Keating tested >2400 combinations of human bZIP protein–protein interactions and found multiple putative dimers (Newman and Keating 2003). Finally, 15 putative yeast TF–TF heterodimers have been identified by large-scale TAP-TAG purification methods (Gavin et al. 2006; Krogan et al. 2006).

Only a small portion of all TF dimers has been identified to date. For instance, <10% of all predicted TFs are present in the current C. elegans protein–protein interaction network (Li et al. 2004), even though at least 30% of all TFs belong to the bZIP, bHLH, or nuclear hormone receptor families (Reece-Hoyes et al. 2005). Since TF dimerization may be condition-dependent, many TF dimers have also likely been missed by TAP-TAG assays in yeast. In the future, it will be important to comprehensively map all dimerization interactions between TFs and to incorporate this information into network models (Fig. 2).

Interactions between TFs and their target genes/sequences

Protein–DNA interactions between TFs and their target DNA sequences can be mapped using two conceptually different strategies. First, one can start with a TF or set of TFs of interest and identify the target genes and/or cis-regulatory elements to which these TFs bind. Alternatively, one can take a DNA sequence as a starting point and aim to identify the TFs that can interact with this sequence. We refer to these strategies as “TF-centered” and “gene-centered” methods, respectively (Fig. 4A; Deplancke et al. 2006).

Figure 4.

High-throughput methods for protein–DNA interaction mapping. (A) Protein–DNA interactions can be mapped using either TF- or gene-centered methods, as indicated by the arrows. Y1H, yeast one-hybrid assays; ChIP, chromatin immunoprecipitations; PBM, protein binding microarray; B1H, bacterial one-hybrid system; DamID, DNA adenine methyltransferase-ID. (B) ChIP is the most commonly used TF-centered method. It is based on the precipitation of a TF (blue) and its associated DNA fragments (red) using an anti-TF antibody (purple). Multiple readouts of the precipitated DNA can be used, including PCR with specific primers, tiling microarrays (chip), cloning and sequencing, and paired-end ditag sequencing. (C) Y1H assays are based on interactions of hybrid “prey” proteins with a DNA “bait” of interest. The hybrid protein consists of a protein that can bind DNA (blue) and a heterologous transcription activation domain (AD, yellow). The use of such a domain enables the identification of both activators and repressors of transcription. The readout for a protein–DNA interaction is the expression of one or more reporter genes. Prey identity is determined by PCR and sequencing. In high-throughput Y1H assays, vectors containing Gateway recombination sites (GW) are used to enable standardized cloning from promoterome resources.

TF-centered protein–DNA interaction mapping

The most widely used protein–DNA interaction mapping methods are TF-centered, and most are based on chromatin immunoprecipitation (ChIP) (for review, see Elnitski et al. 2006). In ChIP assays an anti-TF antibody is used to precipitate DNA bound by the TF in vivo (Fig. 4B). These DNA fragments can subsequently be identified and quantified using a variety of readouts, including PCR, microarrays (referred to as ChIP-on-chip), and cloning/sequencing as in SAGE-like methods (serial analysis of gene expression, Fig. 4B; for review, see Blais and Dynlacht 2005; Elnitski et al. 2006). For yeast ChIP-on-chip assays, endogenous TFs were replaced by hybrid proteins in which the TFs were fused to a universal protein tag (Lee et al. 2002). Thus, almost 200 individual yeast strains were created, each carrying a different TF-TAG fusion protein. The advantage of this strategy is that the same antibody can be used for each TF. ChIP-on-chip has been used to identify target sequences for most yeast TFs under standard laboratory growth conditions (Lee et al. 2002). In addition, target binding has been examined under multiple experimental conditions for a subset of these TFs (Harbison et al. 2004; Workman et al. 2006). Beyond yeast, ChIP-on-chip has been used for a variety of mammalian TFs in tissue culture cells (Cawley et al. 2004; Carroll et al. 2005; Bieda et al. 2006). So far, only a few studies have focused on the DNA binding of metazoan TFs in their natural environment. For instance, in a pioneering study, endogenous promoters bound by HNF1α, HNF4α, and HNF6 within human liver and pancreas were identified (Odom et al. 2004). Similarly, by a combination of computational target prediction and ChIP-on-chip, many target promoters bound by CREB were identified in both tissue culture cells and primary hepatocytes (Zhang et al. 2005). ChIP-on-chip was also used to identify promoters bound by the TFs OCT4, SOX2, and NANOG in human embryonic stem cells (Boyer et al. 2005). Finally, ChIP-cloning (i.e., the cloning and sequencing of precipitated DNA) was used to identify in vivo target genes for the C. elegans FOXO TF, DAF-16 (Oh et al. 2005).

In DamID, a TF is fused to Escherichia coli DNA adenine methyltransferase (Dam) and expressed in tissue culture cells or intact model organisms (van Steensel and Henikoff 2000). Upon binding of the TF to DNA, the surrounding nucleotides are methylated. This methylation can be detected by PCR or microarrays after immunoprecipitation of methylated DNA. DamID has mainly been used to identify the DNA targets of general chromatin-binding proteins, but has also been used to dissect the Drosophila Myc TF network (Orian et al. 2003).

In protein-binding microarrays (PBMs), a TF is fused to GST, expressed in bacteria or yeast, purified, and hybridized to a double-stranded DNA array that contains DNA sequences of interest (Mukherjee et al. 2004). To date, this method has been used to find targets for the yeast TFs Abf1, Rap1, and Mig1. The target sequences were then used to identify the consensus TF binding sites for each of these factors. DIP-chip can also be used to identify consensus TF binding sites. This method uses naked genomic DNA and a purified TF. Briefly, after incubation of the DNA with the factor, an immunoprecipitation is performed and TF-associated DNA is identified by microarray analysis (Liu et al. 2005). Although both DIP-chip and PBM are carried out in vitro, the TF binding sites obtained were in very good agreement with data obtained using in vivo methods, suggesting that they are effective and rapid methods to identify TF binding specificities, and, perhaps, affinities (Mukherjee et al. 2004; Liu et al. 2005).

In bacterial one-hybrid assays, a plasmid encoding a TF of interest is transformed into bacteria containing a library of random DNA elements (Meng et al. 2005). Binding of the TF to a specific element is selected on specific media and positive colonies are analyzed by sequencing. After aligning multiple sequences bound by an individual TF, its recognition sequence can be derived. This sequence can then be used to search the genome to identify putative TF target genes. Since TF binding sites are generally short, many such sequences will occur in a genome, only some of which will likely be functional.
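
Deriving a recognition sequence from aligned, selected sequences can be sketched with a position frequency matrix. The example below is a minimal illustration, not the analysis pipeline of Meng et al. (2005): it assumes the bound sequences have already been aligned and trimmed to equal length, and the `min_fraction` threshold and toy sites are arbitrary choices.

```python
from collections import Counter


def position_frequency_matrix(sites):
    """Column-wise base counts for equal-length, pre-aligned binding sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("sites must be aligned to equal length")
    return [Counter(s[i].upper() for s in sites) for i in range(length)]


def consensus(pfm, min_fraction=0.6):
    """Most frequent base per column, or N when no base reaches min_fraction."""
    out = []
    for col in pfm:
        base, count = col.most_common(1)[0]
        out.append(base if count / sum(col.values()) >= min_fraction else "N")
    return "".join(out)


# Invented selected sequences for one hypothetical TF.
selected = ["TGACTCA", "TGAGTCA", "TGACTCA", "TGACTAA", "TGACTCA"]
pfm = position_frequency_matrix(selected)
motif = consensus(pfm)
```

A consensus string discards the per-position frequencies held in the matrix; retaining the matrix itself allows candidate genomic sites to be ranked rather than simply matched.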

Gene-centered protein–DNA interaction mapping

Eukaryotic genomes encode hundreds of putative TFs, of which only a handful has been analyzed by TF-centered methods. The identification of protein–DNA interactions involving uncharacterized, predicted TFs has recently been facilitated by the development of high-throughput, gene-centered protein–DNA interaction mapping methods, such as yeast one-hybrid (Y1H) assays. The Y1H system was first developed to facilitate the identification of proteins that can bind to multiple copies of a short DNA sequence of interest (the “DNA bait”) (Li and Herskowitz 1993; Wang and Reed 1993). This method is not suitable for the unbiased, comprehensive mapping of protein–DNA interactions with longer DNA fragments because the cis-regulatory elements that contribute to gene expression are only known for a few genes, and because the system was based on traditional, restriction enzyme-based cloning methods. To enable the unbiased, large-scale detection of protein–DNA interactions, we developed a high-throughput version of the Y1H system (Fig. 4C; Deplancke et al. 2004). This system is compatible with Gateway cloning, a recombinational cloning system by which many fragments (i.e., DNA baits) can be cloned simultaneously (Hartley et al. 2000; Walhout et al. 2000). This Y1H system can be used with single copy gene promoters as DNA baits and, therefore, allows the unbiased identification of TF-promoter interactions without prior knowledge about the cis-regulatory elements that reside within the promoter. The system is compatible with “promoterome” resources, collections of Gateway-cloned promoters, for the high-throughput cloning of DNA baits (Dupuy et al. 2004). The Gateway-compatible Y1H system also makes use of Gateway-compatible “protein prey” resources. For instance, mini-libraries consisting solely of predicted TFs can be created and screened successfully. 
This is important as TFs that are expressed at low levels or in only a few cells in an organism are difficult to retrieve from standard cDNA libraries (Deplancke et al. 2004). Recently, we used the Gateway-compatible Y1H system to map a first C. elegans gene-centered protein–DNA interaction network, containing 283 protein–DNA interactions between 72 promoters and 117 proteins, 107 of which are predicted C. elegans TFs and 10 of which may be novel DNA binding proteins (Deplancke et al. 2006).

As with any large-scale, high-throughput method, each of the methods discussed will miss some protein–protein and protein–DNA interactions and wrongly identify others. Some of these methods identify interactions that do occur in vivo (e.g., ChIP with endogenous TFs) and others find interactions that can occur (e.g., in vitro methods, yeast two-hybrid and yeast one-hybrid assays). Protein–DNA interactions that occur infrequently, i.e., in a few cells or during a short time period in development or disease, will likely be missed by the first methods but may be found by the second. However, interactions found by the second do not necessarily occur in vivo. To assure the generation of high-quality data sets, it is desirable to filter protein–protein and protein–DNA interaction data, and to only include high-confidence interactions, i.e., interactions that are likely relevant. Such criteria have previously been used for large-scale protein–protein interaction maps, generated by high-throughput yeast two-hybrid assays (Li et al. 2004; Rual et al. 2005; Stelzl et al. 2005), and we have recently devised stringent criteria to filter Y1H data (Deplancke et al. 2006). In summary, both sensitivity and specificity are important issues to consider when choosing a protein–protein or protein–DNA interaction identification method, and the choice depends on the question being addressed. As the various methods are highly complementary, it is desirable, in the long term, to use a multitude of techniques for comprehensive, high-quality protein–protein and protein–DNA interaction mapping.
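
Filtering for high-confidence interactions amounts to applying explicit, reproducible criteria to each candidate interaction. The sketch below is purely illustrative: the record fields and thresholds are hypothetical and are not the published criteria of Deplancke et al. (2006) or of the yeast two-hybrid studies cited above.

```python
from dataclasses import dataclass


@dataclass
class Interaction:
    tf: str
    promoter: str
    times_detected: int      # independent detections (hypothetical field)
    reporters_positive: int  # reporter genes activated (hypothetical field)


def high_confidence(interactions, min_detections=2, min_reporters=2):
    """Keep interactions that meet both (hypothetical) evidence thresholds."""
    return [
        x for x in interactions
        if x.times_detected >= min_detections
        and x.reporters_positive >= min_reporters
    ]


# Placeholder records, not real data.
raw = [
    Interaction("TF_A", "promoter_1", times_detected=3, reporters_positive=2),
    Interaction("TF_B", "promoter_1", times_detected=1, reporters_positive=2),
    Interaction("TF_C", "promoter_2", times_detected=4, reporters_positive=1),
]
filtered = high_confidence(raw)
```

Raising the thresholds trades sensitivity for specificity, so the appropriate settings depend on the question being addressed.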

Emerging concepts and future challenges

Large sets of protein–protein and protein–DNA interactions can be visualized as network models using various freely available software packages, including Cytoscape (Shannon et al. 2003) and N-browse (Lall et al. 2006).

Network models serve multiple purposes. For instance, they provide a great tool for the visualization and navigation of large interaction data sets. In addition, networks can be analyzed at different levels, i.e., at the level of the network as a whole, the level of subgraphs and network motifs, and the level of individual nodes or edges. By doing so, they enable the derivation of hypotheses regarding different levels of gene expression.

Network analysis

Once visualized, networks can be analyzed topologically using different network parameters such as connectivity, path length, and clustering coefficient (for review, see Barabasi and Oltvai 2004). As has been observed for other networks, transcription regulatory networks are highly connected and display a scale-free degree distribution (Albert et al. 2000), i.e., they contain a small number of disproportionately highly connected nodes, or hubs, and many less well-connected nodes (Guelzim et al. 2002; Lee et al. 2002; Luscombe et al. 2004; Deplancke et al. 2006). Transcription regulatory networks potentially contain two types of hubs: TF hubs (TFs that bind many promoters) and promoter hubs (promoters that interact with many TFs). Interestingly, transcription regulatory networks predominantly contain TF hubs, rather than promoter hubs (Guelzim et al. 2002; Deplancke et al. 2006). As in other networks, such hubs provide integrity to the network: When nodes are randomly removed, the network stays connected. However, when hubs are sequentially removed, the network disintegrates rapidly (for review, see Barabasi and Oltvai 2004). The biological implication of this became apparent when it was demonstrated that TF hubs have a higher tendency to be essential for the organism (Jeong et al. 2001; Yu et al. 2004; Deplancke et al. 2006).
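
The distinction between TF hubs and promoter hubs, and the effect of removing a hub, can be illustrated with a minimal Python sketch. The edge list, the node names, and the helper functions `degrees` and `largest_component` are hypothetical and chosen only for illustration; they are not data or methods from the studies cited above.

```python
from collections import defaultdict

# Hypothetical TF -> promoter interactions (placeholder names, not real data).
EDGES = [
    ("TF_A", "p1"), ("TF_A", "p2"), ("TF_A", "p3"), ("TF_A", "p4"),
    ("TF_B", "p4"), ("TF_B", "p5"),
    ("TF_C", "p5"),
]

def degrees(edges):
    """Return (out-degree per TF, in-degree per promoter) as dictionaries."""
    out_deg, in_deg = defaultdict(int), defaultdict(int)
    for tf, promoter in set(edges):  # set() ignores duplicate interactions
        out_deg[tf] += 1
        in_deg[promoter] += 1
    return dict(out_deg), dict(in_deg)

def largest_component(edges, removed=frozenset()):
    """Size of the largest connected component, ignoring edge direction.

    Nodes listed in `removed` are deleted together with their edges.
    """
    adjacency = defaultdict(set)
    for a, b in edges:
        if a not in removed and b not in removed:
            adjacency[a].add(b)
            adjacency[b].add(a)
    seen, best = set(), 0
    for start in adjacency:
        if start in seen:
            continue
        seen.add(start)
        stack, size = [start], 0
        while stack:  # depth-first traversal of one component
            node = stack.pop()
            size += 1
            for neighbor in adjacency[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        best = max(best, size)
    return best

out_deg, in_deg = degrees(EDGES)
hub = max(out_deg, key=out_deg.get)  # the TF that binds the most promoters
print(hub, out_deg[hub])
print(largest_component(EDGES))
print(largest_component(EDGES, removed={hub}))
```

In this toy network, deleting the most connected TF fragments the network far more than deleting a poorly connected TF, which is the qualitative behavior described for hubs. A real analysis would compare targeted removal with many random removals.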

We recently mapped a protein–DNA interaction network of genes expressed or involved in the C. elegans digestive tract (Deplancke et al. 2006). By visualizing and analyzing this network, we can derive hypotheses at different levels of gene regulation. For instance, we observed that the network is highly connected, contains several TF hubs, and is enriched for TFs expressed in the digestive tract (Fig. 5A; Deplancke et al. 2006). In addition, we found that most promoters are bound by a combination of TF hubs and less well-connected TFs, some of which may be master regulators. This led to a model in which we propose that C. elegans transcription is regulated by a layered organization of TF function (Fig. 5A; Deplancke et al. 2006). The digestive tract is predominantly composed of the pharynx and intestine, each of which is derived from distinct germ layers. We found that TF hubs interact with both pharyngeal and intestinal genes. This suggests that these TFs function as global regulators of gene expression and leads to the prediction that they interact with promoters of large numbers of genes that are expressed in other tissues as well.

Figure 5.

Deriving biological hypotheses from regulatory networks. (A) A protein–DNA interaction network of C. elegans digestive tract genes was used to derive a three-layered model of transcription regulation. Reprinted with permission from Elsevier © 2006, Deplancke et al. 2006. (B) Example of a protein–DNA interaction network subgraph. (C) Example of a protein–DNA interaction network motif. See main text for details.

In addition to hypotheses regarding gene regulation at the level of an entire network and system, one can also derive hypotheses by zooming into network subgraphs.

Network subgraphs

Network subgraphs can be network modules, motif clusters, or other network neighborhoods. A network module can be defined as a subgraph consisting of highly interconnected nodes that may fulfill a particular biological function. Network modularity has been observed in yeast regulatory networks (Ihmels et al. 2002; Bar-Joseph et al. 2003; Segal et al. 2003; Luscombe et al. 2004), although there are few modules that can be clearly separated from the main network component (Babu et al. 2004). This may be because individual yeast TFs may function in multiple, apparently unrelated pathways. These observations suggest that regulatory networks of higher eukaryotes such as C. elegans may be organized in modules as well but that these modules share multiple TFs. This hypothesis is in agreement with our observation that many C. elegans TFs are expressed in multiple tissues (Deplancke et al. 2006).

Figure 5B shows an example of a subgraph of the C. elegans digestive tract protein–DNA interaction network that can be used to derive specific biological hypotheses. This subgraph is composed of multiple bifan motifs (see below for network motifs) of the TF hubs DIE-1 and ZTF-1 and their target promoters. Interestingly, these TFs share 22 promoters, which is 73% of their combined targets (Fig. 5B). This leads to the prediction that DIE-1 and ZTF-1 have a similar biochemical function: For instance, they may have similar DNA binding motifs. The observation that they do not share all of their targets suggests that the motifs are not completely identical. Interestingly, these proteins share no homology in their primary amino acid sequence, but both proteins do contain two pairs of C2H2 zinc fingers that are separated by a long amino acid sequence. The observation of shared targets also leads to the prediction that these two TFs may share biological functions. However, while knockdown of DIE-1 is lethal, ZTF-1 is dispensable for the function of the organism. Conversely, we could not create stable transgenic lines expressing ZTF-1, suggesting that overexpression of this protein may be lethal (Deplancke et al. 2006). Whereas DIE-1 can activate gene expression (Deplancke et al. 2006), the transcriptional function of ZTF-1 remains to be elucidated.
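
The fraction of shared targets quoted above is the number of promoters bound by both TFs divided by the number bound by either one. As a minimal Python sketch, the function `shared_target_fraction` below computes this ratio. The promoter names are placeholders, and the split of the unshared promoters between the two TFs (five and three) is an assumption made only so that the totals match the 22 shared promoters and the 30 combined targets implied by the 73% figure.

```python
def shared_target_fraction(targets_a, targets_b):
    """Fraction of the combined targets of two TFs that are bound by both."""
    a, b = set(targets_a), set(targets_b)
    combined = a | b
    if not combined:
        return 0.0
    return len(a & b) / len(combined)

# Placeholder promoter names; only the counts matter here.
shared = {f"promoter_{i}" for i in range(22)}
only_first_tf = {f"promoter_a{i}" for i in range(5)}   # assumed split
only_second_tf = {f"promoter_b{i}" for i in range(3)}  # assumed split

fraction = shared_target_fraction(shared | only_first_tf, shared | only_second_tf)
print(f"{fraction:.0%}")  # 22 of 30 combined targets, i.e., 73%
```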

Network motifs

Network motifs are the building blocks of networks (Milo et al. 2002). Several motifs are overrepresented in experimentally derived transcription regulatory networks compared with random networks (Milo et al. 2002; Shen-Orr et al. 2002). Such motifs provide insights into the properties of networks and the propagation of regulatory signals. As such, the analysis of network motifs may help to uncover the biochemical functions of both TFs and their target genes. For instance, feed-forward loops are overrepresented in transcription regulatory networks of various organisms (Milo et al. 2002; Shen-Orr et al. 2002). This may not be surprising, as such loops offer a rapid gene expression output, for example, in response to outside signals. In contrast, feedback or autoregulatory loops can either reinforce or diminish a transcriptional output, whereas single input motifs can confer strong coexpression of downstream target genes (Shen-Orr et al. 2002). The analysis of each TF and the network context in which it functions will be important to unravel how each factor contributes to differential gene expression.
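
A feed-forward loop consists of three nodes in which X regulates Y and both X and Y regulate Z. The Python sketch below enumerates such loops, and also lists autoregulatory nodes, in a small directed network. The edge list and function names are hypothetical, and the sketch only counts motif instances: judging whether a motif is overrepresented would additionally require comparison with randomized networks, which is not shown here.

```python
from collections import defaultdict

# Hypothetical regulator -> target edges (placeholder names).
EDGES = [("X", "Y"), ("Y", "Z"), ("X", "Z"), ("Z", "W"), ("W", "W")]

def feed_forward_loops(edges):
    """Return (x, y, z) triples with edges x->y, y->z, and x->z."""
    targets = defaultdict(set)
    for source, target in edges:
        if source != target:  # self-edges are handled separately
            targets[source].add(target)
    loops = []
    for x in sorted(targets):
        for y in sorted(targets[x]):
            for z in sorted(targets.get(y, ())):
                if z != x and z in targets[x]:
                    loops.append((x, y, z))
    return loops

def autoregulated(edges):
    """Return nodes that carry an edge onto themselves."""
    return sorted({source for source, target in edges if source == target})

print(feed_forward_loops(EDGES))
print(autoregulated(EDGES))
```

This sketch does not exclude triples that also contain a feedback edge between X and Y; a stricter definition could filter those out.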

Network motifs can also be used to derive specific biological hypotheses, either for individual promoters or for individual TFs. For instance, we found a single input motif in which ZTF-2 interacts specifically with the promoters of five pharyngeal genes (Deplancke et al. 2006). This led to the prediction that ZTF-2 is a regulator of pharyngeal gene expression and that these promoters share a pharyngeal gene element to which ZTF-2 binds. We tested these hypotheses experimentally and found that ZTF-2 is itself expressed in the pharynx (and elsewhere), and that a knockdown of ztf-2 results in a pharyngeal phenotype. Furthermore, we used the five promoter sequences to define a ZTF-2 binding motif and found that it is highly similar to a previously described pharyngeal gene element. Finally, we demonstrated that ZTF-2 represses expression of its pharyngeal targets and that it can bind the pharyngeal element in vivo. Taken together, the mapping, analysis, and deconvolution of a protein–DNA interaction network into subgraphs and motifs can be used to derive biological hypotheses regarding differential gene expression at different levels.
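
One simple way to search an interaction list for candidate single input motifs is to collect, for each TF, the promoters that are bound by that TF and no other. The Python sketch below does this under stated assumptions: the interaction list is invented, the function name `single_input_modules` is hypothetical, and the `min_targets` cutoff is an arbitrary illustrative parameter rather than a criterion taken from the cited work.

```python
from collections import defaultdict

# Hypothetical TF -> promoter interactions (placeholder names).
EDGES = [
    ("TF_X", "p1"), ("TF_X", "p2"), ("TF_X", "p3"), ("TF_X", "p4"), ("TF_X", "p5"),
    ("TF_Y", "p6"), ("TF_Y", "p7"),
    ("TF_Z", "p7"),
]

def single_input_modules(edges, min_targets=3):
    """Map each TF to the promoters bound only by that TF.

    TFs with fewer than `min_targets` exclusive promoters are dropped.
    """
    regulators = defaultdict(set)
    for tf, promoter in edges:
        regulators[promoter].add(tf)
    exclusive = defaultdict(list)
    for promoter in sorted(regulators):
        tfs = regulators[promoter]
        if len(tfs) == 1:
            exclusive[next(iter(tfs))].append(promoter)
    return {tf: ps for tf, ps in exclusive.items() if len(ps) >= min_targets}

print(single_input_modules(EDGES))
```

In this toy example only TF_X is reported, with five exclusive promoters. Such a candidate is a starting point for a hypothesis and would still need experimental testing of the kind described above.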

Future challenges

The transcription regulatory networks that are currently available are likely to be a small representation of all the interactions that occur in vivo. Even in yeast, where binding of each TF has been examined under standard laboratory conditions and binding of a few under multiple conditions (Lee et al. 2002; Harbison et al. 2004; Workman et al. 2006), the regulatory information is likely far from complete. This is because many conditions remain to be tested and because TFs that bind DNA with low specificity or affinity may be difficult to analyze. The transcription regulatory networks that have been mapped in higher eukaryotes represent an even smaller sample of the entire network. This is because so far (1) ChIP-on-chip assays have mainly utilized arrays containing probes corresponding to promoter regions and, thus, TF binding to cis-regulatory modules located elsewhere in the genome will be missed; (2) only very few TFs have been examined by TF-centered methods; (3) <1% of all promoters in C. elegans have been examined by gene-centered methods (Deplancke et al. 2006); and (4) not all cis-regulatory elements and TF binding sites have been identified either computationally (Xie et al. 2005) or experimentally (Mukherjee et al. 2004; Meng et al. 2005).

Cis-regulatory elements or TF binding sites are often found in intergenic regions. When intergenic regions are short (e.g., in yeast and C. elegans) and reside between genes that are transcribed from opposite strands, perhaps by bidirectional promoters, it is difficult to infer which of the two genes will be affected through such regulatory sequences. Similarly, in the genomes of higher eukaryotes, cis-regulatory modules can be located far from a transcribed unit, and it may be difficult to infer the gene that is controlled by the cis-regulatory module. In the future, it will be important to scan larger genomic regions, both in cis and in trans, for genes that can be affected by different individual cis-regulatory modules.

From protein–DNA interaction networks to transcription regulatory networks

Networks that are solely based on protein–protein and protein–DNA interactions do not contain regulatory information because the protein–protein and protein–DNA interaction detection methods discussed above do not provide insight into the consequences of physical interactions (e.g., activation or repression of transcription). The transcriptional consequences of protein–DNA interactions need to be superimposed onto protein–DNA interaction networks by integrating interaction data with other data types (Deplancke et al. 2006). This can be done either at the level of individual interactions using detailed and often labor-intensive methods such as quantitative RT-PCR and RNAi (Baugh et al. 2005; Oh et al. 2005; Deplancke et al. 2006) or at the network level by integration with other large data sets, such as expression profiles (Lee et al. 2002; Segal et al. 2003; Yu et al. 2003; Luscombe et al. 2004). In the future, it will be important to develop methods that can be used to map, at a large scale, the transcriptional activity of each TF.
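
As a minimal sketch of how regulatory information could be superimposed onto physical interactions, the Python example below labels each TF–target edge using the change in target expression after knockdown of the TF. Everything in it is an assumption made for illustration: the gene names, the log2 fold-change values, the threshold of 1.0, and the function `annotate_edges`. It is not the procedure used in the studies cited above.

```python
def annotate_edges(interactions, knockdown_log2fc, threshold=1.0):
    """Label each physical interaction with a putative regulatory effect.

    knockdown_log2fc maps (tf, target) to the log2 fold change in target
    expression after knockdown of the TF (hypothetical measurements).
    """
    annotated = {}
    for tf, target in interactions:
        change = knockdown_log2fc.get((tf, target))
        if change is None:
            label = "untested"
        elif change <= -threshold:
            label = "activation"  # target drops when the TF is removed
        elif change >= threshold:
            label = "repression"  # target rises when the TF is removed
        else:
            label = "no detectable effect"
        annotated[(tf, target)] = label
    return annotated

# Placeholder interactions and expression changes.
interactions = [("TF_A", "gene1"), ("TF_A", "gene2"), ("TF_B", "gene3"), ("TF_B", "gene4")]
log2fc = {("TF_A", "gene1"): -2.1, ("TF_A", "gene2"): 1.6, ("TF_B", "gene3"): 0.2}

for edge, label in annotate_edges(interactions, log2fc).items():
    print(edge, label)
```

Requiring both a physical interaction and an expression change reduces, but does not eliminate, the chance that an observed effect is indirect.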

Spatio-temporal network modeling

Protein–protein and protein–DNA interaction networks and transcription regulatory networks are static models of all the transcriptional events that can occur in a system of interest. To fully understand how such networks contribute to system development, function, and pathology, it is important to unravel where and when which parts of the network are active and what the biological consequences of this activity are (Davidson et al. 2002). Such analysis has again been pioneered in yeast. For instance, the binding of the cell cycle regulatory TF complexes SBF and MBF has been analyzed during different phases of the cell cycle (Horak et al. 2002). It was found that these TFs bind to and regulate many other TF-encoding genes that are involved in cell cycle progression and/or differentiation. In addition, more extensive networks that are active under different endogenous and exogenous experimental conditions were compiled (Luscombe et al. 2004). Surprisingly, it was found that these different subnetworks have different topological properties and motifs that may reflect their particular function. In the future, it will be important to extrapolate where and when which parts of transcription regulatory networks are active in higher eukaryotes as well.
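
A static network can be turned into condition-specific subnetworks by retaining only the interactions whose TF is present in a given tissue or condition. The Python sketch below illustrates this idea with invented names and a hypothetical function `condition_subnetworks`; the presence of a TF is used here as a crude proxy for its activity, which is an assumption.

```python
def condition_subnetworks(edges, tfs_present):
    """Return, per condition, the interactions whose TF is present.

    tfs_present maps a condition (e.g., a tissue or stage) to the set of
    TFs detected there (hypothetical annotations).
    """
    return {
        condition: [(tf, target) for tf, target in edges if tf in present]
        for condition, present in tfs_present.items()
    }

# Placeholder static network and expression annotations.
EDGES = [("TF_A", "gene1"), ("TF_A", "gene2"), ("TF_B", "gene2"), ("TF_C", "gene3")]
TFS_PRESENT = {"pharynx": {"TF_A", "TF_B"}, "intestine": {"TF_A", "TF_C"}}

subnetworks = condition_subnetworks(EDGES, TFS_PRESENT)
for condition, active_edges in subnetworks.items():
    print(condition, active_edges)
```

Each resulting subnetwork can then be analyzed separately, for example, for its degree distribution or motif content.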

In the longer term, transcription regulatory networks need to be integrated to model more comprehensive regulatory networks in which transcription regulation of the expression of both protein-coding and microRNA-encoding genes is combined with gene regulation by both RNA binding proteins and microRNAs (Fig. 2E). Such networks themselves need to be integrated with spatio-temporal information about gene expression and TF/microRNA activity, and with phenotypes conferred by TFs and microRNAs, to obtain a comprehensive picture of regulatory networks and how they control the development, function, and pathology of complex metazoan systems.

Acknowledgments

I thank members of the Walhout laboratory for discussions, J. Lieb for information on intergenic region length in yeast, S. Ryder for advice about RNA binding proteins, I. Barrasa for help with yeast TF–TF dimer retrieval, and B. Deplancke, V. Vermeirssen, N. Martinez, and J. Dekker for critical reading of the manuscript. The Walhout laboratory is supported by NIH grants CA097516, DK068429, and DK071713.

Footnotes

E-mail marian.walhout@umassmed.edu; fax (508) 856-5460.
Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5321506
Copyright © 2006, Cold Spring Harbor Laboratory Press

References

Aerts, S., van Loo, P., Thijs, G., Moreau, Y., de Moor, B. (2003) Computational detection of cis-regulatory modules. Bioinformatics 19:ii5–ii14.
Albert, R., Jeong, H., Barabasi, A.-L. (2000) Error and attack tolerance of complex networks. Nature 406:378–382.
Ambros, V. (2004) The functions of animal microRNAs. Nature 431:350–355.
Babu, M.M., Luscombe, N.M., Aravind, L., Gerstein, M., Teichmann, S.A. (2004) Structure and evolution of transcriptional regulatory networks. Curr. Opin. Struct. Biol. 14:283–291.
Barabasi, A.L., Oltvai, Z.N. (2004) Network biology: Understanding the cell’s functional organization. Nat. Rev. Genet. 5:101–113.
Bar-Joseph, Z., Gerber, G.K., Lee, T.I., Rinaldi, N.J., Yoo, J.Y., Robert, F., Gordon, D.B., Fraenkel, E., Jaakkola, T.S., Young, R.A., et al. (2003) Computational discovery of gene modules and regulatory networks. Nat. Biotechnol. 21:1337–1342.
Baugh, L.R., Hill, A.A., Claggett, J.M., Hill-Harfe, K., Wen, J.C., Slonim, D.K., Brown, E.L., Hunter, C.P. (2005) The homeodomain protein PAL-1 specifies a lineage-specific regulatory network in the C. elegans embryo. Development 132:1843–1854.
Bejerano, G., Pheasant, M., Makunin, I., Stephen, S., Kent, W.J., Mattick, J.S., Haussler, D. (2004) Ultraconserved elements in the human genome. Science 304:1321–1325.
Bieda, M., Xu, X., Singer, M.A., Green, R., Farnham, P.J. (2006) Unbiased location analysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome. Genome Res. 16:595–605.
Blais, A., Dynlacht, B.D. (2005) Constructing transcriptional regulatory networks. Genes & Dev. 19:1499–1511.
Blanchette, M., Bataille, A.R., Chen, X., Poitras, C., Laganiere, J., Lefebvre, D., Deblois, G., Giguere, V., Ferretti, V., Bergeron, D., et al. (2006) Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression. Genome Res. 16:656–668.
Boyer, L.A., Lee, T.I., Cole, M.F., Johnstone, S.E., Levine, S.S., Zucker, J.P., Guenther, M.G., Kumar, R.M., Murray, H.L., Jenner, R.G., et al. (2005) Core transcriptional regulatory circuitry in human embryonic stem cells. Cell 122:947–956.
Carninci, P., Kasukawa, T., Katayama, S., Gough, J., Frith, M.C., Maeda, N., Oyama, R., Ravasi, T., Lenhard, B., Wells, C., et al. (2005) The transcriptional landscape of the mammalian genome. Science 309:1559–1563.
Carninci, P., Sandelin, A., Lenhard, B., Katayama, S., Shimokawa, K., Ponjavic, J., Semple, C.A., Taylor, M.S., Engstrom, P.G., Frith, M.C., et al. (2006) Genome-wide analysis of mammalian promoter architecture and evolution. Nat. Genet. 38:626–635.
Carroll, J.S., Liu, X.S., Brodsky, A.S., Li, W., Meyer, C.A., Szary, A.J., Eeckhoute, J., Shao, W.L., Hestermann, E.V., Geistlinger, T.R., et al. (2005) Chromosome-wide mapping of estrogen receptor binding reveals long-range regulation requiring the forkhead protein FoxA1. Cell 122:33–43.
Cawley, S., Bekiranov, S., Ng, H.H., Kapranov, P., Sekinger, E.A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J., Williams, A.J., et al. (2004) Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell 116:499–509.
Cliften, P.F., Hillier, L.W., Fulton, L., Graves, T., Miner, T., Gish, W.R., Waterston, R.H., Johnston, M. (2001) Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. Genome Res. 11:1175–1186.
Cliften, P., Sudarsanam, P., Desikan, A., Fulton, L., Fulton, B., Majors, J., Waterston, R., Cohen, B.A., Johnston, M. (2003) Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science 301:71–76.
Cooper, S.J., Trinklein, N.D., Anton, E.D., Nguyen, L., Myers, R.M. (2006) Comprehensive analysis of transcriptional promoter structure and function in 1% of the human genome. Genome Res. 16:1–10.
Crawford, G.E., Davis, S., Scacheri, P.C., Renaud, G., Halawi, M.J., Erdos, M.R., Green, R., Meltzer, P.S., Wolfsberg, T.G., Collins, F.S. (2006a) DNase-chip: A high-resolution method to identify DNaseI hypersensitive sites using tiled microarrays. Nat. Methods 3:503–509.
Crawford, G.E., Holt, I.E., Whittle, J., Webb, B.D., Tai, D., Davis, S., Margulies, E.H., Chen, Y., Bernat, J.A., Ginsburg, D., et al. (2006b) Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). Genome Res. 16:123–131.
Davidson, E.H. (2001) Genomic regulatory systems: Development and evolution (Academic Press, San Diego).
Davidson, E.H., Rast, J.P., Oliveri, P., Ransick, A., Calestani, C., Yuh, C.-H., Minokawa, T., Amore, G., Hinman, V., Arenas-Mena, C., et al. (2002) A genomic regulatory network for development. Science 295:1669–1678.
Deplancke, B., Dupuy, D., Vidal, M., Walhout, A.J.M. (2004) A Gateway-compatible yeast one-hybrid system. Genome Res. 14:2093–2101.
Deplancke, B., Mukhopadhyay, A., Ao, W., Elewa, A.M., Grove, C.A., Martinez, N.J., Sequerra, R., Doucette-Stamm, L., Reece-Hoyes, J.S., Hope, I.A., et al. (2006) A gene-centered C. elegans protein–DNA interaction network. Cell 125:1193–1205.
DeRisi, J.L., Iyer, V., Brown, P.O. (1997) Exploring the metabolic and genetic control of gene expression on a genomic scale. Science 278:680–686.
Dorschner, M.O., Hawrylycz, M., Humbert, R., Wallace, J.C., Shafer, A., Kawamoto, J., Mack, J., Hall, R., Goldy, J., Sabo, P.J., et al. (2004) High-throughput localization of functional elements by quantitative chromatin profiling. Nat. Methods 1:219–225.
Du, T., Zamore, P.D. (2005) microPrimer: The biogenesis and function of microRNA. Development 132:4645–4652.
Dupuy, D., Li, Q., Deplancke, B., Boxem, M., Hao, T., Lamesch, P., Sequerra, R., Bosak, S., Doucette-Stamm, L., Hope, I.A., et al. (2004) A first version of the Caenorhabditis elegans promoterome. Genome Res. 14:2169–2175.
Elnitski, L., Jin, V.X., Farnham, P.J., Jones, S.J.M. (2006) Locating mammalian transcription factor binding sites: A survey of computational and experimental techniques. Genome Res. (this issue).
ENCODE Project Consortium (2004) The ENCODE (ENCyclopedia of DNA elements) project. Science 306:636–640.
Fields, S., Song, O. (1989) A novel genetic system to detect protein–protein interactions. Nature 340:245–246.
Gavin, A.C., Aloy, P., Grandi, P., Krause, R., Boesche, M., Marzioch, M., Rau, C., Jensen, L.J., Bastuck, S., Dumpelfeld, B., et al. (2006) Proteome survey reveals modularity of the yeast cell machinery. Nature 440:631–636.
Glazov, E.A., Pheasant, M., McGraw, E.A., Bejerano, G., Mattick, J.S. (2005) Ultraconserved elements in insect genomes: A highly conserved intronic sequence implicated in the control of homothorax mRNA splicing. Genome Res. 15:800–808.
Goffeau, A., Barrell, B.G., Bussey, H., Davis, R.W., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J.D., Jacq, C., Johnston, M., et al. (1996) Life with 6000 genes. Science 274:546, 563–567.
Guelzim, N., Bottani, S., Bourgine, P., Kepes, F. (2002) Topological and causal structure of the yeast transcriptional regulatory network. Nat. Genet. 31:60–63.
Gupta, M., Liu, J.S. (2005) De novo cis-regulatory module elicitation for eukaryotic genomes. Proc. Natl. Acad. Sci. 102:7079–7084.
Hall, D.A., Zhu, H., Zhu, X., Royce, T., Gerstein, M., Snyder, M. (2004) Regulation of gene expression by a metabolic enzyme. Science 306:482–484.
Hallikas, O., Palin, K., Sinjushina, N., Rautiainen, R., Partanen, J., Ukkonen, E., Taipale, J. (2006) Genome-wide prediction of mammalian enhancers based on analysis of transcription-factor binding affinity. Cell 124:47–59.
Harbison, C.T., Gordon, D.B., Lee, T.I., Rinaldi, N.J., Macisaac, K.D., Danford, T.W., Hannett, N.M., Tagne, J.B., Reynolds, D.B., Yoo, J., et al. (2004) Transcriptional regulatory code of a eukaryotic genome. Nature 431:99–104.
Hartley, J.L., Temple, G.F., Brasch, M.A. (2000) DNA cloning using in vitro site-specific recombination. Genome Res. 10:1788–1795.
Hieronymus, H., Silver, P.A. (2004) A systems view of mRNP biology. Genes & Dev. 18:2845–2860.
Horak, C.E., Luscombe, N.M., Qian, J., Bertone, P., Piccirrillo, S., Gerstein, M., Snyder, M. (2002) Complex transcriptional circuitry at the G1/S transition in Saccharomyces cerevisiae. Genes & Dev. 16:3017–3033.
Ihmels, J., Friedlander, G., Bergmann, S., Sarig, O., Ziv, Y., Barkai, N. (2002) Revealing modular organization in the yeast transcriptional network. Nat. Genet. 31:370–377.
Imanishi, T., Itoh, T., Suzuki, Y., O’Donovan, C., Fukuchi, S., Koyanagi, K.O., Barrero, R.A., Tamura, T., Yamaguchi-Kabata, Y., Tanino, M., et al. (2004) Integrative annotation of 21,037 human genes validated by full-length cDNA clones. PLoS Biol. 2:e162.
Jeong, H., Mason, S.P., Barabasi, A.-L., Oltvai, Z.N. (2001) Lethality and centrality in protein networks. Nature 411:41–42.
Keene, J.D., Lager, P.J. (2005) Post-transcriptional operons and regulons co-ordinating gene expression. Chromosome Res. 13:327–337.
Kellis, M., Patterson, N., Endrizzi, M., Birren, B., Lander, E.S. (2003) Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature 423:241–254.
Kim, T.H., Barrera, L.O., Qu, C., Van Calcar, S., Trinklein, N.D., Cooper, S.J., Luna, R.M., Glass, C.K., Rosenfeld, M.G., Myers, R.M., et al. (2005a) Direct isolation and identification of promoters in the human genome. Genome Res. 15:830–839.
Kim, T.H., Barrera, L.O., Zheng, M., Qu, C., Singer, M.A., Richmond, T.A., Wu, Y., Green, R.D., Ren, B. (2005b) A high-resolution map of active promoters in the human genome. Nature 436:876–880.
Krogan, N.J., Cagney, G., Yu, H., Zhong, G., Guo, X., Ignatchenko, A., Li, J., Pu, S., Datta, N., Tikuisis, A.P., et al. (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440:637–643.
Kummerfeld, S.K., Teichmann, S.A. (2006) DBD: A transcription factor prediction database. Nucleic Acids Res. 34:D74–D81.
Lall, S., Grun, D., Krek, A., Chen, K., Wang, Y.-L., Dewey, C.N., Sood, P., Colombo, T., Bray, N., MacMenamin, P., et al. (2006) A genome-wide map of conserved microRNA targets in C. elegans. Curr. Biol. 16:460–471.
Lee, T.I., Rinaldi, N.J., Robert, F., Odom, D.T., Bar-Joseph, Z., Gerber, G.K., Hannett, N.M., Harbison, C.T., Thompson, C.M., Simon, I., et al. (2002) Transcriptional regulatory networks in Saccharomyces cerevisiae. Science 298:799–804.
Levine, M., Tjian, R. (2003) Transcription regulation and animal diversity. Nature 424:147–151.
Li, J.J., Herskowitz, I. (1993) Isolation of the ORC6, a component of the yeast origin recognition complex by a one-hybrid system. Science 262:1870–1874.
Li, S., Armstrong, C.M., Bertin, N., Ge, H., Milstein, S., Boxem, M., Vidalain, P.-O., Han, J.-D.J., Chesneau, A., Hao, T., et al. (2004) A map of the interactome network of the metazoan C. elegans. Science 303:540–543.
Liu, X., Noll, D.M., Lieb, J.D., Clarke, N.D. (2005) DIP-chip: Rapid and accurate determination of DNA-binding specificity. Genome Res. 15:421–427.
Luscombe, N.M., Madan Babu, M., Yu, H., Snyder, M., Teichmann, S.A., Gerstein, M. (2004) Genomic analysis of regulatory network dynamics reveals large topological changes. Nature 431:308–312.
Maston, G.A., Evans, S.K., Green, M.R. (2006) Transcriptional regulatory elements in the human genome. Annu. Rev. Genomics Hum. Genet. 7:29–59.
Meng, X., Brodsky, M.H., Wolfe, S.A. (2005) A bacterial one-hybrid system for determining the DNA-binding specificity of transcription factors. Nat. Biotechnol. 23:988–994.
Milo, R., Shen-Orr, S., Itzkovitz, S., Kashtan, N., Chklovskii, D., Alon, U. (2002) Network motifs: Simple building blocks of complex networks. Science 298:824–827.
Mukherjee, S., Berger, M.F., Jona, G., Wang, X.S., Muzzey, D., Snyder, M., Young, R.A., Bulyk, M.L. (2004) Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays. Nat. Genet. 36:1331–1339.
Newman, J.R.S., Keating, A.E. (2003) Comprehensive identification of human bZip interactions with coiled-coil arrays. Science 300:2097–2101.
Odom, D.T., Zizlsperger, N., Gordon, D.B., Bell, G.W., Rinaldi, N.J., Murray, H.L., Volkert, T.L., Schreiber, J., Rolfe, P.A., Gifford, D.K., et al. (2004) Control of pancreas and liver gene expression by HNF transcription factors. Science 303:1378–1381.
Oh, S.W., Mukhopadhyay, A., Dixit, B.L., Raha, T., Green, M.R., Tissenbaum, H.A. (2005) Identification of direct targets of DAF-16 controlling longevity, metabolism and diapause by chromatin immunoprecipitation. Nat. Genet. 38:251–257.
Orian, A., van Steensel, B., Delrow, J., Bussemaker, H.J., Li, L., Sawado, T., Williams, E., Loo, L.W.M., Cowley, S.M., Yost, C., et al. (2003) Genomic binding by the Drosophila Myc, Max, Mad/Mnt transcription factor network. Genes & Dev. 17:1101–1114.
Reece-Hoyes, J.S., Deplancke, B., Shingles, J., Grove, C.A., Hope, I.A., Walhout, A.J.M. (2005) A compendium of C. elegans regulatory transcription factors: A resource for mapping transcription regulatory networks. Genome Biol. 6:R110.
Rual, J.F., Venkatesan, K., Hao, T., Hirozane-Kishikawa, T., Dricot, A., Li, N., Berriz, G.F., Gibbons, F.D., Dreze, M., Ayivi-Guedehoussou, N., et al. (2005) Towards a proteome-scale map of the human protein–protein interaction network. Nature 437:1173–1178.
Ruvinsky, I., Ruvkun, G. (2003) Functional tests of enhancer conservation between distantly related species. Development 130:5133–5142.
Sabo, P.J., Kuehn, M.S., Thurman, R., Johnson, B.E., Johnson, E.M., Cao, H., Yu, M., Rosenzweig, E., Goldy, J., Haydock, A., et al. (2006) Genome-scale mapping of DNaseI sensitivity in vivo using tiling DNA microarrays. Nat. Methods 3:511–518.
Segal, E., Shapira, M., Regev, A., Pe’er, D., Botstein, D., Koller, D., Friedman, N. (2003) Module networks: Identifying regulatory modules and their condition-specific regulators from gene expression data. Nat. Genet. 34:166–176.
Shannon, P., Markiel, A., Ozier, O., Baliga, N.S., Wang, J.T., Ramage, D., Amin, N., Schwikowski, B., Ideker, T. (2003) Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13:2498–2504.
Sharan, R., Ovcharenko, I., Ben-Hur, A., Karp, R.M. (2003) CREME: A framework for identifying cis-regulatory modules in human–mouse conserved segments. Bioinformatics 19:i283–i291.
Shen-Orr, S.S., Milo, R., Mangan, S., Alon, U. (2002) Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31:64–68.
↵
1. Siepel, A.,
2. Bejerano, G.,
3. Pedersen, J.S.,
4. Hinrichs, A.S.,
5. Hou, M.,
6. Rosenbloom, K.,
7. Clawson, H.,
8. Spieth, J.,
9. Hillier, L.W.,
10. Richards, S.,
11. et al.
(2005) Evolutionarily conserved elements in vertebrate, insect, worm and yeast genomes. Genome Res. 15:1034–1050.
1. Stelzl, U.,
2. Worm, U.,
3. Lalowski, M.,
4. Haenig, C.,
5. Brembeck, F.H.,
6. Goehler, H.,
7. Stroedicke, M.,
8. Zenkner, M.,
9. Schoenherr, A.,
10. Koeppen, S.,
11. et al.
(2005) A human protein–protein interaction network: A resource for annotating the proteome. Cell 122:957–968.
1. Tompa, M.,
2. Li, N.,
3. Bailey, T.L.,
4. Church, G.M.,
5. De Moor, B.,
6. Eskin, E.,
7. Favorov, A.V.,
8. Frith, M.C.,
9. Fu, Y.,
10. Kent, W.J.,
11. et al.
(2005) Assessing computational tools for the discovery of transcription factor binding sites. Nat. Biotechnol. 23:137–144.
1. van Steensel, B.,
2. Henikoff, S.
(2000) Identification of in vivo DNA targets of chromatin proteins using tethered Dam methyltransferase. Nat. Biotechnol. 18:424–428.
1. Walhout, A.J.M.,
2. Sordella, R.,
3. Lu, X.,
4. Hartley, J.L.,
5. Temple, G.F.,
6. Brasch, M.A.,
7. Thierry-Mieg, N.,
8. Vidal, M.
(2000) Protein interaction mapping in C. elegans using proteins involved in vulval development. Science 287:116–122.
1. Wang, M.M.,
2. Reed, R.R.
(1993) Molecular cloning of the olfactory neuronal transcription factor Olf-1 by genetic selection in yeast. Nature 364:121–126.
1. Woolfe, A.,
2. Goodson, M.,
3. Goode, D.K.,
4. Snell, P.,
5. McEwen, G.K.,
6. Vavouri, T.,
7. Smith, S.F.,
8. North, P.,
9. Callaway, H.,
10. Kelly, K.,
11. et al.
(2005) Highly conserved non-coding sequences are associated with vertebrate development. PLoS Biol. 3:e7.
1. Workman, C.T.,
2. Mak, H.C.,
3. McCuine, S.,
4. Tagne, J.B.,
5. Agarwal, M.,
6. Ozier, O.,
7. Begley, T.J.,
8. Samson, L.D.,
9. Ideker, T.
(2006) A systems approach to mapping DNA damage response pathways. Science 312:1054–1059.
1. Xie, X.,
2. Lu, J.,
3. Kulbokas, E.J.,
4. Golub, T.R.,
5. Mootha, V.,
6. Lindblad-Toh, K.,
7. Lander, E.S.,
8. Kellis, M.
(2005) Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals. Nature 434:338–345.
1. Yu, H.,
2. Luscombe, N.M.,
3. Qian, J.,
4. Gerstein, M.
(2003) Genomic analysis of gene expression relationships in transcriptional regulatory networks. Trends Genet. 19:422–427.
1. Yu, H.,
2. Greenbaum, D.,
3. Xin Lu, H.,
4. Zhu, X.,
5. Gerstein, M.
(2004) Genomic analysis of essentiality within protein networks. Trends Genet. 20:227–231.
1. Zhang, X.,
2. Odom, D.T.,
3. Koo, S.-H.,
4. Conkright, M.D.,
5. Canettieri, G.,
6. Best, J.,
7. Chen, H.,
8. Jenner, R.,
9. Herbolsheimer, E.,
10. Jacobsen, E.,
11. et al.
(2005) Genome-wide analysis of cAMP-response element binding protein occupancy, phosphorylation, and target gene activation in human tissues. Proc. Natl. Acad. Sci. 102:4459–4464.

