Coevolution within a transcriptional network by compensatory trans and cis mutations

  1. Trey Ideker1,4
  1. 1 Departments of Bioengineering and Medicine, University of California, San Diego, La Jolla, California 92093, USA;
  2. 2 Red Sea Laboratory of Integrative Systems Biology, Division of Chemical and Life Sciences and Engineering, Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia;
  3. 3 Departments of Internal Medicine and Biomedical Engineering, University of Iowa, Iowa City, Iowa 52242, USA

    Abstract

    Transcriptional networks have been shown to evolve very rapidly, prompting questions as to how such changes arise and are tolerated. Recent comparisons of transcriptional networks across species have implicated variations in the cis-acting DNA sequences near genes as the main cause of divergence. What is less clear is how these changes interact with trans-acting changes occurring elsewhere in the genetic circuit. Here, we report the discovery of a system of compensatory trans and cis mutations in the yeast AP-1 transcriptional network that allows for conserved transcriptional regulation despite continued genetic change. We pinpoint a single species, the fungal pathogen Candida glabrata, in which a trans mutation has occurred very recently in a single AP-1 family member, distinguishing it from its Saccharomyces ortholog. Comparison of chromatin immunoprecipitation profiles between Candida and Saccharomyces shows that, despite their different DNA-binding domains, the AP-1 orthologs regulate a conserved block of genes. This conservation is enabled by concomitant changes in the cis-regulatory motifs upstream of each gene. Thus, both trans and cis mutations have perturbed the yeast AP-1 regulatory system in such a way as to compensate for one another. This demonstrates an example of “coevolution” between a DNA-binding transcription factor and its cis-regulatory site, reminiscent of the coevolution of protein binding partners.

    Transcriptional networks are central to understanding both evolution and phenotypic diversity among organisms. Of the many ways in which transcriptional networks can evolve, much attention has been given to changes in the so-called cis-regulatory regions of gene promoters (Wray 2007; Wagner and Lynch 2008). Such changes include gain, loss, or modification of DNA sequence motifs (Cliften et al. 2003; Kellis et al. 2003; Gasch et al. 2004; Stark et al. 2007) as well as alterations in motif spacing relative to the start of transcription, or to other motifs (Ihmels et al. 2005; Tanay et al. 2005). In addition to changes in cis, transcriptional networks can also evolve through alterations to transcription factor (TF) proteins and other trans-acting factors (Wagner and Lynch 2008). Although there have been fewer reports of evolutionary changes in trans, potential mechanisms include mutations to protein structure impacting transcriptional activation or DNA-binding domains (Wagner and Lynch 2008), modulation of TF expression (Sankaran et al. 2009) or post-translational modifications (Holt et al. 2009), or gain and loss of protein–protein interactions among TFs (Tuch et al. 2008; Lavoie et al. 2010).

    Recently, a number of genome-scale studies have performed systematic comparisons of TF-binding patterns (Borneman et al. 2007; Tuch et al. 2008; Bradley et al. 2010; Lavoie et al. 2010; Schmidt et al. 2010) or mRNA expression profiles across species (Ihmels et al. 2005; Tanay et al. 2005; Hogues et al. 2008; Field et al. 2009; Wapinski et al. 2010). Almost universally, these studies have identified transcriptional programs that are dramatically rewired over short evolutionary time scales. As with earlier work, many of the observed differences in binding and expression have been linked to changes in cis-regulatory regions. For example, Borneman et al. (2007) found that the TF Tec1 binds only 20% of the same target genes in comparisons between Saccharomyces cerevisiae and the closely related Saccharomyces bayanus and Saccharomyces mikatae, and that this difference is due to gain and loss of canonical Tec1 cis-regulatory motifs. While some recent studies have associated genetic variants in TFs with gene expression changes observed in interspecies hybrids (Wilson et al. 2008; Wittkopp et al. 2008; Tirosh et al. 2009; Bullard et al. 2010; Emerson et al. 2010), in outbred crosses (Brem and Kruglyak 2005; Landry et al. 2005; Gerke et al. 2009; Sung et al. 2009; Zheng et al. 2010), or in human populations (Kasowski et al. 2010), the picture that emerges is that cis-regulatory regions are incredibly plastic over evolutionary time, while TFs (trans) evolve at a comparatively slower rate (Wray 2007).

    Given the dramatic changes that appear to be occurring in transcriptional networks, a key question is how such systems retain essential functions over evolutionary time (Wray 2007). One solution is that changes in cis can occur by replacement of one TF cofactor with another, thereby maintaining regulatory control (Tsong et al. 2006). Alternatively, rather than replacing specific cofactors, it is conceivable that the DNA-binding domains of the TFs that bind these cis-regulatory sequences might be altered in lock-step with changes in cis, similarly to the evolution of protein-binding partners (Pazos and Valencia 2008). However, such a mechanism of evolution has yet to be observed. Here, we present a direct example of such “coevolution,” where a specific change to a DNA-binding transcription factor and its cis-regulatory site have occurred in compensatory fashion.

    As a model of transcriptional network evolution, we examined the yeast AP-1 (yAP-1) family, which, with a total of eight members, is one of the largest paralogous TF families in S. cerevisiae (Fernandes et al. 1997; Rodrigues-Pousada et al. 2010). Like other paralogous families, AP-1 factors have been born through the process of gene duplication, which gives rise to multiple copies that are free from selective pressure and may functionally diverge from their duplicates by sub- or neofunctionalization (Hittinger and Carroll 2007). AP-1 also provides a classic example of the basic leucine zipper (bZIP) motif, which is widely conserved across eukaryotes (Tan et al. 2008; Rodrigues-Pousada et al. 2010). In humans, AP-1 TFs have been heavily studied due to their crucial role in cell proliferation, death, and differentiation (Shaulian and Karin 2002). In yeast, yAP-1-mediated transcriptional networks carry out overlapping, but distinct biological responses to stress (Tan et al. 2008; Rodrigues-Pousada et al. 2010). In contrast to the widespread divergence in TF binding that has been demonstrated previously (Borneman et al. 2007; Tuch et al. 2008; Lavoie et al. 2010), we show that coupled trans and cis mutations enable conservation of a subset of genes targeted by yAP-1. These results provide an example of compensatory coevolution of a trans and cis regulatory system.

    Results

    A trans mutation is associated with AP-1 DNA-binding motif specificity

    To identify trans mutations that could be associated with AP-1-binding preference, we performed an amino acid sequence alignment of the DNA-binding domains of all eight AP-1-like TFs in S. cerevisiae (Sc). This alignment and its associated phylogenetic tree (Fig. 1A) were searched to identify the key polymorphic amino acids whose patterns of conservation and divergence best explain the phylogeny (Methods, Evolutionary Trace Analysis). Such residues have been shown to frequently play important evolutionary roles (Innis et al. 2000). Using this approach, we identified residue 12 of the DNA-binding domain basic region as the most important evolutionarily divergent position across the yAP-1 family (i.e., the one that was most highly correlated with the phylogeny; Fig. 1A).

    Figure 1.

    A single residue determines yAP-1 DNA-binding motif specificity. (A) Alignment and phylogeny of AP-1 DNA-binding domain basic regions (residues 6 to 20 are shown). Residue 12 (red star) is predictive of preference for overlapping (YRE-O) or adjacent (YRE-A) DNA-binding motifs (left). Note that Yap8 possesses an Asp at residue 12 and binds a 2-bp overlapping YRE-O (Harbison et al. 2004). Positions affecting Gcn4 half-site spacing preference (Kim and Struhl 1995) are shown (gray stars). (B) Recognition of the yAP-1 half-site (Fujii et al. 2000). Residue 12 (red star) is in close proximity to residues conferring AP-1 sequence specificity. (C,D) ScYap1.R79K and ScYap4.K252R mutants have altered half-site spacing preference as evidenced by ChIP-chip (Methods). P-values refer to differences in binding to genes with either YRE-O or YRE-A sites as assessed by Fisher's exact test. (E,F) ScYap1.R79K and ScYap4.K252R mutations cause mRNA expression changes among genes with YRE-O and YRE-A sites among the top 50 most differentially expressed genes. P-values denote the significance of YRE-A and YRE-O motifs among gene promoters compared with the genomic background.

    Residue 12 was also predictive of AP-1 family DNA-binding motif preference (Fig. 1A) (MacIsaac et al. 2006; Tan et al. 2008). AP-1 family members bind DNA as homo- or heterodimers, where each constituent monomer recognizes the consensus sequence TTAC (Suckow et al. 1999; Fujii et al. 2000). These “half-sites” are positioned in either adjacent or overlapping fashion (Fig. 1A), which we refer to as yAP-1 response element adjacent (YRE-A) or yAP-1 response element overlapping (YRE-O), respectively. Previous analyses of genome-wide chromatin immunoprecipitation with microarray hybridization (ChIP-chip) data in Sc (Harbison et al. 2004; Tan et al. 2008) have determined that five AP-1 family members (Yap1, Yap2, Yap5, Yap7, Yap8) recognize YRE-O, whereas two family members (Yap4 and Yap6) recognize YRE-A. We examined the binding of the remaining Sc AP-1 member Yap3 by ChIP-chip and determined it preferred YRE-A sites in both complete media and stress conditions (Supplemental Fig. 1). This preference for YRE-O or YRE-A-binding sites in Sc AP-1s correlates precisely with the presence of arginine or lysine at residue 12 (Fig. 1A).

    Interestingly, residue 12 is part of an alpha-helical surface that forms multiple contacts to DNA (basic region residues 7–15) (Fig. 1B; Fujii et al. 2000). Previously, this residue was predicted as a likely determinant of DNA half-site spacing preference in Gcn4, another bZIP family TF (Kim and Struhl 1995). Although in vitro testing of Gcn4 mutants was not able to confirm this prediction (Kim and Struhl 1995), it has become apparent that such variations in half-site recognition are best distinguished in vivo (Suckow and Hollenberg 1998; Berger et al. 2008; Maerkl and Quake 2009).

    Residue 12 point mutations cause rewiring of AP-1 transcriptional interactions

    To further examine the regulatory role of residue 12, we mutated this residue in Yap1, a representative YRE-O-binding factor, and Yap4 (also known as Cin5), a representative YRE-A-binding factor. This process involved generating mutants Yap1.R79K and Yap4.K252R, changing arginine to lysine in Yap1 and lysine to arginine in Yap4 (Methods). Next, Yap1.R79K binding and Yap4.K252R binding were assayed in vivo using ChIP-chip (Methods). Comparison of the top 50 promoters bound by Yap1.R79K with the top 50 promoters bound by wild-type Yap1 (as determined in Tan et al. 2008) showed that mutation of Yap1 significantly altered its preference for YRE-O and YRE-A sites (Fig. 1C; Fisher's exact test P = 0.0002). Comparison of promoters bound by mutant and wild-type Yap4 also showed the predicted shift in binding preference (Fig. 1D; Fisher's exact test P = 0.037). These results were not dependent on the number of promoters examined (Supplemental Fig. 2).

    Next, to assess the functional implications of changes in yAP-1 binding, we generated genome-wide mRNA expression profiles for each mutant in comparison to the unmutated parental strain (Methods). Both mutations, Yap1.R79K and Yap4.K252R, altered the expression of genes whose promoters were highly enriched for AP-1-binding sites (YRE-O and YRE-A, Fig. 1E,F; Supplemental Fig. 3). These genes were also enriched for Yap1.R79K and Yap4.K252R binding (P < 10−5), respectively.

    An apparent paradox: Candida AP-1 diverges at residue 12, but its targets are conserved

    Based on our observation that residue 12 affects binding of AP-1 paralogs in S. cerevisiae, we next asked whether changes in this residue could lead to divergent binding of AP-1 orthologs across species. We searched the yeast phylogeny (Wapinski et al. 2007) for AP-1 orthologs that were anomalous in their use of Arg 12 or Lys 12, suggesting lineage-specific mutation (Supplemental Fig. 4). Among TFs orthologous to Sc YAP1, we found that the Candida glabrata (Cg) ortholog CgAP1 diverges from other yeasts (Fig. 2A–C) due to the presence of lysine at residue 12, in contrast to other yeasts in its clade that possess an arginine. This CgAp1 amino acid substitution was confirmed by sequencing of genomic DNA from two independent Cg isolates, 2001HTU and NCCLS84 (Fig. 2B).

    Figure 2.

    Evolution of the yAP-1 TFs. (A) CgAp1 possesses a lysine at residue 12 (CgAp1.46), while most other species possess an arginine. (B) Sequencing of CgAP1 in two unrelated isolates shows complete identity to the Cg reference genome. (C) Phylogenetic clustering of all Sc and Cg AP-1 DNA-binding domains reveals that CgAp1 and ScYap1 co-cluster. Internal branch point numbers refer to the Bayesian posterior probability, a measure of confidence (Drummond and Rambaut 2007).

    We used ChIP-chip to determine whether this Lys 12 substitution had a functional effect on CgAp1 binding (Methods). To facilitate this assay, we tagged CgAp1 with the TAP epitope and designed a custom microarray tiling the Cg genome (Methods). As a control on both the TAP construct and the array design, we used ChIP-qPCR to successfully validate a panel of five randomly chosen Cg gene promoters that were determined to be bound by CgAp1 in the ChIP-chip experiment (Supplemental Fig. 5).

    We found that CgAp1 bound the promoters of a total of 114 genes, 90 of which had known orthologs in Sc (Methods). Comparison of these data with ChIP profiles for each of the AP-1 factors in Sc grown under the same treatment conditions (as determined in Tan et al. 2008) showed significant overlap between the targets of CgAp1 and ScYap1 (17 genes, P < 10−17). Overlap with other Sc AP-1 factors was less substantial (Fig. 3A). This pattern of overlap was reinforced by sequence analysis, in which phylogenetic clustering of AP-1 DNA-binding domains places CgAp1 definitively with ScYap1 and not with other Sc AP-1 sequences (Fig. 2C; Methods).

    Figure 3.

    The CgAp1 transcriptional network has been rewired. (A) For the promoters targeted by each yAP-1 transcription factor in Sc, the overlap with CgAp1 targets is shown (of 90 CgAp1 targets total). (B) CgAp1 prefers YRE-A-binding sites compared with ScYap1 (Fisher's exact test). (C) The CgAp1 DNA-binding motif (green) clusters with YRE-A rather than YRE-O motifs. (D) The YRE-O site is enriched (star) among common ScYap1 and CgAp1 targets in other yeasts (hypergeometric test, Q < 0.05), but not Cg (Q ∼ 1.0). The YRE-A site is enriched among these targets in Cg (star) but not other yeasts. (E) Compensatory mutations in both trans and cis maintain AP-1 binding.

    We were therefore faced with the following conundrum: On the one hand, the CgAp1 sequence diverges from Yap1 orthologs at residue 12, suggesting a shift in DNA binding. On the other hand, the CgAp1-binding profile is quite specifically conserved with that of Yap1, calling into question the importance of residue 12 for sequence recognition.

    CgAp1 prefers YRE-A rather than YRE-O sites

    To investigate this apparent contradiction, we next turned to the gene promoters targeted by CgAp1 in the ChIP assay. Promoters targeted by CgAp1 showed a clear preference for YRE-A sites over YRE-O sites (49 vs. four promoters, respectively). This preference significantly differs from ScYap1, which prefers YRE-O over YRE-A (Fisher's exact test P = 3.5 × 10−8; Fig. 3B, 21 vs. 12 promoters, respectively). This preference could not be attributed to threshold effects on binding-site calls, as direct comparison of motif scores confirmed a preference for YRE-A over YRE-O sites (Mann-Whitney U test, P = 0.0072). This preference was also observed via de novo motif search in these promoters (Fig. 3C) and even among the Cg orthologs of all ScYap1 targets (Q = 0.05).

    We further analyzed this cis-regulatory preference by examining the orthologs of genes targeted by both CgAp1 and ScYap1 across 20 sequenced yeast genomes (Wapinski et al. 2007). C. glabrata stood out clearly as the only species with enrichment for YRE-A sites (Fig. 3D). In contrast, the YRE-O site was enriched in all neighboring species in the yeast phylogeny, including S. cerevisiae and other sensu stricto species (S. paradoxus, S. mikatae, and S. bayanus) as well as the more diverged Saccharomyces castellii, Kluyveromyces waltii, Kluyveromyces lactis, Ashbya gosspyii, and Candida tropicalis. These results indicate that upstream DNA-binding motifs of CgAp1 targets have evolved from YRE-O to YRE-A (Fig. 3E). Such a switch may have also been accompanied by concordant changes in secondary cis-regulatory DNA motifs (Supplemental material, Yap Cis-Regulatory Motifs Are Coincident with Those of Rtg3 and Aft1; Supplemental Fig. 6) and possible functional divergence (Supplemental material, Divergence and Conservation of Yap1 Function). The most plausible explanation is that these motifs have coevolved with a Lys 12 mutation in CgAp1, with the result that this transcriptional system has retained regulatory control of the same set of target genes over evolutionary time.

    Discussion

    Which mutation came first: the cis or trans? It is possible to envision two equally plausible scenarios (Fig. 3E): (1) An initial mutation in the Yap1 TF provided selective pressure for subsequent cis-regulatory changes in Yap1 targeted genes; (2) a change from YRE-O to YRE-A-binding site in key Yap1 target(s) provided selective pressure for a mutation in the Yap1 TF. In either scenario, mutations in trans and cis may have been facilitated by other AP-1 family members. The large size and interconnectivity of the AP-1 family may serve as a buffer for accumulation of cis and trans mutations, allowing for highly plastic evolution of the AP-1 regulatory network. In support of this hypothesis, several yAP-1s have been shown to bind each other along with common target genes, which might compensate for some loss in regulation by paralogs (Tan et al. 2008).

    Examination of the protein sequences of all AP-1 family members across 20 available yeast genomes (Wapinski et al. 2007) suggests that mutations in residue 12 have occurred frequently during AP-1 family evolution (Supplemental Fig. 4). Interestingly, we found that all yeasts possess at least one AP-1 TF with Arg 12 (Fig. 4). In contrast, several yeasts lack AP-1 TFs with Lys 12, and these species are the most evolutionarily diverged from Sc. These results suggest that the common yeast AP-1 ancestor encoded arginine and that the emergence of TFs using lysine is a more recent evolutionary innovation (Supplemental material, AP-1 Family Ancestry).

    Figure 4.

    All yeasts in the species phylogeny (Wapinski et al. 2007) possess an AP-1 with an arginine. The Candida clade (shaded green) and the whole-genome duplication event (orange star) are noted. Note that the Candida clade does not include Candida glabrata.

    Within the Candida clade, several species (C. tropicalis, C. albicans, C. parapsilosis, and Lodderomyces elongosporus) have AP-1 families based exclusively on Arg 12, while others (C. lusitaniae, Debaryomyces hansenii, and C. guilliermondii) (Fig. 4) represent both Arg 12 and Lys 12 across the AP-1 family. This suggests two equally plausible scenarios for the emergence of Lys 12 in yAP-1 TFs: (1) Lysine emerged following the divergence of Yarrowia lipolytica from other hemi-ascomycetes, followed by a lineage-specific loss within the Candida clade. (2) Lysine emerged following the split of the Candida clade from the rest of the hemi-ascomycetes and emerged again within the Candida clade. In either scenario, a switch from arginine (coded by AGA or AGG) to lysine (coded by AAA or AAG) could be accomplished by a simple single base-pair mutation.

    Open questions still remain regarding how arginine and lysine substitutions alter AP-1 DNA-binding motif preference. One hypothesis is that differences in their electrostatic charges alter the space required to accommodate other positively charged residues of the bZIP DNA-binding domain without electrostatic repulsion (Kim and Struhl 1995). This hypothesis suggests that the most positively charged residues such as arginine should be associated with YRE-A rather than YRE-O sites (Kim and Struhl 1995). However, in our findings lysine (pI = 9.59) rather than arginine (pI = 11.15) was associated with YRE-A sites.

    An alternative explanation involves a role for AP-1-induced DNA flexibility. Complexes of yAP-1 protein with the YRE-O DNA sequence have been associated with an increase in incorporated water molecules (Dragan et al. 2004a) leading to a decrease in DNA flexibility (Kim and Struhl 1995) compared with yAP-1/YRE-A complexes. A previous report has suggested that changes in DNA flexibility play a key role in determining half-site spacing preference and are responsible for differences between in vivo and in vitro measurements (Suckow and Hollenberg 1998). Since residue 12 is in close proximity to DNA (Fig. 1B) within the protein–DNA complex, residue changes may affect the ability of DNA to incorporate water during binding, thus affecting both yAP-1 DNA motif flexibility and binding (Dragan et al. 2004b). Interestingly, the higher positive charge of arginine induces a stronger dipole than that of lysine, providing a possible mechanism for the increase in the number of incorporated water molecules present at YRE-O sites and associated changes in DNA flexibility and binding preference.

    In summary, we have shown that conservation of the AP-1 regulatory program in yeast occurs through coordinated evolution of both the sequence of the TF (trans) and in its DNA-binding motifs (cis). This finding echoes that of previous studies of protein–protein interaction, which have demonstrated cases in which compensatory mutations are required to maintain protein interaction over evolutionary time (Pazos and Valencia 2008). In the context of transcriptional networks, coevolution gives rise to “regulatory homeostasis,” in which both mutations in a TF and its DNA-binding motif occur in compensatory fashion to maintain transcriptional regulation. This series of compensatory mutations, which maintains both the transcriptional circuit and regulatory logic, parallels that of previous work demonstrating evolution of alternative transcriptional circuits producing identical logic (Tsong et al. 2006). Such systems of tightly coupled compensatory mutations might serve to counter the widespread divergence observed in transcriptional networks, and may constitute a general evolutionary mechanism maintaining the regulation of transcriptional networks.

    Methods

    Yeast strains

    All immunoprecipitations were performed on strains where the appropriate gene has been endogenously fused to the TAP epitope (Rigaut et al. 1999). Sc TAP-tagged strains were obtained from Open Biosystems. In Cg, the 2001HTU strain was used for TAP-tagging (see below) and deletions. Cg 2001HTU and NCCLS84 were obtained from ATCC.

    ScYap1.R79K and ScYap4.K252R mutants

    Endogenously epitope-tagged ScYap1∷TAP and ScYap4∷TAP strains (Open Biosystems) were used to introduce the appropriate mutation (ScYap1.R79K, ScYap4.K252R) via the delitto perfetto method (Storici et al. 2001). In brief, ScYAP1 and ScYAP4 were disrupted with the URA3 selectable marker from pRS306 (Brachmann et al. 1998) using ∼100-bp homology. Complementary 200-bp oligos with a mutation (ScYap1.R79K or ScYap4.K252R) were then transformed by electroporation (Thompson et al. 1998) to remove URA3 by 5-FOA (US Biological) selection and verified by sequencing. This process creates strains possessing endogenous yAP-1 proteins having both the desired mutation and epitope.

    CgAp1 TAP-tagged strain

    The TAP tag was amplified with ∼100-bp homology (CgAP1) from pFA6a-TAP-HIS3MX6 (Longtine et al. 1998), transformed by electroporation (Thompson et al. 1998), and selected on complete –his media (Amberg et al. 2005). C-terminal integration was verified by PCR and DNA sequencing with protein expression verified by immunoblot (Amberg et al. 2005) with the peroxidase anti-peroxidase antibody (Sigma P1291).

    Growth conditions, mRNA expression, and ChIP

    Three (CgAp1) or two (ScYap1.R79K, ScYap4.K252R, ScYap3) biological replicates were grown from OD600 0.2 to 0.8 at 30°C in complete media (Amberg et al. 2005) and treated with 0.03% methyl methanesulfonate (Sigma) for 1 h as performed previously (Tan et al. 2008). For mRNA expression analysis, total RNA was isolated by hot phenol/chloroform extraction and labeled with Cy3 or Cy5 dyes (Invitrogen) (Kuo et al. 2010). Samples were hybridized to Agilent expression arrays and washed as recommended by Agilent (Agilent Technologies).

    For ChIP, all TAP-tagged strains were treated as previously described (Tan et al. 2008). In brief, cells were fixed with 1% formaldehyde for 20 min, inactivated with glycine and washed with TBS. Cells were lysed for 2 h (Vibrax-VXR 2000) with glass beads and sonicated for four cycles of 20 sec (+100-sec rest) at power setting 2 (Misonex Sonicator 3000) on ice. Lysate was incubated with Dynabeads M-280 conjugated with anti-TAP antibody (Open Biosystems CAB1001) overnight. Cross-link reversal was performed overnight at 65°C with antibody-enriched and unenriched DNA, and amplified (Sigma-Aldrich) and labeled (Invitrogen) with Cy5/Cy3 dyes. Sc and Cg samples were hybridized to commercial or custom (see below) Agilent tiling arrays and washed as recommended by Agilent (Agilent Technologies).

    Cg tiling microarray design and validation

    We designed a custom microarray tiling the Cg genome at ∼250-bp resolution with ∼44,000 60-mer probes designed to avoid self-dimerization and variability in melting temperatures (Mfold; Zuker 2003), low-complexity and repetitive sequences (RepeatMasker; http://repeatmasker.org), and cross-hybridization (WUBLAST2; Altschul and Gish 1996). Default settings were used for each program. Microarrays were manufactured using Agilent technology (Agilent Technologies). ChIP results were validated by qPCR of five targets compared against CgACT1 (Supplemental Fig. 5).

    Microarray data processing

    Intensities were background subtracted and normalized by LOESS (Smyth 2005). Expression microarrays were analyzed using the limma package (Smyth 2005) with default parameters. ChIP tiling array errors were estimated by the Rosetta error model (Weng et al. 2006) with resulting P-values of binding for each promoter calculated by combining P-values of adjacent probes as previously described (Tan et al. 2008).

    Phylogenetic trees, orthologs, and evolutionary trace analysis

    The yAP-1 DNA-binding domain phylogenetic tree was created with BEAST (Drummond and Rambaut 2007) using default settings. Sequences, species trees, and orthologs were obtained from the Fungal Orthogroups Repository (Wapinski et al. 2007). Multiple-sequence alignment was preformed using MUSCLE (Edgar 2004) with default parameters. Evolutionary trace analysis was performed using TraceSuite II (Innis et al. 2000) with default settings.

    Motif finding

    De novo motifs were identified by SOMBRERO (Mahony et al. 2005) using default parameters and compared with literature (MacIsaac et al. 2006; Tan et al. 2008) using the default settings for STAMP (Mahony et al. 2005). Promoters were scanned with the default settings for Patser (Hertz and Stormo 1999) for motif enrichment by the hypergeometric test with multiple test correction (Storey and Tibshirani 2003). A motif was considered “present” in a promoter for (fraction of maximal information content) ≥0.7.

    Acknowledgments

    We thank Jonathan Weissman for providing pFA6a-TAP-HIS3MX6, Paul Russell for pFA6a-kanMX6, Nevan Krogan for pFA6a-natMX6, and Richard Kolodner for pRS303 and pRS306. We also thank Ilan Wapinski, Lorraine Pillus, and members of the T.I. laboratory for helpful discussions. D.K. was supported by the National Science and Engineering Research Council of Canada. K.T. and T.I. were supported by the David and Lucille Packard Foundation and NIH grant no. R01 ES014811 to T.I.

    Author contributions: D.K., K.T., T.R., and T.I. designed the study. D.K., K.L., S.B., R.C., J.C., and C.L. performed the experimental work. D.K. and K.T. analyzed the data. D.K. and T.I. wrote the manuscript. K.T. and T.I. supervised the work.

    Footnotes

    • Received June 15, 2010.
    • Accepted September 20, 2010.

    Freely available online through the Genome Research Open Access option.

    References

    Articles citing this article

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server