|
|
|
|
|
Published online before print
October 5, 2007, 10.1101/gr.6911207 Genome Res. 17:1614-1625, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
Letter An Arabidopsis gene network based on the graphical Gaussian model1 Physiological and Molecular Plant Biology Program, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA; 2 Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA; 3 Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
We describe a gene network for the Arabidopsis thaliana transcriptome based on a modified graphical Gaussian model (GGM). Through partial correlation (pcor), GGM infers coregulation patterns between gene pairs conditional on the behavior of other genes. Regularized GGM calculated pcor between gene pairs among 2000 input genes at a time. Regularized GGM coupled with iterative random samplings of genes was expanded into a network that covered the Arabidopsis genome (22,266 genes). This resulted in a network of 18,625 interactions (edges) among 6760 genes (nodes) with high confidence and connections representing 0.01% of all possible edges. When queried for selected genes, locally coherent subnetworks mainly related to metabolic functions, and stress responses emerged. Examples of networks for biochemical pathways, cell wall metabolism, and cold responses are presented. GGM displayed known coregulation pathways as subnetworks and added novel components to known edges. Finally, the network reconciled individual subnetworks in a topology joined at the whole-genome level and provided a general framework that can instruct future studies on plant metabolism and stress responses. The network model is included.
Remarkable conceptual and technical advances in genomics have generated exceptionally large data sets. Global analyses of these collections of data may now be used to construct biological networks that systematically categorize all molecules and describe their functions and interactions (Barabasi and Oltvai 2004
Most advanced are gene networks analyzing large-scale microarray hybridizations that monitor transcriptome dynamics (de la Fuente et al. 2002
In contrast to single-cell organisms, network reconstruction of higher organisms has been restricted mainly due to limitations in data availability. Nevertheless, in a complex system such as the plant model Arabidopsis thaliana, expression profiles extracted from microarray data sets offer information on physiological status, in particular, because data from time series and from developmental, genetic intervention, or manipulative treatments are available (Schmid et al. 2005
The assembly of a gene network depends on the mathematical models applied, which, ideally, should describe inferred causal relationships that govern the expression patterns and dynamics of a set of genes. In reality, networks are assembled according to coincidence or coregulation of genes and the magnitude of regulation or statistical significance of the coincidence (Brazhnik et al. 2002
Irrespective of the potential intrinsic to GGM, its application for building network inferences had before been restricted to a small number of genes (Kishino and Waddell 2000
We have used this regularized GGM to build a gene network for A. thaliana, based on data from more than 2000 Affymetrix ATH1 microarray experiments deposited in the NASC database (Craigon et al. 2004
Pilot experiment with 2000 genes The data for construction of the model (Schäfer and Strimmer 2005c
A proof-of-concept experiment started with a collection of
A network for 22,200 Arabidopsis genes This result provided motivation to expand the network by including 22,200 genes of the Arabidopsis transcriptome, represented by 22,266 Affymetrix ATH1 probes with the discrepancy in numbers, due to the fact that some genes were represented by more than one probe set. GGM does not allow for computing the pcor of all input genes simultaneously, because the maximum number of genes that may be analyzed at one time depends on sampling numbers. An iterative process with 2000 iterations was adopted. In each iteration, 2000 genes were randomly selected and used as input for pcor estimation. On average, every gene pair was sampled 16.2 times, and the pcor with the lowest absolute value, representing the one with the largest amount of effects from other genes removed, was chosen as an estimation of the final pcor in the expanded network. Compared with the pilot experiment, these pcors were more narrowly concentrated around zero (Fig. 1B). A null distribution model was then built to estimate the P-values for the edges derived from these final pcors (Supplemental Fig. 1). When setting the cutoff values for pcor at less than –0.10 or larger than 0.10, the corresponding P-value, according to this null model, was lower than 1.92 x 10–190. With this pcor cutoff and after applying the Pearson correlation filter (–0.25–0.35; see Methods), which removed 12.4% of the accepted pcor values, a network for 6760 genes and 18,625 edges was recovered, which retained 0.01% of all possible interactions as significant edges. Our selection of the pcor cutoff value at |0.10| represents high stringency. In comparison, a yeast gene network recovered 70,201 interactions between 5205 genes, a human coexpression network identified 220,649 links among 8805 genes, while another yeast network based on first order partial correlation revealed 11,416 connections between 4686 genes (Lee et al. 2004
Additionally, two random permutation experiments were conducted to evaluate potential false discovery rates. First, all 22,266 genes were permutated, followed by the analysis described before. After 1000 iterations, all final pcors were in the range of from –0.0002 to +0.0004 and deemed insignificant. Second, 1000 genes were randomly chosen, permutated, combined with the remaining 21,266 genes, and subjected to the analysis with 2000 iterations, resulting in an overall pcor distribution similar to that in Figure 1B. Among the 21,765,500 gene pairs with one or two genes permutated, 4875 pairs showed |pcor|
Overall network properties
The network seems to follow a truncated power-law distribution (Amaral et al. 2000 k 11, where the network exhibits certain scale-free behavior, followed by a sharp drop off. Biological networks with similar connectivity distribution have been reported before (Jeong et al. 2001The final overall network (Fig. 3B) was densely organized. When querying the network with selected genes, a number of coherent subnetworks emerged to which biological significance could be attached (Figs. 3, 4, 5, 6, below; Supplemental Fig. 2). We have chosen subnetworks for which biological proof and significance already exists. The resulting network modules, in their majority, defined and organized functions in metabolism or stress responses (Table 1). Subsequently, we included additional edges that then described the Arabidopsis transcriptional response to cold treatment. In these examples, the potential usefulness of the GGM gene network tool may be seen in establishing connections within a subnetwork of known functions with genes not previously associated with a network module or pathway. Often, these novel nodes were functionally unknown, never having been studied before.
Modules in metabolism reveal coherent network subgraphs By use of the kCores method in Carey and Longs RBGL package (version 1.10.0) in Bioconductor, we identified coherent subgraphs (Gentleman et al. 2004
Genes centered on APR1, one of three 5'-adenylylsulfate reductase genes in Arabidopsis, identified a coherence group associated with sulfur metabolism (Fig. 4A). Strongly associated with APR1 were APR2 and APR3, two homologs of APR1 in the Arabidopsis genome. Associated genes in this network were ATSERAT2;1, AKN2, APK, AT1G18590, AT1G74090, ATGSTF11, SULTR4;1, and AT1G74100, all of which encode proteins related to sulfur metabolism. Two genes, SUR1 and ASA1, are genes associated with auxin and tryptophan biosynthetic pathways, confirming other reports (Nikiforova et al. 2003
In Figure 4B, genes involved in phosphate starvation reactions are linked. AT3G05630 encodes phospholipase DZ2, which hydrolyzes phospholipids in plasma membranes, thus releasing inorganic phosphate upon phosphate starvation (Cruz-Ramirez et al. 2006
Figure 4C highlights genes that participate in branch-chained amino acid degradation. MCCA and MCCB form a complex involved in leucine degradation in mitochondria (Gavin et al. 2002
Figure 4D includes genes associated with TRP1. TRP1, TSA1, ASA1, CYP79B2, AT1G25155, PAD3, TSB1, and DHS1 are involved in tryptophan biosynthesis. GLIP1 is an important component of pathogen responses (Oh et al. 2005
Shown in Figure 4E and F are subgraphs for nitrogen and starch metabolism, respectively. NIA1 and NIA2 encode nitrate reductases involved in the first step of nitrate assimilation with NIR1 (encoding nitrite reductase) participating in the second step. NIR1 is connected to AT5G13110, AT1G24280, and AT4G05390, genes whose products participate in NADP metabolism. ASN2 encodes an asparagine synthetase converting ammonium into nitrogen-containing compounds. The subnetwork around starch catabolism included 15 genes, seven of which are known to belong to this pathway: SBE2.1, SEX1, AT5G64860, AT4G09020, DPE2, AT3G52180, and AT5G26570. Among them, AT3G52180 (DSP4) has been identified as encoding a protein phosphatase that binds to starch and regulates its accumulation (Sokolov et al. 2006
The selected seed genes for metabolic functions revealed a structure of the model (Fig. 4) that could be reconciled with established functions in plant metabolism. During phosphate starvation, biochemical studies have established degradation of phospho-, sulfo- and galactolipids (Cruz-Ramirez et al. 2006
Subnetworks describing cell wall biosynthesis and related processes
Of particular interest here were genes related to secondary cell wall synthesis. Covered in Figure 5B were not only group II CESA genes, but also other genes that have been demonstrated to be important for secondary cell wall synthesis, such as SND1 (AT1G32770), a NAM transcription factor (Zhong et al. 2006
In addition to group I and II CESAs, CESA10, one of the cellulose synthases involved in the biosynthesis of primary cell walls (Beeckman et al. 2002
Gene modules related to cell wall synthesis showed substantial overlap with networks based on Pearson correlation coefficients, with the exception that GGM provided more complex structure in as far as additional nodes were inserted. Also, highest correlation with genes reported by Pearson correlation were found only when the subgraphs were extended by several edges. For example, the cellulose synthases CESA4, CESA7 (IRX3), and CESA8, and additional genes in the synthesis of secondary cell walls (Fig. 5B) were arranged similar to structures reported by others (Brown et al. 2005
Arabidopsis responses to cold stress
The center of the subnetwork was dominated by DREB-type transcription factors (Fig. 6B). The three CBFs (CBF1, CBF2, and DREB1A) were identified by strong interactions, indicating mutual functional redundancy (Gilmour et al. 2004
Genes strongly induced by cold stress, and as well by a variety of other stress treatments (Fig. 6C), might be viewed as common or ubiquitous stress-response genes (Glazebrook et al. 2003
Figure 6D included genes rapidly induced predominantly by cold stress and, somewhat less pronounced, by salinity, osmotic stress, and ABA. Included were multiple PP2Cs (ABI1, ABI2, HAB1, AT1G07430), two homeobox genes (ATHB7 and ATHB12), and NCED3, whose functions in ABA metabolism and response have amply been demonstrated. Interestingly, these genes were connected to AFP (AT1G69260), a negative regulator in ABA signaling, promoting ABI5 protein degradation (Lopez-Molina et al. 2003
Other cold stress-induced functions included genes related to the regulation of circadian rhythm (Fig. 6F), TOC1, APRR5, ELF3, ELF4, COL9, and GI (Fig. 6A for GI). The subgraph identified other CONSTANS-like zinc finger proteins, AT1G07050, AT1G78600, and AT5G48250. Their placement into a separate subcluster might indicate regulation different from that of other cold stress-regulated genes, possibly connected to a diurnal cycle. Indeed, cold treatments have been shown to alter the expression of genes involved in the circadian rhythm (Kreps and Simon 1997
Comparison of GGM with a relevance network
The complete Arabidopsis data set that had generated the GGM network was then used to construct a relevance network (Supplemental Data File S2). This analysis recovered 134,594 gene-pair interactions among 5745 genes with Pearson correlation coefficients larger or equal to 0.80. We excluded 12 negative interactions, lower than –0.80, in this analysis. Figure 7 shows the intersection between the two models. Among the
The relevance network showed node distribution more similar to power-law (Supplemental Fig. S3), but many highly connected nodes in this relevance network were connected internally. Supplemental Figure S4A shows a subnetwork for the 100 most connected nodes, with 1939 interactions. Among these interactions, 1936 were assigned pcors lower than 0.10 and deemed insignificant in GGM, because the corresponding gene pairs shared expression patterns with many other genes, which then explained the low number of highly connected nodes in the GGM network. Additionally, GGM required high similarity in expression pattern for a gene to become connected with a highly populated node. As observed, this constraint in highly connected nodes generated the truncated power-law distribution for the whole network (Amaral et al. 2000
The model used is based on a shrinkage approach (Schäfer and Strimmer 2005c
GGM-based gene network structures at the genome level for Arabidopsis have not been presented before, but networks for selected pathways have been constructed (Wille et al. 2004 The examples (Figs. 4, 5, 6; Supplemental Fig. S2) showed GGM revealing subnetworks that were strongly associated with established biological knowledge, while they invariably incorporated genes with unknown functions. Many modules identified functions that play important roles in the response to various stress conditions and in biochemical pathways. Although the procedures leading to this model generated a gene network that is substantially different from other types of networks, we suggest that in combination with these other models, GGM, which is accessible through the script that is included, could provide hypotheses for future studies.
Microarray data All microarray data derived from Affymetrix ATH1 slides. The "Super Bulk Gene Download," a file with all genes and experiments, was downloaded from NASCarrays (http://affymetrix.arabidopsis.info/narrays/help/usefulfiles.html). By August 2006, the file contained data from 2466 slides recorded as raw intensities. The corresponding experiments are summarized in Table 1. Six slides with missing data were removed and the remaining 2460 slides were subjected to quantile normalization.
A method based on "deleted residuals" was used to screen for potential outlier chips (Persson et al. 2005
The raw intensity data (after quantile normalization) from the remaining 2045 chips were rounded to integers (for values
The pilot experiment
The GGM network for the entire Arabidopsis genome
Permutation experiment
Network layout and visualization
We thank the members of NASCArrays and the laboratories providing data for contributing to the database. Advice by Dr. K. Strimmer is gratefully acknowledged. The work was supported by grants from the National Science Foundation Plant Genome Project (DBI-0223905) and University of Illinois at Urbana-Champaign institutional grants. S.M. conceived the experimental approach and performed calculations. S.M., Q.G., and H.J.B. analyzed intermediary approaches to the problem and wrote the article.
4 Corresponding author.
E-mail bohnerth{at}life.uiuc.edu; fax: (217) 333-5574. [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6911207
Agarwal, P.K., Agarwal, P., Reddy, M.K., and Sopory, S.K. 2006. Role of DREB transcription factors in abiotic and biotic stress tolerance in plants. Plant Cell Rep. 25: 1263–1274.[CrossRef][Medline] Amaral, L.A., Scala, A., Barthelemy, M., and Stanley, H.E. 2000. Classes of small-world networks. Proc. Natl. Acad. Sci. 97: 11149–11152. Aoki, K., Ogata, Y., and Shibata, D. 2007. Approaches for extracting practical information from gene co-expression networks in plant biology. Plant Cell Physiol. 48: 381–390. Barabasi, A.L. and Albert, R. 1999. Emergence of scaling in random networks. Science 286: 509–512. Barabasi, A.L. and Oltvai, Z.N. 2004. Network biology: Understanding the cell's functional organization. Nat. Rev. Genet. 5: 101–113.[CrossRef][Medline] Baxter, C.J., Redestig, H., Schauer, N., Repsilber, D., Patil, K.R., Nielsen, J., Selbig, J., Liu, J., Fernie, A.R., and Sweetlove, L.J. 2007. The metabolic response of heterotrophic Arabidopsis cells to oxidative stress. Plant Physiol. 143: 312–325. Beeckman, T., Przemeck, G.K., Stamatiou, G., Lau, R., Terryn, N., De Rycke, R., Inze, D., and Berleth, T. 2002. Genetic complexity of cellulose synthase a gene function in Arabidopsis embryogenesis. Plant Physiol. 130: 1883–1893. Benning, C. and Ohta, H. 2005. Three enzyme systems for galactoglycerolipid biosynthesis are coordinately regulated in plants. J. Biol. Chem. 280: 2397–2400. Brazhnik, P., de la Fuente, A., and Mendes, P. 2002. Gene networks: How to put the function in genomics. Trends Biotechnol. 20: 467–472.[CrossRef][Medline] Breton, G., Danyluk, J., Charron, J.B., and Sarhan, F. 2003. Expression profiling and bioinformatic analyses of a novel stress-regulated multispanning transmembrane protein family from cereals and Arabidopsis. Plant Physiol. 132: 64–74. Brown, D.M., Zeef, L.A., Ellis, J., Goodacre, R., and Turner, S.R. 2005. Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics. Plant Cell 17: 2281–2295. Buck, M.J. and Lieb, J.D. 2004. ChIP-chip: Considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments. Genomics 83: 349–360.[CrossRef][Medline] Craigon, D.J., James, N., Okyere, J., Higgins, J., Jotham, J., and May, S. 2004. NASCArrays: A repository for microarray data generated by NASC's transcriptomics service. Nucleic Acids Res. 32: D575–D577. doi: 10.1093/nar/gkh133. Cruz-Ramirez, A., Oropeza-Aburto, A., Razo-Hernandez, F., Ramirez-Chavez, E., and Herrera-Estrella, L. 2006. Phospholipase DZ2 plays an important role in extraplastidic galactolipid biosynthesis and phosphate recycling in Arabidopsis roots. Proc. Natl. Acad. Sci. 103: 6765–6770. Dan, H., Yang, G., and Zheng, Z.L. 2007. A negative regulatory role for auxin in sulphate deficiency response in Arabidopsis thaliana. Plant Mol. Biol. 63: 221–235.[CrossRef][Medline] de la Fuente, A., Brazhnik, P., and Mendes, P. 2002. Linking the genes: Inferring quantitative gene networks from microarray data. Trends Genet. 18: 395–398.[CrossRef][Medline] Efron, B. 2004. Large-scale simultaneous hypothesis testing: The choice of a null hypothesis. J. Am. Stat. Assoc. 99: 96–104.[CrossRef] Efron, B. 2007. Correlation and large-scale simultaneous significance testing. J. Am. Stat. Assoc. 102: 93–103.[CrossRef] Fowler, S.G., Cook, D., and Thomashow, M.F. 2005. Low temperature induction of Arabidopsis CBF1, 2, and 3 is gated by the circadian clock. Plant Physiol. 137: 961–968. Fujiki, Y., Yoshikawa, Y., Sato, T., Inada, N., Ito, M., Nishida, I., and Watanabe, A. 2001. Dark-inducible genes from Arabidopsis thaliana are associated with leaf senescence and repressed by sugars. Physiol. Plant. 111: 345–352.[CrossRef][Medline] Gachon, C.M., Langlois-Meurinne, M., Henry, Y., and Saindrenan, P. 2005. Transcriptional co-regulation of secondary metabolism enzymes in Arabidopsis: Functional and evolutionary implications. Plant Mol. Biol. 58: 229–245.[CrossRef][Medline] Gansner, E.R. and North, S.C. 2000. An open graph visualization system and its applications to software engineering. Software-Practice & Experience 30: 1203–1233.[CrossRef] Gansner, E.R., Koren, Y., and North, S. 2004. Graph drawing by stress majorization. Graph Drawing 3383: 239–250. Gavin, A.C., Bosche, M., Krause, R., Grandi, P., Marzioch, M., Bauer, A., Schultz, J., Rick, J.M., Michon, A.M., Cruciat, C.M., et al. 2002. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415: 141–147.[CrossRef][Medline] Gentleman, R.C., Carey, V.J., Bates, D.M., Bolstad, B., Dettling, M., Dudoit, S., Ellis, B., Gautier, L., Ge, Y., Gentry, J., et al. 2004. Bioconductor: Open software development for computational biology and bioinformatics. Genome Biol. 5: R80. doi: 10.1186/gb-2004-5-10-r80.[CrossRef][Medline] Gilmour, S.J., Fowler, S.G., and Thomashow, M.F. 2004. Arabidopsis transcriptional activators CBF1, CBF2, and CBF3 have matching functional activities. Plant Mol. Biol. 54: 767–781.[CrossRef][Medline] Giot, L., Bader, J.S., Brouwer, C., Chaudhuri, A., Kuang, B., Li, Y., Hao, Y.L., Ooi, C.E., Godwin, B., Vitols, E., et al. 2003. A protein interaction map of Drosophila melanogaster. Science 302: 1727–1736. Glawischnig, E., Hansen, B.G., Olsen, C.E., and Halkier, B.A. 2004. Camalexin is synthesized from indole-3-acetaldoxime, a key branching point between primary and secondary metabolism in Arabidopsis. Proc. Natl. Acad. Sci. 101: 8245–8250. Glazebrook, J., Chen, W., Estes, B., Chang, H.S., Nawrath, C., Metraux, J.P., Zhu, T., and Katagiri, F. 2003. Topology of the network integrating salicylate and jasmonate signal transduction derived from global expression phenotyping. Plant J. 34: 217–228.[CrossRef][Medline] Gutierrez, R.A., Lejay, L.V., Dean, A., Chiaromonte, F., Shasha, D.E., and Coruzzi, G.M. 2007. Qualitative network models and genome-wide expression data define carbon/nitrogen-responsive molecular machines in Arabidopsis. Genome Biol. 8: R7. doi: 10.1186/gb-2007-8-1-r7.[CrossRef][Medline] Hirai, M.Y., Sugiyama, K., Sawada, Y., Tohge, T., Obayashi, T., Suzuki, A., Araki, R., Sakurai, N., Suzuki, H., Aoki, K., et al. 2007. Omics-based identification of Arabidopsis Myb transcription factors regulating aliphatic glucosinolate biosynthesis. Proc. Natl. Acad. Sci. 104: 6478–6483. Hotelling, H. 1953. New light on the correlation coefficient and its transforms. J. Royal Stat. Soc. Series B-Stat. Meth. 15: 193–232. Ito, T., Chiba, T., Ozawa, R., Yoshida, M., Hattori, M., and Sakaki, Y. 2001. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl. Acad. Sci. 98: 4569–4574. Jeong, H., Mason, S.P., Barabasi, A.L., and Oltvai, Z.N. 2001. Lethality and centrality in protein networks. Nature 411: 41–42.[CrossRef][Medline] Kaplan, B., Davydov, O., Knight, H., Galon, Y., Knight, M.R., Fluhr, R., and Fromm, H. 2006. Rapid transcriptome changes induced by cytosolic Ca2+ transients reveal ABRE-related sequences as Ca2+-responsive cis elements in Arabidopsis. Plant Cell 18: 2733–2748. Khanin, R. and Wit, E. 2006. How scale-free are biological networks. J. Comput. Biol. 13: 810–818.[CrossRef][Medline] Kikis, E.A., Khanna, R., and Quail, P.H. 2005. ELF4 is a phytochrome-regulated component of a negative-feedback loop involving the central oscillator components CCA1 and LHY. Plant J. 44: 300–313.[CrossRef][Medline] Kilian, J., Whitehead, D., Horak, J., Wanke, D., Weinl, S., Batistic, O., D'Angelo, C., Bornberg-Bauer, E., Kudla, J., and Harter, K. 2007. The AtGenExpress global stress expression data set: Protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses. Plant J. 50: 347–363.[CrossRef][Medline] Kishino, H. and Waddell, P.J. 2000. Correspondence analysis of genes and tissue types and finding genetic links from microarray data. Genome Inform. Ser. Workshop Genome Inform. 11: 83–95.[Medline] Kreps, J.A. and Simon, A.E. 1997. Environmental and genetic effects on circadian clock-regulated gene expression in Arabidopsis. Plant Cell 9: 297–304.[Abstract] Krizek, B.A. and Fletcher, J.C. 2005. Molecular mechanisms of flower development: An armchair guide. Nat. Rev. Genet. 6: 688–698.[CrossRef][Medline] Ledger, S., Strayer, C., Ashton, F., Kay, S.A., and Putterill, J. 2001. Analysis of the function of two circadian-regulated CONSTANS-LIKE genes. Plant J. 26: 15–22.[CrossRef][Medline] Lee, T.I., Rinaldi, N.J., Robert, F., Odom, D.T., Bar-Joseph, Z., Gerber, G.K., Hannett, N.M., Harbison, C.T., Thompson, C.M., Simon, I., et al. 2002. Transcriptional regulatory networks in Saccharomyces cerevisiae. Science 298: 799–804. Lee, H.K., Hsu, A.K., Sajdak, J., Qin, J., and Pavlidis, P. 2004. Coexpression analysis of human genes across many microarray data sets. Genome Res. 14: 1085–1094. Li, H. and Gui, J. 2006. Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks. Biostatistics 7: 302–317. Lopez-Molina, L., Mongrand, S., Kinoshita, N., and Chua, N.H. 2003. AFP is a novel negative regulator of ABA signaling that promotes ABI5 protein degradation. Genes & Dev. 17: 410–418. Magwene, P.M. and Kim, J. 2004. Estimating genomic coexpression networks using first-order conditional independence. Genome Biol. 5: R100. doi: 10.1186/gb-2004-5-12-r100.[CrossRef][Medline] Martins, A.M., Sha, W., Evans, C., Martino-Catt, S., Mendes, P., and Shulaev, V. 2007. Comparison of sampling techniques for parallel analysis of transcript and metabolite levels in Saccharomyces cerevisiae. Yeast 24: 181–188.[CrossRef][Medline] Maruyama, K., Sakuma, Y., Kasuga, M., Ito, Y., Seki, M., Goda, H., Shimada, Y., Yoshida, S., Shinozaki, K., and Yamaguchi-Shinozaki, K. 2004. Identification of cold-inducible downstream genes of the Arabidopsis DREB1A/CBF3 transcriptional factor using two microarray systems. Plant J. 38: 982–993.[CrossRef][Medline] Misson, J., Raghothama, K.G., Jain, A., Jouhet, J., Block, M.A., Bligny, R., Ortet, P., Creff, A., Somerville, S., Rolland, N., et al. 2005. A genome-wide transcriptional analysis using Arabidopsis thaliana Affymetrix gene chips determined plant responses to phosphate deprivation. Proc. Natl. Acad. Sci. 102: 11934–11939. Mittler, R., Kim, Y., Song, L., Coutu, J., Coutu, A., Ciftci-Yilmaz, S., Lee, H., Stevenson, B., and Zhu, J.K. 2006. Gain- and loss-of-function mutations in Zat10 enhance the tolerance of plants to abiotic stress. FEBS Lett. 580: 6537–6542.[CrossRef][Medline] Nakamura, Y., Awai, K., Masuda, T., Yoshioka, Y., Takamiya, K., and Ohta, H. 2005. A novel phosphatidylcholine-hydrolyzing phospholipase C induced by phosphate starvation in Arabidopsis. J. Biol. Chem. 280: 7469–7476. Nikiforova, V., Freitag, J., Kempa, S., Adamik, M., Hesse, H., and Hoefgen, R. 2003. Transcriptome analysis of sulfur depletion in Arabidopsis thaliana: Interlacing of biosynthetic pathways provides response specificity. Plant J. 33: 633–650.[CrossRef][Medline] Nikiforova, V.J., Daub, C.O., Hesse, H., Willmitzer, L., and Hoefgen, R. 2005. Integrative gene-metabolite network with implemented causality deciphers informational fluxes of sulphur stress response. J. Exp. Bot. 56: 1887–1896. Obayashi, T., Kinoshita, K., Nakai, K., Shibaoka, M., Hayashi, S., Saeki, M., Shibata, D., Saito, K., and Ohta, H. 2007. ATTED-II: A database of co-expressed genes and cis elements for identifying co-regulated gene groups in Arabidopsis. Nucleic Acids Res. 35: D863–D869. doi: 10.1093/nar/gki783. Oh, I.S., Park, A.R., Bae, M.S., Kwon, S.J., Kim, Y.S., Lee, J.E., Kang, N.Y., Lee, S., Cheong, H., and Park, O.K. 2005. Secretome analysis reveals an Arabidopsis lipase involved in defense against Alternaria brassicicola. Plant Cell 17: 2832–2847. Pan, X., Ye, P., Yuan, D.S., Wang, X., Bader, J.S., and Boeke, J.D. 2006. A DNA integrity network in the yeast Saccharomyces cerevisiae. Cell 124: 1069–1081.[CrossRef][Medline] Persson, S., Wei, H., Milne, J., Page, G.P., and Somerville, C.R. 2005. Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets. Proc. Natl. Acad. Sci. 102: 8633–8638. Salome, P.A. and McClung, C.R. 2004. The Arabidopsis thaliana clock. J. Biol. Rhythms 19: 425–435.[Abstract] Schäfer, J. and Strimmer, K. 2005a. An empirical Bayes approach to inferring large-scale gene association networks. Bioinformatics 21: 754–764. Schäfer, J. and Strimmer, K. 2005b. Learning large-scale graphical Gaussian models from genomic data. In Proceedings of "Science of Complex Networks: From Biology to the Internet and WWW" (CNET 2004) (eds. J. Mendes et al.). The American Institute of Physics, Aveiro, Portugal. Schäfer, J. and Strimmer, K. 2005c. A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics. Stat. Appl. Genet. Mol. Biol. 4: Article32. Schäfer, J., Opgen-Rhein, R., and Strimmer, K. 2006. Reverse engineering genetic networks using the GeneNet package. R News 6: 50–53 http://cran.r-project.org/doc/Rnews. Schmid, M., Davison, T.S., Henz, S.R., Pape, U.J., Demar, M., Vingron, M., Scholkopf, B., Weig | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||