|
|
|
|
Published online before print
November 30, 2007, 10.1101/gr.6584707 Genome Res. 17:1723-1730, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
Letter Computational and experimental identification of novel human imprinted genes1 Center for Bioinformatics and Computational Biology, Duke University, Durham, North Carolina 27708, USA; 2 Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA; 3 Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA; 4 Department of Radiation Oncology, Duke University Medical Center, Durham, North Carolina 27710, USA; 5 Department of Computer Science, Duke University, Durham, North Carolina 27708, USA
Imprinted genes are essential in embryonic development, and imprinting dysregulation contributes to human disease. We report two new human imprinted genes: KCNK9 is predominantly expressed in the brain, is a known oncogene, and may be involved in bipolar disorder and epilepsy, while DLGAP2 is a candidate bladder cancer tumor suppressor. Both genes lie on chromosome 8, not previously suspected to contain imprinted genes. We identified these genes, along with 154 others, based on the predictions of multiple classification algorithms using DNA sequence characteristics as features. Our findings demonstrate that DNA sequence characteristics, including recombination hot spots, are sufficient to accurately predict the imprinting status of individual genes in the human genome.
A gene is imprinted if the expression of one of its alleles is silenced depending on the parent from which that allele was inherited (Reik and Walter 2001
Identifying imprinted genes experimentally is challenging because the monoallelic expression of an imprinted gene may occur only in one of possibly several isoforms, only in particular tissues, or only at particular stages of development. Consequently, failure to confirm imprinting in a specific tissue at a specific stage of development for a specific splice variant does not eliminate the possibility that a different isoform may be imprinted in some other tissue at some other stage of development. Although estimates of imprinted gene prevalence in the human genome vary, they hover around 1%. Consequently, in the absence of any method for prioritizing genes, an average of 100 genes must be examined (perhaps in a broad range of tissues and at many stages of development) before a new imprinted gene can be identified. Indeed, experimental identification of human imprinted genes to date has been slow: The untranslated mRNA H19 was the first gene shown to be imprinted in human (Zhang and Tycko 1992
We set out to develop a computational method for predicting the genome-wide imprint status of human genes, the output of which could be used to prioritize genes for experimental identification. Since the concentration of certain types of repeated elements and other DNA sequence characteristics has been shown to differ between monoallelically and biallelically expressed genes (Greally 2002
Although we previously demonstrated the feasibility of this kind of approach by identifying novel imprinted genes in the mouse genome (Luedi et al. 2005 Here, we describe a new algorithm for predicting the genome-wide imprint status of human genes directly from sequence features in the human genome. Compared to our previous approach, we have included additional features and introduced additional learning algorithms to significantly reduce the possibility of methodological bias. Further, we focused our attention primarily on high-confidence predictions and demonstrated by cross-validation (CV) and independent testing that our new predictions are at the same time both more sensitive and more selective than before. Finally, we used our predictions to prioritize the experimental identification of two new human imprinted genes on chromosome 8, a chromosome not previously suspected to contain imprinted genes.
Conceptual approach We adopted a more conservative approach in identifying human imprinted genes because of their important role in the etiology of human health conditions. Specifically, we applied two separate classifier learning strategies—one based on support vector machines and the other on sparse logistic regression—each with a different feature selection process. With each strategy, we trained classifiers with two different similarity kernels: linear and radial basis function (RBF). Only genes predicted to be imprinted by all four classifiers were considered "high-confidence" predictions. Although all four classifiers use the same initial training set of known imprinted genes, the combined classifier approach helps to control for biases that might arise from different choices for feature selection, classifier learning, or similarity kernel. All four classifiers were trained on DNA sequence features collected from 40 genes known to be imprinted in human and from 52 genes known not to be imprinted in human, plus 500 randomly selected genes suspected not to be imprinted in human. We assessed the generalization accuracy of the combined classifier by both CV and an independent negative test set. In a 40-fold CV, we obtained a sensitivity of 100% (40/40 imprinted genes correctly identified) and a specificity of 99% (545/552 presumably nonimprinted genes correctly identified). The independent negative test set consisted of 13 genes with random monoallelic expression and 88 genes with biallelic expression or synchronous replication, including four genes imprinted in mouse but not human. We correctly predicted all 101 genes not to be imprinted (see Supplemental Fig. 4 for a schematic depiction of the workflow).
Genome-wide prediction of candidate imprinted genes
The high-density bands 15q12 and 7q21.3 contain exclusively known imprinted genes. Included in the high-density 11p15.5 band are well-known imprinted genes, such as H19 and IGF2, and five novel candidates located further distal, including PKP3, an oncogene involved in lung cancer (Furukawa et al. 2005
Previous efforts to determine the sequence characteristics that discriminate imprinted from nonimprinted genes have demonstrated that imprinted loci are deficient in short interspersed transposable elements (SINEs), particularly in the more ancient MIRs (Greally 2002
Among transcription factor binding sites, those of greatest importance in both feature selection strategies were CEBP, E2F, ICP4, IgPE2, NFuE1, NFuE3, PEA1, PEA2, Sp1, and SRF (Supplemental Fig. 1E). E2F family transcription factors are involved with cell proliferation, Sp1 elements have been shown to protect CpG islands from de novo methylation in the embryo (Brandeis et al. 1994
Prediction of parental preference
We predicted maternal expression for 56% (88/156) of the candidate imprinted genes, comparable to the 64% frequency found for mouse imprinted genes (Luedi et al. 2005
Experimental identification of new imprinted genes
DLGAP2 is highly expressed and alternatively spliced in brain and testis (Ranta et al. 2000
KCNK9 resides at chromosomal location 8q24.3. It encodes for the TASK3 (Twik-like acid-sensitive K+) channel and is associated with a variety of human cancers (Patel and Lazdunski 2004
Parent-of-origin effects were first observed >3000 yr ago by mule breeders (Savory 1970
On the conservation of imprinting across species We wished to examine the concordance between the high-confidence human imprinted candidates and the predictions for their orthologs in mouse. We identified a murine ortholog to 119 of the genes proved or predicted with high confidence to be imprinted in human. Only 38 (32%) of these genes are known or predicted to be imprinted in both species (Supplemental Table 3), and over half of these genes have subtelomeric chromosomal locations. This fraction does not change significantly if the same prediction method we used for the mouse is also applied to the human data. Hence, the lack of greater overlap is not solely due to differences in the statistical methodologies.
High levels of discordance of imprinting status between mouse and human have previously been reported (Morison et al. 2005 Although this disparity in the imprint status of genes in mouse and human may be a consequence of our computational approach, it also raises the possibility that despite their immense popularity as models of human disease, mice may not be a suitable choice for studying diseases resulting principally from the epigenetic deregulation of imprinted genes, or for assessing human risk from environmental factors that alter the epigenome.
On the evolution of imprinting
In a cross-species comparison of imprinted regions between mouse and human, it has also been hypothesized that genomic imprinting may have evolved on the basis of dosage compensation following large-scale duplication events (Walter and Paulsen 2003
Other hypotheses for the evolution of genomic imprinting include the proposition that imprinting is a by-product of a host defense against foreign DNA (Barlow 1993
On imprinting and development
Conclusion
Human genome data DNA sequence and annotation data were obtained from Ensembl (http://www.ensembl.org, Version 20). We used a positive training set of 40 imprinted genes compiled from the Imprinted Gene Catalog (http://igc.otago.ac.nz/) and recent literature, as well as a negative training set of 52 genes, for which experimental evidence suggests biallelic expression. Additionally, we used random sets of 500 control genes presumed to be nonimprinted for a number of tasks. These random control genes were sampled from autosomal chromosomal bands known or not suspected to contain imprinted genes and were intended to represent the overall characteristics of biallelically expressed genes. Random control genes were used to compute top pairwise interaction terms, to carry out feature selection with the Equbits classifier, and finally to supplement the final training set that was used to learn our classifiers. To minimize bias, we resampled the set of 500 random control genes for each of these three tasks.
Feature measurements We introduced another statistic regarding the repetitive elements flanking a gene, which we will term "phase change." We define a phase change as an instance of a repeated element changing its orientation compared to a neighboring element of the same family. We counted the number of such phase changes among retrotransposon classes such as Alus, MIRs, and LTRs within 100 kb upstream and downstream. In doing this, we noticed that within the downstream region of imprinted genes, compared to a random sample, a phase change occurred more frequently in one of the following LTRs: MLT1A0, MLT1B, MSTA, MSTB1, MLT1D, MLT2B4, or MLT1G1. Conversely, phase changes in an MLT1C LTR were underrepresented in the flanking regions of imprinted genes.
We also wanted to study whether data on recombination could be used to discern imprinted genes. We downloaded coordinates of recombination hotspots (Myers et al. 2005
The last additional class of feature measurements involved nucleosome formation potential profiles. Such in silico estimates of nucleosome packaging density in the promoter region has previously been used to distinguish tissue-specific genes from housekeeping genes and widely expressed genes (Levitsky et al. 2001
Statistical methods When using Equbits to predict imprinted genes, a 40-fold CV procedure was used; to prevent overfitting or overestimation of prediction accuracy, feature selection using a linear kernel was performed afresh for each fold of the CV (without inclusion of the genes being held out during that fold). Once features were selected in each fold, classifiers for imprint status with linear and RBF kernels were learned for that fold. The number of retained features ranged from 613–638 during CV, whereas 626 features were retained in the final classifier. When using SMLR to predict imprinted genes, a similar scheme was adopted. At each step of a 40-fold CV, feature selection was performed afresh (without inclusion of the genes being held out during that fold) by applying a sparsity-promoting prior directly on the weights of the features (no kernel). Once features were selected in this manner in each fold, classifiers for imprint status with linear and RBF kernels were learned. During CV, the number of retained features averaged 875, while 820 features were retained in the final classifier. SMLR is written in portable Java, with a GUI, and is available with complete source code under a noncommercial use license from http://www.cs.duke.edu/~amink. In addition, all data, and all scripts used to produce the SMLR results, are also available.
To ensure that no straightforward relationships within the training data were obscured by sophisticated learning methods, we also performed CV using three simple classifiers (as implemented in Weka 3.4) (Witten and Frank 2005
To simplify the prediction of parental expression preference, we used Equbits only with a linear kernel and the top 30 features. This procedure is exactly analogous to that used to predict parental preference in the mouse (Luedi et al. 2005
We used
Experimental procedures First-strand cDNA was primed with gene-specific primers, and synthesized from DNaseI-treated RNA using Superscript II as recommended by the manufacturer (Invitrogen). We used Qiagen Hotstart Taq polymerase (Qiagen Sciences, Inc.) in a 25 µL RT-PCR reaction volume, as per manufacturers instructions. RT-PCR products were separated by electrophoresis on a 1.5% agarose gel, and appropriately-sized fragments of cDNA were excised and gel-extracted (GenElute, Sigma Chemical Co.). Products were sequenced (ABI 377 sequencer, PE Biosystems), and analyzed for expression using FinchTV (Geospiza, Inc.). In order to rule out any stochastic effects, we repeated the PCR and the sequencing reactions at least three times in all cases where exclusive monoallelic expression was observed. All sequencing reactions were performed in both directions. This study was performed in accordance with current regulations and standards of the United States Department of Health and Human Services and National Institutes of Health.
We thank the Birth Defects Research Laboratory at the University of Washington for tissue samples. This work was supported in part by grants from NIH to F.S.D., to R.L.J. (ES13053, T32ES07031), and to R.L.J. and A.J.H. (ES015165); from DOE to R.L.J. (DE-FG02-05ER64101); and from NSF and the Sloan Foundation to A.J.H.
6 Corresponding authors. E-mail amink{at}cs.duke.edu; fax (919) 660-6519.
E-mail jirtle{at}radonc.duke.edu; fax (919) 684-5584. [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6584707
Allen, E., Horvath, S., Tong, F., Kraft, P., Spiteri, E., Riggs, A.D., and Marahrens, Y. 2003. High concentrations of long interspersed nuclear element sequence distinguish monoallelically expressed genes. Proc. Natl. Acad. Sci. 100: 9940–9945. Arsenian, S., Weinhold, B., Oelgeschlager, M., Ruther, U., and Nordheim, A. 1998. Serum response factor is essential for mesoderm formation during mouse embryogenesis. EMBO J. 17: 6289–6299.[CrossRef][Medline] Bantignies, F. and Cavalli, G. 2006. Cellular memory and dynamic regulation of polycomb group proteins. Curr. Opin. Cell Biol. 18: 275–283.[CrossRef][Medline] Barlow, D.P. 1993. Methylation and imprinting: from host defense to gene regulation? Science 260: 309–310. Barlow, D.P., Stoger, R., Herrmann, B.G., Saito, K., and Schweifer, N. 1991. The mouse insulin-like growth factor type-2 receptor is imprinted and closely linked to the Tme locus. Nature 349: 84–87.[CrossRef][Medline] Bartolomei, M.S., Zemel, S., and Tilghman, S.M. 1991. Parental imprinting of the mouse H19 gene. Nature 351: 153–155.[CrossRef][Medline] Blagitko, N., Mergenthaler, S., Schulz, U., Wollmann, H.A., Craigen, W., Eggermann, T., Ropers, H.H., and Kalscheuer, V.M. 2000. Human GRB10 is imprinted and expressed from the paternal and maternal allele in a highly tissue- and isoform-specific fashion. Hum. Mol. Genet. 9: 1587–1595. Brandeis, M., Frank, D., Keshet, I., Siegfried, Z., Mendelsohn, M., Nemes, A., Temper, V., Razin, A., and Cedar, H. 1994. Sp1 elements protect a CpG island from de novo methylation. Nature 371: 435–438.[CrossRef][Medline] Charlier, C., Segers, K., Wagenaar, D., Karim, L., Berghmans, S., Jaillon, O., Shay, T., Weis-Senbach, J., Cockett, N., Gyapay, G., et al. 2001. Human-ovine comparative sequencing of a 250-kb imprinted domain encompassing the callipyge (clpg) locus and identification of six imprinted transcripts: DLK1, DAT, GTL2, PEG11, antiPEG11, and MEG8. Genome Res. 11: 850–862. Crouse, H.V. 1960. The controlling element in sex chromosome behavior. Genetics 45: 1429–1443. DeChiara, T.M., Robertson, E.J., and Efstratiadis, A. 1991. Parental imprinting of the mouse insulin-like growth factor II gene. Cell 64: 849–859.[CrossRef][Medline] Du, Y., Jenkins, N.A., and Copeland, N.G. 2005. Insertional mutagenesis identifies genes that promote the immortalization of primary bone marrow progenitor cells. Blood 106: 3932–3939. Eun Kwon, H. and Taylor, H.S. 2004. The role of HOX genes in human implantation. Ann. N. Y. Acad. Sci. 1034: 1–18. Furukawa, C., Daigo, Y., Ishikawa, N., Kato, T., Ito, T., Tsuchiya, E., Sone, S., and Nakamura, Y. 2005. Plakophilin 3 oncogene as prognostic marker and therapeutic target for lung cancer. Cancer Res. 65: 7102–7110. Greally, J.M. 2002. Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome. Proc. Natl. Acad. Sci. 99: 327–332. Haig, D. and Westoby, M. 1989. Parent-specific gene expression and the triploid endosperm. Am. Nat. 134: 147–155.[CrossRef] Horike, S., Cai, S., Miyano, M., Cheng, J.F., and Kohwi-Shigematsu, T. 2005. Loss of silent-chromatin looping and impaired imprinting of DLX5 in Rett syndrome. Nat. Genet. 37: 31–40.[CrossRef][Medline] Hunter, P. 2007. The silence of genes. Is genomic imprinting the software of evolution or just a battleground for gender conflict? EMBO Rep. 8: 441–443.[CrossRef][Medline] Jirtle, R.L. and Skinner, M.K. 2007. Environmental epigenomics and disease susceptibility. Nat. Rev. Genet. 8: 253–262.[CrossRef][Medline] Kananura, C., Sander, T., Rajan, S., Preisig-Muller, R., Grzeschik, K.H., Daut, J., Derst, C., and Steinlein, O.K. 2002. Tandem pore domain K+-channel TASK-3 (KCNK9) and idiopathic absence epilepsies. Am. J. Med. Genet. 114: 227–229.[CrossRef][Medline] Karolchik, D., Baertsch, R., Diekhans, M., Furey, T.S., Hinrichs, A., Lu, Y.T., Roskin, K.M., Schwartz, M., Sugnet, C.W., Thomas, D.J., et al. 2003. The UCSC Genome Browser Database. Nucleic Acids Res. 31: 51–54. Ke, X., Thomas, S.N., Robinson, D.O., and Collins, A. 2002. The distinguishing sequence characteristics of mouse imprinted genes. Mamm. Genome 13: 639–645.[CrossRef][Medline] Kimura, M.I., Kazuki, Y., Kashiwagi, A., Kai, Y., Abe, S., Barbieri, O., Levi, G., and Oshimura, M. 2004. Dlx5, the mouse homologue of the human-imprinted DLX5 gene, is biallelically expressed in the mouse brain. J. Hum. Genet. 49: 273–277.[CrossRef][Medline] Krishnapuram, B., Figueiredo, M., Carin, L., and Hartemink, A.J. 2005. Sparse multinomial logistic regression: Fast algorithms and generalization bounds. IEEE Trans. Pattern Anal. Mach. Intell. 27: 957–968.[CrossRef][Medline] LaSalle, J. and Lalande, M. 1996. Homologous association of oppositely imprinted chromosomal domains. Science 272: 725–728.[Abstract] Lee, S.H., Davison, J.A., Vidal, S.M., and Belouchi, A. 2001. Cloning, expression and chromosomal location of NKX6B to 10q26, a region frequently deleted in brain tumors. Mamm. Genome 12: 157–162.[CrossRef][Medline] Levitsky, V.G., Podkolodnaya, O.A., Kolchanov, N.A., and Podkolodny, N.L. 2001. Nucleosome formation potential of eukaryotic DNA: Calculation and promoters analysis. Bioinformatics 17: 998–1010. Li, T., Vu, T.H., Lee, K.O., Yang, Y., Nguyen, C.V., Bui, H.Q., Zeng, Z.L., Nguyen, B.T., Hu, J.F., Murphy, S.K., et al. 2002. An imprinted PEG1/MEST antisense expressed predominantly in human testis and in mature spermatozoa. J. Biol. Chem. 277: 13518–13527. Luedi, P.P., Hartemink, A.J., and Jirtle, R.L. 2005. Genome-wide prediction of imprinted murine genes. Genome Res. 15: 875–884. Mager, J., Montgomery, N.D., de Villena, F.P., and Magnuson, T. 2003. Genome imprinting regulated by the mouse Polycomb group protein Eed. Nat. Genet. 33: 502–507.[CrossRef][Medline] Mancini-Dinardo, D., Steele, S.J., Levorse, J.M., Ingram, R.S., and Tilghman, S.M. 2006. Elongation of the Kcnq1ot1 transcript is required for genomic imprinting of neighboring genes. Genes & Dev. 20: 1268–1282. McInnis, M.G., Lan, T.H., Willour, V.L., McMahon, F.J., Simpson, S.G., Addington, A.M., MacKinnon, D.F., Potash, J.B., Mahoney, A.T., Chellis, J., et al. 2003. Genome-wide scan of bipolar disorder in 65 pedigrees: Supportive evidence for linkage at 8q24, 18q22, 4q32, 2p12, and 13q12. Mol. Psychiatry 8: 288–298.[CrossRef][Medline] Medhurst, A.D., Rennie, G., Chapman, C.G., Meadows, H., Duckworth, M.D., Kelsell, R.E., Gloger, I., and Pangalos, M.N. 2001. Distribution analysis of human two pore domain potassium channels in tissues of the central nervous system and periphery. Brain Res. Mol. Brain Res. 86: 101–114.[Medline] Miltenberger, R.J., Sukow, K.A., and Farnham, P.J. 1995. An E-box-mediated increase in cad transcription at the G1/S-phase boundary is suppressed by inhibitory c-Myc mutants. Mol. Cell. Biol. 15: 2527–2535.[Abstract] Moens, C.B. and Selleri, L. 2006. Hox cofactors in vertebrate development. Dev. Biol. 291: 193–206.[CrossRef][Medline] Monk, D., Arnaud, P., Apostolidou, S., Hills, F.A., Kelsey, G., Stanier, P., Feil, R., and Moore, G.E. 2006. Limited evolutionary conservation of imprinting in the human placenta. Proc. Natl. Acad. Sci. 103: 6623–6628. Morison, I.M., Ramsay, J.P., and Spencer, H.G. 2005. A census of mammalian imprinting. Trends Genet. 21: 457–465.[CrossRef][Medline] Murphy, S.K. and Jirtle, R.L. 2003. Imprinting evolution and the price of silence. Bioessays 25: 577–588.[CrossRef][Medline] Muscheck, M., Sukosd, F., Pesti, T., and Kovacs, G. 2000. High density deletion mapping of bladder cancer localizes the putative tumor suppressor gene between loci D8S504 and D8S264 at chromosome 8p23.3. Lab. Invest. 80: 1089–1093.[Medline] Mustanski, B.S., Dupree, M.G., Nievergelt, C.M., Bocklandt, S., Schork, N.J., and Hamer, D.H. 2005. A genomewide scan of male sexual orientation. Hum. Genet. 116: 272–278.[CrossRef][Medline] Myers, S., Bottolo, L., Freeman, C., McVean, G., and Donnelly, P. 2005. A fine-scale map of recombination rates and hotspots across the human genome. Science 310: 321–324. Niu, X., Renshaw-Gegg, L., Miller, L., and Guiltinan, M.J. 1999. Bipartite determinants of DNA-binding specificity of plant basic leucine zipper proteins. Plant Mol. Biol. 41: 1–13.[CrossRef][Medline] Okita, C., Meguro, M., Hoshiya, H., Haruta, M., Sakamoto, Y., and Oshimura, M. 2003. A new imprinted cluster on the human chromosome 7q21-q31, identified by human-mouse monochromosomal hybrids. Genomics 81: 556–559.[CrossRef][Medline] de Pardo-Manuel Villena, F., de la Casa-Esperon, E., and Sapienza, C. 2000. Natural selection and the function of genome imprinting: Beyond the silenced minority. Trends Genet. 16: 573–579.[CrossRef][Medline] Patel, A.J. and Lazdunski, M. 2004. The 2P-domain K+ channels: role in apoptosis and tumorigenesis. Pflugers Arch. 448: 261–273.[CrossRef][Medline] Pearson, W.R. and Lipman, D.J. 1988. Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. 85: 2444–2448. Ranta, S., Zhang, Y., Ross, B., Takkunen, E., Hirvasniemi, A., de la Chapelle, A., Gilliam, T.C., and Lehesjoki, A.E. 2000. Positional cloning and characterisation of the human DLGAP2 gene and its exclusion in progressive epilepsy with mental retardation. Eur. J. Hum. Genet. 8: 381–384.[CrossRef][Medline] Reik, W. and Walter, J. 2001. Genomic imprinting: Parental influence on the genome. Nat. Rev. Genet. 2: 21–32.[Medline] Savory, T. 1970. The mule. Sci. Am. 223: 102–109.[Medline] Schratt, G., Weinhold, B., Lundberg, A.S., Schuck, S., Berger, J., Schwarz, H., Weinberg, R.A., Ruther, U., and Nordheim, A. 2001. Serum response factor is required for immediate-early gene activation yet is dispensable for proliferation of embryonic stem cells. Mol. Cell. Biol. 21: 2933–2943. Seitz, H., Youngson, N., Lin, S.P., Dalbert, S., Paulsen, M., Bachellerie, J.P., Ferguson-Smith, A.C., and Cavaille, J. 2003. Imprinted microRNA genes transcribed antisense to a reciprocally imprinted retrotransposon-like gene. Nat. Genet. 34: 261–262.[CrossRef][Medline] Soulez, M., Rouviere, C.G., Chafey, P., Hentzen, D., Vandromme, M., Lautredou, N., Lamb, N., Kahn, A., and Tuil, D. 1996. Growth and differentiation of C2 myogenic cells are dependent on serum response factor. Mol. Cell. Biol. 16: 6065–6074.[Abstract] Strichman-Almashanu, L.Z., Lee, R.S., Onyango, P.O., Perlman, E., Flam, F., Frieman, M.B., and Feinberg, A.P. 2002. A genome-wide screen for normally methylated human CpG islands that can identify novel imprinted genes. Genome Res. 12: 543–554. Umlauf, D., Goto, Y., Cao, R., Cerqueira, F., Wagschal, A., Zhang, Y., and Feil, R. 2004. Imprinting along the Kcnq1 domain on mouse chromosome 7 involves repressive histone methylation and recruitment of Polycomb group complexes. Nat. Genet. 36: 1296–1300.[CrossRef][Medline] Vandromme, M., Gauthier-Rouviere, C., Carnac, G., Lamb, N., and Fernandez, A. 1992. Serum response factor p67SRF is expressed and required during myogenic differentiation of both mouse C2 and rat L6 muscle cell lines. J. Cell Biol. 118: 1489–1500. Walter, J. and Paulsen, M. 2003. The potential role of gene duplications in the evolution of imprinting mechanisms. Hum. Mol. Genet. 12: 215–220.[CrossRef] Witten, I.H. and Frank, E. 2005. Data mining: Practical machine learning tools and techniques. Morgan Kaufmann, San Francisco. 2d ed. Yoder, J.A., Walsh, C.P., and Bestor, T.H. 1997. Cytosine methylation and the ecology of intragenomic parasites. Trends Genet. 13: 335–340.[CrossRef][Medline] Zara, F., Bianchi, A., Avanzini, G., Di Donato, S., Castellotti, B., Patel, P.I., and Pandolfo, M. 1995. Mapping of genes predisposing to idiopathic generalized epilepsy. Hum. Mol. Genet. 4: 1201–1207. Zhang, Y. and Tycko, B. 1992. Monoallelic expression of the human H19 gene. Nat. Genet. 1: 40–44.[CrossRef][Medline]
Received April 6, 2007; accepted in revised format August 31, 2007. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||