Cloning of Two Human Homologs of the Drosophila single-minded Gene SIM1 on Chromosome 6q and SIM2 on 21q Within the Down Syndrome Chromosomal Region

  1. Roman Chrast1,2,
  2. Hamish S. Scott1,
  3. Haiming Chen1,
  4. Jun Kudoh4,
  5. Colette Rossier1,
  6. Shinsei Minoshima4,
  7. Yimin Wang4,
  8. Nobuyoshi Shimizu4, and
  9. Stylianos E. Antonarakis1,2,3,5
  1. 1Laboratory of Human Molecular Genetics, Department of Genetics and Microbiology and 2Graduate Program of Cellular and Molecular Biology, Geneva University Medical School, 1211 Geneva, Switzerland; 3Division of Medical Genetics, Cantonal Hospital of Geneva, 1211 Geneva, Switzerland; 4Department of Molecular Biology, Keio University School of Medicine, Tokyo 160, Japan

Abstract

As part of our effort to clone genes of human chromosome 21 that may contribute to Down syndrome, we have previously isolated four exons with homology to Drosophila single-minded (sim) gene, which encodes a transcription factor that is a master regulator of fruit fly neurogenesis. These exons were used to clone and characterize two human homologs of the Drosophila sim gene, SIM1 and SIM2, which map to chromosomes 6q16.3–q21 and 21q22.2, respectively;SIM2 maps within the so-called Down syndrome chromosomal region. Recently, two mouse homologs, Sim1 and Sim2, also have been identified. There is a high level of homology among human, mouse, andDrosophila sim genes in their amino-terminal half where the conserved bHLH, PAS1, PAS2, and HST domains are present. In contrast, the carboxy-terminal parts are only homologous between SIM1 and Sim1 and SIM2 and Sim2. Two isoforms (SIM2 and SIM2s) of human SIM2 have been detected that differ in their 3′ ends. Northern blot analysis revealed one mRNA SIM1 species of ∼9.5 kb and four different mRNA SIM2 species of 2.7, 3, 4.4, and 6 kb in human fetal kidney. The function of both human SIM1 and SIM2 is unknown. However, three copies of SIM2 may contribute to some specific Down syndrome phenotypes because of (1) mapping position, (2) potential function as transcriptional repressor, (3) likely dimerization with other transcription factors, (4) the temporal and spatial expression pattern of mouse Sim2, and (5) the potentially analogous role of human SIM2 to that of Drosophila sim during neurogenesis.

[The sequence data described in this paper have been submitted to GenBank under accession nos. U70212, U80456, U80457, and AB003185.]

Down syndrome (DS) is the most common autosomal aneuploidy occurring in 1.03 to 1.30 of 1000 live births (Epstein 1995). It is caused by the presence of three copies of human chromosome 21 (HC21). A minimum region of HC21 [Down syndrome chromosomal region (DSCR)] that in triplication is associated with many phenotypic characteristics of DS has been defined between D21S17 and ETS2 (McCormick et al. 1989; Delabar et al. 1993). The DSCR concept was challenged as some patients with DS phenotypes have triplicated regions of HC21 proximal to this DSCR (Korenberg et al. 1994). However, the majority of partial trisomy 21 DS patients have three copies of the DSCR.

To identify genes that contribute to phenotypes of DS, we have performed exon trapping (Chen et al. 1996) on HC21 cosmids and identified a human homolog of the Drosophila single-mindedgene (sim) that maps within the DSCR on HC21 (Chen et al. 1995). This localization subsequently was confirmed by others (Dahmane et al. 1995; Osoegawa et al. 1996).

The Drosophila sim gene is the master regulator of fruit fly neurogenesis (Thomas et al. 1988; Nambu et al. 1991). Sim protein is a transcription factor containing a basic helix–loop–helix (bHLH) motif, two PAS (PER/ARNT/SIM) domains, and the HST (HIF1-α/SIM/TRH) domain (Nambu et al. 1991; Isaac and Andrew 1996). During the course of this study, two mouse homologs of the sim gene (Sim1 and Sim2) were cloned (Ema et al. 1996a,b; Fan et al. 1996; Moffett et al. 1996;Yamaki et al. 1996). Sim1 maps on mouse chromosome 10 andSim2 on mouse chromosome 16 in a region of synteny with HC21 (Fan et al. 1996). Both mouse Sim1 and Sim2 genes are expressed early [Sim2 from embryonic day 8.0 (E8.0) and Sim1 from E9.0] in developing forebrain (Ema et al. 1996b; Fan et al. 1996; Moffett et al. 1996;Yamaki et al. 1996) and outside the central nervous system (CNS), in somites, mesonephric duct, and foregut (Sim1), in facial and trunk cartilage, trunk muscles (Sim2), and in the developing kidney (Sim1 and Sim2) (Dahmane et al. 1995; Ema et al. 1996b; Fan et al. 1996; Moffett et al. 1996). In adult mouse, both Sim1 and Sim2 are expressed in kidney and skeletal muscles, whereas Sim2 is also expressed in the lung (Ema et al. 1996a,b; Moffett et al. 1996).

Here we describe the cloning of the cDNAs for two human homologs (SIM1 and SIM2) of the Drosophila sim gene, and mapping ofSIM1 to chromosome 6q16.3–q21 (human SIM2 was mapped previously to 21q22.2; Chen et al. 1995). The function of the two mammalian SIM genes is unknown, but three copies of SIM2 may contribute to some DS phenotypes.

RESULTS

Cloning of the Human SIM1 cDNA

Northern blot analysis using, as a probe, four trapped exons (see Methods) homologous to the Drosophila sim gene revealed expression only in human fetal kidney from the 16 adult and four fetal tissues tested (data not shown). A human fetal kidney cDNA library subsequently was screened using the same probe. The inserts of 14 positive plaques were partially sequenced; these sequences corresponded to two different SIM genes. Because during this study two mouse homologs of the Drosophila sim gene, Sim1 [GenBank accession nos. U40575 (Fan et al. 1996) and D79209 (Ema et al. 1996b)] and Sim2 [GenBank accession nos. U40576 (Fan et al. 1996), D63383 (Ema et al. 1996a), D64135 (Yamaki et al. 1996), and U42554 (Moffett et al. 1996)], were also described, we grouped the clones according to their homology to the corresponding mouse genes. Clone rch11 was 3.9 kb long and contained the complete predicted coding region of a human homolog of the mouse Sim1 gene (Fig. 1). The human SIM1 (GenBank accession no. U70212) sequence predicted an open reading frame of 766 amino acids. The human SIM1 and mouse Sim1 genes showed a striking similarity. There is 87.4% nucleotide and 96.0% amino acid sequence identity between the SIM1 and Sim1 genes. The published mouse Sim1 sequence (GenBank accession no. U40575; Fan et al. 1996) contains an error in the translation termination codon region; the corrected sequence encodes for a longer predicted protein of 764 amino acids (T2332GAAAAGGC to TGTTTTGGC; C.-M. Fan, pers. comm.). For our analysis we have used the mouse Sim1 (GenBank accession no. D79209) of Ema et al. (1996b). The initiation codon of SIM1 corresponded to that of the mouse Sim1 gene and conforms to the Kozak consensus sequence (Kozak 1987). The SIM1 protein (predicted molecular weight of 85 kD) contains a bHLH (amino acid residues 2–50), PAS1 (98–140), PAS2 (238–292), and HST (295–333) domains. The carboxyl-terminus contains a Ser-rich region (352–766; Fig. 2).

Figure 1.

Comparison of human, mouse, and Drosophila SIM-predicted polypeptides. SIM1 (GenBank accession no. U70212) and SIM2 (U80456), human SIMs; Sim1 (D79209) and Sim2 (D63383), mouse SIMs; dsim (M19020),Drosophila single minded. SIM2s (U80457) represents the 570-amino-acid short isoform of SIM2. An asterisk (*) denotes identical and a period (.) denotes conserved amino acids, respectively. Amino acid color codes: (Red) Identical among all sequences; (blue) conserved among all sequences; (purple) identical between mouse and human SIM1; (turquoise) identical between mouse and human SIM2; (dark yellow) identical among all mammalian SIM proteins. SIM2s is shown from its point of divergence with SIM2.

Figure 2.

The conserved domains bHLH, PAS1, PAS2, and HST of the SIMs are shown schematically together with their alignments to domains from additional proteins. (Red) Identical amino acids; (blue) conserved amino acids. The GenBank numbers of the different SIM sequences used are as in Fig.1; trh (U33427), Drosophila trachealess; sima (U43090),Drosophila similar; hHIF-1a (U22431), humanhypoxia-induciblefactor-1α. SIM2s represents the 570-amino-acid short isoform of SIM2. An asterisk (*) denotes identical and a period (.) denotes conserved amino acids.

Mapping of SIM1 on Chromosome 6

Sim1 has been mapped to mouse chromosome 10 between markers Fyn and Ros1 (Fan et al. 1996). By synteny, the SIM1 gene was predicted to map on human chromosome 6 as confirmed by Muenke et al. (1995) using monochromosomal human–rodent hybrids. We have mapped by fluorescence in situ hybridization (FISH) SIM1 on 6q16 using human bacterial artificial chromosome (BAC) KB153H1 containing the SIM1 gene as a probe (Fig. 3). To refine the localization of human SIM1 gene we have used PCR [with oligonucleotides from the 3′ untranslated region (UTR) of SIM1 cDNA] of the Genebridge 4 radiation hybrid panel. The results of this analysis showed that SIM1 mapped between markers WI-6516 and WI-6530 on 6q16.3–q21 [2.74 cR distal to WI-6516 (lod > 3.0)]. This interval is also defined by polymorphic markers D6S1041 and D6S468. Figure 4 summarizes the mapping positions of the mouse and human SIM genes.

Figure 3.

Chromosomal localization of the BAC clone KB153H1 containing the human SIM1 gene to 6q16 by FISH and Q-banding. (A) Metaphase chromosomes of human B-lymphoblastoid GM130B cell line hybridized with the BAC clone KB153H1. (B) Representative images of human chromosome 6. Hybridization signals appeared in the same position, 6q16, in all cases.

Figure 4.

Mapping position of human and mouse SIM1 and SIM2genes. The localization in mouse chromosomes was reported in Fan et al. (1996). Previously we have mapped SIM2 to chromosome 21 (Chen et al. 1995). SIM1 has been mapped to chromosome 6q16.3–q21 between markers WI-6516 and WI-6530 using the Genebridge 4 radiation hybrid panel. Diagrams are not to scale.

Cloning of the Human SIM2 cDNA

The positive clones from the library screen that did not correspond to human SIM1 were homologous to the mouse Sim2 gene and were designated human SIM2. Sequencing of several clones (rch2, rch3, and rch10) revealed an ORF extending from codon 4 to 525 of mouse Sim2. To obtain additional sequences from both ends of the human cDNA, 5′ and 3′ RACE were performed on human fetal kidney poly(A)+ mRNA. By homology to mouse Sim2, 5′ rapid amplification of cDNA ends (RACE) products completed the sequence of the 5′ coding region. The products obtained from the 3′ RACE did not differ from the 3′ ends of cDNA clones rch3 and rch10; all of these sequences were homologous to mouse Sim2 until codon 525 and then extended for an additional 1174 bp that ended with a poly(A) region and contained an additional in-phase ORF of 45 codons with no significant homologies. These sequences predicted a polypeptide of 570 amino acids, which we named SIM2 short form (SIM2s; GenBank accession no. U80457). A genomic sequence homologous to mouse Sim2 derived from the human BAC clone KB594G10 was used to determine additional sequences of SIM2 (SIM2; GenBank accession no. U80456) corresponding to the last 132 amino acids of mouse Sim2. Human BAC clone KB594G10 was mapped to HC21 by FISH (data not shown) and its 136-kb genomic sequence was determined completely (J. Kudoh, S. Minoshima, and N. Shimizu, in prep.). The predicted ORF of the long form of SIM2 was 667 amino acids long. The human SIM2 and mouse Sim2 genes are highly homologous; there is 86.7% nucleotide and 89.6% amino acid sequence identity between these two sequences. The translation initiation codon of SIM2 and SIM2s corresponds to that of mouse Sim2 and also conforms to the Kozak consensus (Kozak 1987). The SIM2 (predicted molecular mass 74-kD) and SIM2s proteins (predicted molecular mass 64-kD) both contain bHLH, PAS1, PAS2, and HST domains in the same position as described for SIM1 (Figs. 1 and 2; Table 1). The predicted carboxyl terminus of SIM2 and SIM2s both contain Ser/Thr-rich regions (from S348 to T366), Pro/Ser-rich regions (between P385 and S503), Pro/Ala-rich regions (from P504 to P544), and positively charged regions between R367 and R382. In addition, SIM2 contains two Pro/Ala-rich regions (from P553 to P596 and P611 to P644) and a positively charged region (from K559 to R575) (Fig. 2).

Table 1.

Percent Identity among Human, Mouse, andDrosophila SIMs

Northern Blot Analysis of Human SIM1 and SIM2

Northern blot analysis using four trapped exons (see Methods) from the conserved region between SIM1 and SIM2 genes revealed five hybridizing mRNA species of ∼9.5, 6, 4.4, 3, and 2.7 kb only in fetal kidney from 20 different human adult and fetal tissues tested (data not shown). Hybridization using the 3′ half of SIM1 between nucleotides 1364 and 1610 (which is not homologous to SIM2), revealed only the 9.5-kb mRNA species in fetal kidney (Fig. 5, left). In contrast, a SIM2-specific probe (from nucleotide 1140 to 1433) hybridized to the mRNA species of ∼6, 4.4, 3, and 2.7 kb (Fig. 5, right). The 9.5-kb mRNA is therefore specific for SIM1, and the other four mRNA species are specific for SIM2. The different SIM2-hybridizing mRNAs could be attributable to different 5′ or 3′ UTRs, alternative splicing, and/or overlapping transcripts.

Figure 5.

Northern blot analysis of SIM1 and SIM2 expression. The nonhomologous carboxy-terminal parts of SIM1 and SIM2 cDNAs were used as specific probes against filters containing poly(A)+ RNA from several human tissues (see Methods). The same filters were used sequentially for both hybridizations. Both genes are expressed in fetal kidney; SIM1 revealed an mRNA species of ∼9.5 kb (left), whereas SIM2 showed mRNA species of ∼2.7, 3, 4.4, and 6 kb (right).

The Conserved Domains in Human, Mouse, and Drosophila simGenes

All five cDNAs (Drosophila sim, mouse Sim1 and Sim2, and human SIM1 and SIM2) and their predicted polypeptides are remarkably similar to each other in the amino terminus half up to the amino acid corresponding to K351 of SIM1 (Fig. 1). Table 1 displays the percent identity among different regions of the sequences examined. The amino acid sequence identity among the five predicted polypeptides for their first half is between 71% and 99%. Within this region of remarkable homology, all of these proteins contain four recognized domains, namely bHLH, PAS1, PAS2, and HST, often seen in transcription factors (Fig.1). Figure 2 shows a partial amino acid alignment of proteins that contain all four of these domains. These proteins, in addition to the five already mentioned, include the Drosophila trachealess(trh; Isaac and Andrew 1996), Drosophila similar(sima; Nambu et al. 1996), and human hypoxia-inducible factor 1α (HIF-1α; Wang et al. 1995). Within the bHLH domain, all eight proteins have 69% identical and conserved amino acids. These percentages for the PAS1, PAS2, and HST domains are 53%, 45%, and 56%, respectively.

In the carboxy-terminal parts of the predicted amino acid sequences, there are no apparent homologies between Drosophila sim and its mammalian homologs (Fig. 1). However, SIM1 and SIM2 showed high conservation of sequences in the carboxy-terminal part when compared with mouse Sim1 and Sim2, respectively (Table 1; Figs. 1 and 2).

DISCUSSION

We have isolated the cDNAs of two human homologs of theDrosophila sim gene that is involved in neuronal development. We have mapped SIM1 to 6q16.3–q21, whereas SIM2 maps to 21q22.2. The two mouse homologs of sim have also been described: Sim1 (Ema et al. 1996b; Fan et al. 1996) andSim2 (Ema et al. 1996a; Fan et al. 1996; Moffett et al. 1996;Yamaki et al. 1996), which have 96% and 89.6% amino acid sequence identity to human SIM1 and SIM2, respectively. cDNA library screening (fetal kidney) and 3′ RACE (on fetal kidney mRNA) resulted only in a short isoform of SIM2 (SIM2s). Genomic sequencing revealed a longer form of SIM2 with homology to the entire mouse Sim2 cDNA. RT–PCR using oligonucleotides derived from the sequences of the long form failed to amplify the expected long form product from fetal kidney mRNA. However, this result is consistent with the observation that the sequence corresponding to the carboxy-terminal coding region of SIM2 gene was difficult to amplify by PCR, even from cloned DNA fragments. The short isoform, SIM2s, is probably attributable to the use of alternative 3′ UTR contained within intron 10 of SIM2 or mispriming on an A-rich sequence within this intron. The carboxyl terminus of SIM2 (from A526 to R667) is encoded by exon 11, which is separated from exon 10 by a 2528-bp intron 10. The nucleotide sequence encoding the carboxyl terminus of SIM2s (from G526 to K570) starts from the last nucleotide of exon 10 and continues to the sequence of intron 10 (GenBank accession no. AB003185; J. Kudoh, S. Minoshima, and N. Shimizu, in prep.). Whether SIM2s is indeed a functional isoform in different tissues and stages of development remains unknown.

Each of the human, murine, and Drosophila SIM genes contains highly conserved bHLH, PAS1, PAS2, and HST domains. The bHLH domain, present in many transcription factors, including myc, MyoD, and E2A (Murre et al. 1989), contains two amphipathic α helices separated by a variable loop, which participate in the formation of dimers, and a basic region that mediates DNA binding (Murre et al. 1989; Davis et al. 1990). The PAS1 and PAS2 domains were first described in the Drosophila sim gene (Nambu et al. 1991) and are present in other proteins including (1) the Drosophila period (per) (Hardin et al. 1990); (2) the vertebrate aryl hydrocarbon (Ah) receptor (AhR) and (3) AhR nuclear translocators (Arnt and Arnt2) (Hoffman et al. 1991; Reyes et al. 1992; Hirose et al. 1996); (4) the human HIF-1α (Wang et al. 1995); (5) theDrosophila trh (Isaac and Andrew 1996), and (6) theDrosophila sima (Nambu et al. 1996). PAS domains are involved in the dimerization of these proteins (Huang et al. 1993; Sogawa et al. 1995b). The HST domain is conserved among all sim genes, trh, sima, and HIF-1α; its function is presently unknown.

It is likely that both human SIM proteins are transcription factors that form dimers because they contain the domains described above. Both mouse Sim proteins can heterodimerize with Arnt and Arnt2 (Ema et al. 1996b); however, it is possible that there are additional partners for SIM because only the known PAS-containing proteins (Arnt, Arnt2, Ahr, and HIF1-α) have been tested for heterodimerization with Sim. Ema et al. (1996b) also showed that Sim1 and Sim2 do not form homodimers or dimerize with each other. The carboxy-terminal parts of mouse Sim1 and Sim2 are different from those of other bHLH/PAS family members (Arnt, Arnt2, Ahr, HIF-1α, and sim); this can explain the fact that both Sim1 and Sim2 function as transcriptional repressors, whereas all of the other proteins mentioned function as transcriptional activators (Franks and Crews 1994; Jain et al. 1994; Sogawa et al. 1995a; Hirose et al. 1996). No specific domain in the carboxy-terminal half of SIM1 (from amino acid 352) has been recognized, except that this region is rich in serine residues, as in mouse Sim1 (Ema et al. 1996b). In the carboxy-terminal half of SIM2 there are regions similar to those in mouse Sim2 (Yamaki et al. 1996) rich in serine/threonine, proline/serine, and proline/alanine. Domains rich in serine/threonine and proline residues are present in both transcription repressors (Madden et al. 1991) and transcription activators (Mermod et al. 1989;Franks and Crews 1994). Based on the amino acid homologies of human SIM1 and SIM2 to their mouse counterparts it is logical to predict that they also could function as transcriptional repressors.

Because the carboxy-terminal half of Drosophila sim is not homologous to those found in human SIM1/SIM2 and mouse Sim1/Sim2, it is possible that these vertebrate homologs of the Drosophila simgene do not represent its true homologs. There may be another vertebrate SIM gene with homology spanning the 3′ half or theDrosophila sim, although such mammalian expressed sequence tags (ESTs) have not been found. It is also possible that theDrosophila sim transcriptional activation domains in the carboxyl terminus had changed during evolution and this region of the protein had acquired repression function.

Phenotypes that may be associated with deleterious mutations inSIM1 and SIM2 are unknown. No candidate Mendelian or complex phenotype has been mapped to the chromosomal localizations of the human and mouse single-minded homologs. Mice with targeted disruption of either Sim1 or Sim2 may contribute to the elucidation of their involvement in normal mammalian development and pathological conditions associated with null mutations. The study of the phenotype of mice heterozygous for a disrupted Sim2 gene will potentially elucidate the contribution of this gene in partial monosomy 21 that includes 21q22.2 (Huret et al. 1995).

SIM2, with its three copies, is an exceptional candidate to be associated with certain phenotypes of DS because (1) it maps to the chromosomal region that is present in three copies in the majority of DS individuals with partial trisomy 21 (Delabar et al. 1993); (2) theDrosophila sim is a master regulator of fruit fly neurogenesis and mutations in the sim gene result in loss of precursor cells that give rise to the midline cells of the embryonic CNS (Crews et al. 1988; Nambu et al. 1991); (3) the mouse Sim2 gene is expressed early (E8.0) in the forebrain of mouse embryo (Ema et al. 1996b; Fan et al. 1996; Moffett et al. 1996; Yamaki et al. 1996). It has been shown that DS patients have selective anatomic abnormalities in brain compared with normals (Raz et al. 1995; Kesslak et al. 1994). Outside the CNS, Sim2 is expressed in facial and trunk cartilage, trunk muscles, and kidney (Dahmane et al. 1995; Ema et al. 1996b; Fan et al. 1996; Moffett et al. 1996). We have confirmed these data using whole-mount in situ hybridization of E11.5 mouse embryos with an antisense Sim2-specific probe (R. Chrast, Y. Herault, S. Antonarakis, and D. Duboule, unpubl.). In addition, (4) SIM2 is a potential transcriptional repressor, and (5) it may form heterodimers. Its overexpression may alter the kinetics of dimer formation resulting in abnormal regulation of downstream genes. The generation of Sim2 transgenic mice may enhance our understanding of the involvement of this gene in DS. A mouse model for DS, attributable to partial mouse trisomy 16, has been reported (Reeves et al. 1995). This model contains three copies of Sim2, among many other genes. Mating of this animal with a Sim2 knockout mouse would result in partial trisomy 16 mice with two copies of Sim2 and thus may elucidate the contribution of Sim2 to the phenotype in the context of this partial trisomy 16.

METHODS

Cloning of Human SIM1 and SIM2 cDNAs

To clone the full-length coding sequences of the human SIM1 and SIM2 cDNAs ∼106 PFUs of a human fetal kidney 5′ stretch λDR2 cDNA library (Clontech, HL1150x) was screened using a probe derived from a mixture of four previously trapped exons homologous to the Drosophila sim gene (HMC13F06, HMC29C01, HMC05F04, and RCH03D09; GenBank accession nos. X83514, X83515, X83513, and X83516, respectively) (Chen et al. 1995). Radioactive32P-labeling by PCR was performed as described (Scott et al. 1995), except that the exon-trapping vector pSPL3 (GIBCO BRL, 18449-017)-derived oligonucleotides dUSD2 (5′-CUACUACUACUAGTGAACTGCACTGTGACAAGCTGC-3′) and dUSA4 (5′-CUACUACUACUACACCTGAGGAGTGAATTGGTCG-3′) were used for PCR amplification. The inserts of positive phages were converted to plasmids (pDR2) as described (Clontech, PT1011-1) and sequenced using the ABI373A sequencer. The sequence contigs of SIM1 and SIM2 cDNAs were made using the SeqEd program (ABI); nucleotide and predicted amino acid homologies were analyzed by use of BLAST (Altschul et al. 1990). Analyses of the resulting sequences were performed with numerous programs of the Tools option of EXPASY (http://expasy.hcuge.ch/www/tools.html). Multiple sequence alignments were made with ClustalW 1.6 (http://dot.imgen.bcm.tmc.edu:9331/multi-align/multi-align.html).

To obtain 5′ and 3′ coding sequences of SIM2 cDNA, RACE was performed on fetal kidney poly(A)+ mRNA (Clontech, 6526-1) using a Marathon cDNA amplification kit (Clontech, K1802-1). Double-stranded cDNA synthesis and adapter ligation to the synthesized cDNAs were carried out according to manufacturer’s protocol PT1115-1. A first-round RT–PCR amplification was performed on the synthesized cDNA with primers AP1 (see protocol) and SM2RE2 (5′-AGTGCGATCCCAGCTCCTTGGCGA-3′) for the 5′ RACE or AP1 and SM2F03 (5′-GCACAATGCCAGCCAGCGGTGAAT-3′) for the 3′ RACE. A nested PCR amplification was then performed with primers AP2 (see protocol) and SM2RE1 (5′-GAAGACGGCGCGCATCTTCAGGTA-3′) for the 5′ RACE or AP2 and SM2F04 (5′-GAATGCCAGTGGCATTATGCCAAC-3′) for the 3′ RACE. The PCR products have been purified using QIAquick PCR purification kit (Qiagen) and subjected to sequencing.

Only the short form of SIM2 (SIM2s) has been found by cDNA library screening or 3′ RACE; the sequence of the long form of SIM2, homologous to mouse Sim2, was determined by genomic sequencing of the appropriate fragment of human BAC clone KB594G10 as described (Kawasaki et al. 1997).

Northern Blot Analysis

PCR-derived probes of 246 and 293 bp from the 3′ half of SIM1 and SIM2 cDNAs, respectively, which showed no homology to each other, were prepared as described (Scott et al. 1995) using oligonucleotides S11D09F3 5′-GCTGGTGGAAGAGAGGCATT-3′ and S11D09R4 5′-TGGAGAACTGACCACACTAT-3′ for SIM1, and S2F04F2 5′-GCTGAGAACAAACCCTTACC-3′ and S2F04R3 5′-AAGCGTGCCACCTCACAA-3′ for SIM2. The human adult and fetal multiple tissue Northern blots (Clontech, 7756-1, 7760-1, and 7759-1) were hybridized and washed at high stringency.

Mapping of SIM1

The Genebridge 4 radiation hybrid DNA panel (Walter et al. 1994) (HGMP resource center;http://www.hgmp.mrc.ac.uk/Public/Docs/Bio/Hybrid_panels.html) was tested by PCR for the presence of the specific 115-bp amplification fragment derived from the 3′ UTR of the SIM1 gene by use of oligonucleotides SIM1F 5′-TTCATGTAGATCAAGTGTGC-3′ and SIM1R 5′-ACCGCAGTTTGTAAGTAATA-3′. The scoring was performed on a 1.2% agarose gel and confirmed by hybridization. The data were submitted to and analyzed by the MIT Radiation Hybrid Mapping site (http://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl).

FISH was performed on Q-banded metaphase chromosomes of human B-lymphoblastoid cell line GM00130B as described (Wang et al. 1996) using BAC clone KB153H1 containing the human SIM1 gene as a probe. Human BAC clones KB153H1, containing the SIM1 gene, and KB594G10, containing the SIM2 gene, were isolated from a total human BAC library using a mouse Sim2 cDNA clone, mSIMλ1 (Yamaki et al. 1996), as a probe (Asakawa et al. 1997).

Acknowledgments

We thank M. Lalioti, K. Kawasaki, M.A. Morris, and all members of S.E.A.’s laboratory for assistance in some experiments and critical comments. We also thank Drs. C.-M. Fan and M. Tessier-Lavigne for providing mouse Sim1 and Sim2 sequences. The whole-mount in situ experiment was performed in the laboratory of D. Duboule. R.C. is a trainee of the Graduate Program of Molecular and Cellular Biology of the University of Geneva Medical School. This study was supported by grants from the Swiss Fonds National de Recherche Scientifique 31.33965.92 and 31-40500.94 and the European Union/Office Federal de l’Education et Sciences CT93-0015 and funds from the University and Cantonal Hospital of Geneva; by Grants in Aid for Scientific Research on Priority Areas and Scientific Research from the Ministry of Education, Science, Sports and Culture of Japan; and Funds for Human Genome Sequencing Project from the Japan Science and Technology Corporation. H.S.S. is supported by a C.J. Martin Fellowship from the National Health and Medical Research Council of Australia (957348).

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.

Footnotes

  • 5 Corresponding author.

  • E-MAIL sea{at}medsun.unige.ch; FAX 41 22 702 57 06.

    • Received February 6, 1997.
    • Accepted April 1, 1997.

REFERENCES

| Table of Contents

Preprint Server