|
|
|
|
Published online before print
October 25, 2006, 10.1101/gr.5085606 Genome Res. 16:1345-1351, 2006 ©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00 OPEN ACCESS ARTICLE
Letter Canonical TTAGG-repeat telomeres and telomerase in the honey bee, Apis mellifera1Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA; 2CSIRO Entomology, Canberra, ACT 2601 Australia
The draft assembly of the honey bee Apis mellifera genome sequence reveals that the 17 centromeric-distal telomeres are of a simple, shared, and canonical structure, with 34 kb of a unique subtelomeric sequence, followed by several kilobases of TTAGG or variant telomeric repeats. This simple subtelomeric structure differs from the centromeric-proximal telomeres on the short arms of the 15 acrocentric chromosomes, which are apparently composed primarily of the 176-bp AluI tandem repeat. This dichotomy between the distal and proximal telomeres may involve differential participation of the telomeres of the 15 acrocentric chromosomes in the Rabl configuration after mitosis and the chromosome bouquet in meiotic prophase I. As expected from the presence of canonical TTAGG telomeric repeats, we identified a candidate telomerase gene in the bee, as well as the silkmoth Bombyx mori and the flour beetle Tribolium castaneum.
Telomeres are fundamental structures at the ends of linear eukaryotic chromosomes. They have been the subject of intense study in diverse organisms since their discovery in the ciliate Tetrahymena thermophila (Greider and Blackburn 1985
Telomeres serve at least three major roles (Blackburn 2001
Among the insects in which the TTAGG has been identified are many Coleoptera (Sahara et al. 1999
In a number of insects with the TTAGG telomeric repeat, the expected telomerase activity has been shown to be present (Sasaki and Fujiwara 2000
The chromosomes of the honey bee A. mellifera have been studied cytologically for over 100 yr (see Beye and Moritz 1995
Here, we describe the 17 centromeric-distal telomeres from the 15 acrocentric chromosomes and both telomeres of the metacentric chromosome 1, and show that their organization differs from the apparent AluI tandem repeat structures of the centromeric-proximal telomeres visualized by Beye and Moritz (1995)
The 17 distal telomeres The first distal bee telomere was identified as a result of an effort to "superscaffold" the smallest chromosome, number 16, using manual methods and all available sources of evidence to evaluate the quality of the honey bee genome assembly v2 (Honey Bee Genome Sequencing Consortium 2006 The subtelomeric 3.5-kb region of chromosome 16 was used in BLASTN searches to identify the partially or fully assembled telomeric ends of 14 other chromosomes in Assembly v4 (Fig. 1). Twelve of the 17 telomeres have their subtelomeric regions partially or fully assembled at the ends of the terminally mapped scaffold. The scaffold containing the telomere of chromosome 15, Group15.40, was already mapped to near the chromosome end, but reoriented and placed unequivocally at the terminus using superscaffolding (genetic mapping of several short scaffolds near the telomere of this chromosome has no resolution, and Groups 15.41 and 15.42 are actually internal to both 15.39 and 15.40). The telomeric regions of four chromosomes were already assembled as unoriented and unmapped GroupUn scaffolds, and these were associated with their appropriate chromosomes (1R, 8, 9, and 13) by manual superscaffolding. The telomeres of chromosomes 2 and 13 were assembled manually off the existing ends of the chromosomes using mate pair information from the flanking unique sequences. Only three other short 12-kb contigs in assembly v4 match this subtelomeric region, Groups Un.2223, Un.8960, and Un.9020, which appear to be partial separately assembled haplotypes for chromosomes 3, 16, and 12, respectively, so we believe we have identified all 17 distal telomeres and that the 15 proximal telomeres of the acrocentric chromosomes do not contain this sequence.
The subtelomeric sequences of mammals and some other organisms consist of internally repeated segments, but there is little sign of such repeats in the bee subtelomeric regions. A region near the end of the subtelomere has a repeated structure, but it is a simple sequence repeat identified by dotplot analysis (data not shown). The start of the TTAGG tandem telomeric repeats is evident at the end of the subtelomeric sequence for 13 of the chromosomes, but confidently alignable high-quality reads do not quite extend this far for the other four (Fig. 1). In addition, for all but chromosome 5, at least one and commonly several confident mate pairs of the 24-kb insert clones extending off the ends of the assembled subtelomeres, or the flanking unique sequences, consist entirely of TTAGG or variant tandem telomeric repeats, demonstrating that each telomere terminal consists of at least several kilobases of this telomeric repeat. The most common variants are TCAGG and TTGGG. All available mate pairs off the ends of the subtelomeres consist entirely of these repeats, so there is no evidence for additional complexity to the telomere structures. The similarity of the subtelomeric sequence of 34 kb that allowed recognition of all of these telomeres also means that assembling them as unique sequences is sometimes difficult, and this was compounded by the presence of two haplotypes from the original queen bee DH4 (the genomic DNA of hundreds of drone sons was sequenced). The automated assembly therefore succeeded in subjugating this haplotype variation, at least for part of the subtelomeric sequence, for 14 of the 17 telomeres. Those subtelomeres that were partially assembled ended at a point of major haplotype divergence, usually a large indel. We attempted to assemble both haplotypes separately for these and the two subtelomeres that were not assembled (2 and 13); however, it is not always possible to assemble both haplotypes completely due to the relatively low-sequence coverage depth. We were most successful with chromosome 7, where the two haplotypes differ by 15% (counting all nucleotides in numerous indels). Differences between chromosomes range from 8% for 3 and 6%30% for 3 and 10. This range of sequence divergence is shown graphically in a distance phylogenetic tree of the 17 telomeres plus both chromosome 7 haplotypes (Fig. 2). Visual inspection of the CLUSTALX alignments suggested that the telomeres fell into two roughly equal size groups with considerable sequence divergence, and the tree reflects this, with the position of the telomere from chromosome 9 being somewhat ambiguous. Presumably, these groupings reflect a history of gene conversion between telomeres on different chromosomes, followed by private mutation histories for each. A detailed study of this gene conversion would require separate assembly of all 34 haplotypes, which is not possible with the current sequence read depth.
The chromosome 16 subtelomeric sequence began 6 kb from the nearest confident gene model, and no other candidate genes could be identified in this 6 kb. Remarkably, this is true for the other 16 telomeres, with confident gene models for proteins conserved in other insects and/or other animals starting or ending between 0.5 and 7 kb of the subtelomeric sequence (Fig. 1). None of these genes is closely related to each other, so the gene-conversion processes responsible for maintaining the homogeneity of the subtelomeric sequences do not extend beyond them.
A candidate insect telomerase
Identification of a candidate bee telomerase allowed us to identify orthologs in the available silk moth B. mori and flour beetle T. castaneum genomes. Most of the B. mori gene is in the 6-kb Ctg021552 (Accession AADK01021552) of the Chinese 6X assembly (Xia et al. 2004
The three candidate insect telomerases contain most, but not all, of the motifs identified in TERT proteins from other eukaryotes, and therefore differ from other known TERTs. The conserved motifs present in the insect enzymes include the seven motifs (1, 2, AE) defining the well-conserved core RT domain as shown in Figure 3. The insect TERTs also include the TERT-specific T motif that is located immediately upstream from the core RT domain (Fig. 3). This T motif is not found in non-TERT RTs; it is also absent from the Giardia lamblia TERT (Malik et al. 2000
Amino acid sequences comprising the core RT domains (Malik et al. 2000
Our identification of the 17 centromere-distal telomeres of honey bee as having the relatively simple canonical structure of a shared short 34-kb nonrepetitive subtelomeric sequence, followed by extensive TTAGG or variant tandem telomeric repeats, and the discovery of a candidate bee, moth, and beetle telomerase have several implications.
Fifteen of these subtelomeric regions were already at least partially assembled by the automated assembly program ATLAS (Honey Bee Genome Sequencing Consortium 2006
These 17 distal telomeres show no evidence of insertions of non-LTR or any other transposons. This is in keeping with observations from the genome assembly that non-LTR transposons are essentially absent (Honey Bee Genome Sequencing Consortium 2006
The subtelomeric regions of these distal bee telomeres are also quite different from those of Drosophila telomeres, not only as expected in sequence, but in their simple organization. Characterization of the 2L and X subtelomeric regions showed them to consist of many kilobases of at least three kinds of tandem repeats and, in fact, to be quite different from each other (Karpen and Spradling 1992
The distal telomeres are apparently quite different, at least in their subtelomeric region, from the 15 proximal telomeres. Sahara et al. (1999)
Thus, the proximal and distal honey bee telomeres appear to be quite different, at least in their subtelomeric regions. This difference might be important in the context of the Rabl configuration typical of chromosomes immediately after mitosis, in which the centromeres and telomeres are clustered at opposite poles of each new nucleus (for review, see Cowan et al. 2001
The candidate insect telomerase we have identified encoded in the Apis, Bombyx, and Tribolium genomes has roughly 20% sequence identity to the vertebrate, fungal, and plant telomerases across most of its length and most of the known conserved motifs of these telomerases. It is a unique protein in these insect genomes, and in agreement with previous studies, no homolog is encoded by the Drosophila or Anopheles genomes. The insect telomerases all carry the RT-specific domains and one of the major telomerase-specific domains identified by others. The insect telomerase sequences also contain some, but not all of the amino-terminal telomerase domains identified in the enzymes from vertebrates, fungi, and ciliates, but not nematodes or Giardia. Initial results from a genome-wide tiling array and quantitative RT/PCR indicate that the bee telomerase is expressed in most tissues at low levels (Honey Bee Genome Sequencing Consortium 2006
We have also sought other telomere- or telomerase-associated proteins (e.g., Vega et al. 2003
We used "manual superscaffolding" to connect four partially assembled subtelomeric regions to their appropriate chromosomes ends (1R, 8, 9, and 13), and to orient and position terminally the partially mapped assembled telomere of chromosome 15. The automated assembly was conservative in requiring at least two unconflicted mate pairs to join contigs into scaffolds, so we used single mate pairs to join scaffolds as well as existing unmerged overlaps between scaffolds built from the two alternative haplotypes in this genome sequence. The telomeres of chromosomes 2 and 13, and both haplotypes of chromosome 7 were manually assembled using mate pair information connected from their unique flanking sequences. All reads and mate pair information were from 14 kb insert plasmid clones at the Trace Archive at NCBI.
Phylogenetic analysis of the telomeres was performed using uncorrected distances in PAUP4* (Swofford 2001
We thank Kim Walden for amplification and sequencing of the C-terminal region of the bee telomerase gene and RACE experiments on telomerase cDNAs; Michel Solignac for sharing his 2000-microsatellite-marker linkage map used for mapping genome assembly v4 before publication and for specifically mapping the telomerase gene; the Baylor Human Genome Sequencing Center for access to the Apis mellifera and Tribolium castaneum draft genome sequences before publication; and Bill Gelbart for suggesting the connection of the different subtelomeric sequences to the Rabl configuration and meiotic bouquet. This work was supported by NIH grant AI56081 to H.M.R. and funding from the Grains Research and Development Corporation and the Cotton Research and Development Corporation in Australia to K.H.J.G.
3 Corresponding author.
E-mail hughrobe{at}uiuc.edu; fax (217) 244-3499. [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5085606.
Abad, J.P., de Pablos, B., Osoegawa, K., de Jong, P.J., Martin-Gallardo, A., and Villasante, A. 2004. Genomic analysis of Drosophila melanogaster telomeres: Full-length copies of HeT-A and TART elements at the telomeres. Mol. Biol. Evol. 21: 16131619. Adams, M.D., Celniker, S.E., Holt, R.A., Evans, C.A., Gocayne, J.D., Amanatides, P.G., Scherer, S.E., Li, P.W., Hoskins, R.A., and Galle, R.F., et al. 2000. The genome sequence of Drosophila melanogaster . Science 287: 21852195. Beye, M. and Moritz, R.F. 1995. Characterization of honeybee (Apis mellifera L.) chromosomes using repetitive DNA probes and fluorescence in situ hybridization. J. Hered. 86: 145150. Biessmann, H., Kobeski, F., Walter, M.F., Kasravi, A., and Roth, C.W. 1998. DNA organization and length polymorphism at the 2L telomeric region of Anopheles gambiae . Insect Mol. Biol. 7: 8393.[CrossRef][Medline] Biessmann, H., Prasad, S., Walter, M.F., and Mason, J.M. 2005. Euchromatic and heterochromatic domains at Drosophila telomeres. Biochem. Cell Biol. 83: 477485.[CrossRef][Medline] Blackburn, E.H. 2001. Switching and signaling at the telomere. Cell 106: 661673.[CrossRef][Medline] Boule, J.B., Vega, L.R., and Zakian, V.A. 2005. The yeast Pif1p helicase removes telomerase from telomeric DNA. Nature 438: 5761.[CrossRef][Medline] Casacuberta, E. and Pardue, M.L. 2005. HeT-A and TART, two Drosophila retrotransposons with a bona fide role in chromosome structure for more than 60 million years. Cytogenet. Genome Res. 110: 152159.[CrossRef][Medline] Cech, T.R. 2004. Beginning to understand the end of the chromosome. Cell 116: 273279.[CrossRef][Medline] Cenci, G., Ciapponi, L., and Gatti, M. 2005. The mechanism of telomere protection: A comparison between Drosophila and humans. Chromosoma 114: 135145.[CrossRef][Medline] Chen, J.-L. and Greider, C.W. 2004. An emerging consensus for telomerase RNA structure. Proc. Natl. Acad. Sci. 101: 1468314684. Cohn, M. and Edström, J.E. 1992. Chromosome ends in Chironomus pallidivittatus contain different subfamilies of telomere-associated repeats. Chromosoma 101: 634640.[CrossRef][Medline] Cowan, C.R., Carlton, P.M., and Cande, W.Z. 2001. The polar arrangement of telomeres in interphase and meiosis. Rabl organization and the bouquet. Plant Physiol. 125: 532538. Fajkus, J., Sykorova, E., and Leitch, A.R. 2005. Telomeres in evolution and evolution of telomeres. Chromosome Res. 13: 469479.[CrossRef][Medline] Felsenstein, J. 1989. PHYLIPPhylogeny inference package (Version 3.2). Cladistics 5: 164166. Frydrychova, R., Grossmann, P., Trubac, P., Vitkova, M., and Marec, F. 2004. Phylogenetic distribution of TTAGG telomeric repeats in insects. Genome 47: 163178.[Medline] Fujiwara, H., Osanai, M., Matsumoto, T., and Kojima, K.K. 2005. Telomere-specific non-LTR retrotransposons and telomere maintenance in the silkwormBombyx mori. Chromosome Res. 13: 455467.[CrossRef][Medline] Greider, C.W. 1996. Telomere length regulation. Annu. Rev. Biochem. 65: 337365.[CrossRef][Medline] Greider, C.W. and Blackburn, E.H. 1985. Identification of a specific telomere terminal transferase activity in Tetrahymena extracts. Cell 43: 405413.[CrossRef][Medline] The Honey Bee Genome Sequencing Consortium 2006. Insights into social insects from the genome of the honey bee Apis mellifera . Nature in press. Karpen, G.H. and Spradling, A.C. 1992. Analysis of subtelomeric heterochromatin in the Drosophila minichromosome Dp1187 by single-P element insertional mutagenesis. Genetics 132: 737753.[Abstract] Kojima, K.K., Kubo, Y., and Fujiwara, H. 2002. Complex and tandem repeat structure of subtelomeric regions in the Taiwan cricket, Teleogryllus taiwanemma. J. Mol. Evol. 54: 474485.[CrossRef][Medline] Kuramoto, M., Ohsumi, K., Kishimoto, T., and Ishikawa, F. 2001. Identification and analyses of the Xenopus TERT gene that encodes the catalytic subunit of telomerase. Gene 277: 101110.[CrossRef][Medline] Lin, J., Ly, H., Hussain, A., Abraham, M., Pearl, S., Tzafati, Y., Parslow, T.G., and Blackburn, E.H. 2004. A universal telomerase RNA core structure includes structured motifs required for binding the telomerase reverse transcriptase protein. Proc. Natl. Acad. Sci. 101: 1471314718. Lorite, P., Carrillo, J.A., and Palomque, T. 2002. Conservation of (TTAGG)n telomeric sequences among ants (Hymenoptera, Formicidae). J. Hered. 93: 282285. Malik, H.S., Burke, W.D., and Eickbush, T.H. 2000. Putative telomerase catalytic subunits from Giardia lamblia and Caenorhabditis elegans . Gene 251: 101108.[CrossRef][Medline] Melnikova, L. and Georgiev, P. 2005. Drosophila telomeres: The non-telomerase alternative. Chromosome Res. 13: 431441.[CrossRef][Medline] Meyne, J., Ratliff, R.L., and Moyzis, R.K. 1989. Conservation of the human telomere sequence (TTAGGG)n among vertebrates. Proc. Natl. Acad. Sci. 86: 70497053. Mita, K., Kasahara, M., Sasaki, S., Nagayasu, Y., Yamada, T., Kanamori, H., Namiki, N., Kitagawa, M., Yamashita, H., and Yasukochi, Y., et al. 2004. The genome sequence of silkworm, Bombyx mori. DNA Res. 11: 2735.[Abstract] Nakamura, T.M., Morin, G.B., Chapman, K.B., Weinrich, S.L., Andrews, W.H., Lingner, J., Harley, C.B., and Cech, T.R. 1997. Telomerase catalytic subunit homologs from fission yeast and human. Science 277: 955959. Oguchi, K., Tamura, K., and Takahashi, H. 2004. Characterization of Oryza sativa telomerase reverse transcriptase and possible role of its phosphorylation in the control of telomerase activity. Gene 342: 5766.[CrossRef][Medline] Okazaki, S., Tsuchida, K., Maekawa, H., Ishikawa, H., and Fujiwara, H. 1993. Identification of a pentanucleotide telomeric sequence, TTAGG.n, in the silkworm Bombyx mori and in other insects. Mol. Cell. Biol. 13: 14241432. Pardue, M.L. and DeBaryshe, P.G. 1999. Telomeres and telomerase: More than the end of the line. Chromosoma 108: 7382.[CrossRef][Medline] Pardue, M.L., Brown, D.D., and Birnstiel, M.L. 1973. Location of the genes for 5S ribosomal RNA in Xenopus laevis . Chromosoma 42: 191203.[CrossRef][Medline] Rosen, M. and Edström, J.E. 2000. DNA structures common for chironomid telomeres terminating with complex repeats. Insect Mol. Biol. 9: 314347. Sahara, K., Marec, F., and Traut, W. 1999. TTAGG telomeric repeats in chromosomes of some insects and other arthropods. Chromosome Res. 7: 449460.[CrossRef][Medline] Saiga, H. and Edström, J.-E. 1985. Long tandem arrays of complex repeat units in Chironomus telomeres. EMBO J. 4: 799804.[Medline] Sasaki, T. and Fujiwara, H. 2000. Detection and distribution patterns of telomerase activity in insects. Eur. J. Biochem. 267: 30253031.[Medline] Scherthan, H. 2001. A bouquet makes ends meet. Nat. Rev. Mol. Cell Biol. 2: 621627.[CrossRef][Medline] Schulz, V.P. and Zakian, V.A. 1994. The Saccharomyces PIF1 DNA helicase inhibits telomere elongation and de novo telomere formation. Cell 76: 145155.[CrossRef][Medline] Smogorzewska, A. and de Lange, T. 2004. Regulation of telomerase by telomeric proteins. Annu. Rev. Biochem. 73: 177208.[CrossRef][Medline] Solignac, M., Vautrin, D., Baudry, E., Mougel, F., Loiseau, A., and Cornuet, J.M. 2004. A microsatellite-based linkage map of the honeybee, Apis mellifera L. Genetics 167: 253262. Swofford, D.L. 2001. PAUP*: Phylogenetic analysis using parsimony and other methods. Version 4. Sinauer Press, New York. Tares, S., Cornuet, J.M., and Abad, P. 1993. Characterization of an unusually conserved AluI highly reiterated DNA sequence family from the honeybee, Apis mellifera. Genetics 134: 11951204.[Abstract] Thomson, J.D., Higgins, D.J., and Gibson, T.J. 1994. CLUSTAL W. Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nucleic Acids Res. 22: 46734680. Vega, L.R., Mateyak, M.K., and Zakian, V.A. 2003. Getting to the end: Telomerase access in yeast and humans. Nat. Rev. Mol. Cell Biol. 4: 948959.[CrossRef][Medline] Vitkova, M., Kral, J., Traut, W., Zrzavy, J., and Marec, F. 2005. The evolutionary origin of insect telomeric repeats, (TTAGG)n. Chromosome Res. 13: 145156.[CrossRef][Medline] Walter, M.F., Jang, C., and Kasravi, B. 1995. DNA organization and polymorphism of a wild-type Drosophila telomere region. Chromosoma 104: 229241.[Medline] Xia, J., Peng, Y., Mian, I.S., and Lue, N.F. 2000. Identification of functionally important domains in the N-terminal region of telomerase reverse transcriptase. Mol. Cell. Biol. 20: 51965207. Xia, Q., Zhou, Z., Lu, C., Cheng, D., Dai, F., Li, B., Zhao, P., Zha, X., Cheng, T., and Chai, C., et al. 2004. A draft sequence for the genome of the domesticated silkworm Bombyx mori . Science 306: 19371940. Yap, W.H., Yeoh, E., Brenner, S., and Venkatesh, B. 2005. Cloning and expression of the reverse transcriptase component of pufferfish (Fugu rubripes) telomerase. Gene 353: 207217.[CrossRef][Medline] Zdobnov, E.M., von Mering, C., Letunic, I., Torrents, D., Suyama, M., Copley, R.R., Christophides, G.K., Thomasova, D., Holt, R.A., and Subramanian, G.M., et al. 2002. Comparative genome and proteome analysis of Anopheles gambiae and Drosophila melanogaster . Science 298: 149159. Zhou, J., Monson, E.K., Teng, S.C., Schulz, V.P., and Zakian, V.A. 2000. Pif1p helicase, a catalytic inhibitor of telomerase in yeast. Science 289: 771774.
Received December 21, 2005; accepted in revised format May 31, 2006. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||