|
|
|
|
Genome Res. 14:1686-1695, 2004 ©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00 Letter Comparative Analysis of Apicomplexa and Genomic Diversity in Eukaryotes1 Department of Microbiology and Immunology, Weill Medical College and the Program in Immunology and Microbial Pathogenesis, Weill Graduate School of Medical Sciences of Cornell University, New York, New York 10021, USA 2 National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA 3 Department of Veterinary Pathobiology, University of Minnesota, St. Paul, Minnesota 55108, USA 4 Biomedical Genomics Center, University of Minnesota, St. Paul, Minnesota 55108, USA 5 Human Genome Sciences, Rockville, Maryland 20850, USA 6 Sanaria Inc., Rockville, Maryland 20852, USA
The apicomplexans Plasmodium and Cryptosporidium have developed distinctive adaptations via lineage-specific gene loss and gene innovation in the process of diverging from a common parasitic ancestor. The two lineages have acquired distinct but overlapping sets of surface protein adhesion domains typical of animal proteins, but in no case do they share multidomain architectures identical to animals. Cryptosporidium, but not Plasmodium, possesses an animal-type O-linked glycosylation pathway, along with >30 predicted surface proteins having mucin-like segments. The two parasites have notable qualitative differences in conserved protein architectures associated with chromatin dynamics and transcription. Cryptosporidium shows considerable reduction in the number of introns and a concomitant loss of spliceosomal machinery components. We also describe additional molecular characteristics distinguishing Apicomplexa from other eukaryotes for which complete genome sequences are available.
The availability of two apicomplexan complete genome sequences, Plasmodium (Gardner et al. 2002
The current phylogenetic analysis of the characterized Apicomplexa suggest a basal position for Cryptosporidium and the gregarines with respect to a poorly defined "crown group" composed of hemosporidians and coccidians (Carreno et al. 1999
The Relationship of Apicomplexans to Other Eukaryotes and the Degree of Relatedness of the Apicomplexan Proteomes To obtain a robust phylogenetic model for the relationship of apicomplexans with other eukaryotes having complete genome sequences, we prepared a concatenated multiple alignment (see Supplemental data 1) of >30 conserved C. parvum proteins, such as ribosomal proteins, DNA and RNA polymerases, translation factors, and tRNA synthetases having orthologs in Plasmodium (Apicomplexa); Arabidopsis (plants); Caenorhabditis, Drosophila and Homo (animals); Neurospora, Saccharomyces, and Schizosaccharomyces (fungi); Giardia (Parabasalids); and Aeropyrum and Archaeoglobus (Archaea). This multiple alignment, spanning >4000 aligned positions, was used to compute maximum likelihood, maximum parsimony, neighbor joining, and least squares trees, all rooted using the archaeal sequences. These methods uniformly yield a tree topology with Plasmodium and Cryptosporidium forming a monophyletic lineage lying outside of a strongly supported "crown group" composed of animals, fungi, and plants (Fig. 1A). Giardia occupies a basal position amidst the eukaryotes included in this analysis. This topology is also supported by domain architecture analysis of 400 proteins belonging to different functional categories as discrete characters. Previous reports propose a weak association between the "plant clade" (comprised of green plants, rhodophytes, and glaucocystophytes) and a large assemblage of eukaryotes that include Stramenopiles and Alveolates (including Apicomplexa; Baldauf et al. 2000
To determine the relatedness of the apicomplexan proteome, and to provide a quantitative measurement of proteome similarity and divergence across protein functional categories, we used a simple measure termed the orthology coefficient (OC). For a set of proteins from two compared organisms, the OC represents the fraction occurring in orthologous groups. Thus, an OC = 1 indicates that all the proteins within a compared set have an orthologous relationship. Likewise, if only a fraction of the proteins in the set form orthologous groups, then the OC would fall between 1 and 0. Plasmodium and C. parvum shared 2000 orthologous groups, with an overall orthology coefficient of 0.41 (Fig. 2A). This suggests that both parasites possess a significant complement of genes that do not have any orthologous representatives in the other. We further defined OCs for protein sets classified into functional categories (Fig. 2A), revealing that orthology coefficients span a striking range from 0.2 to 0.4 for proteins related to extracellular adhesion and surface protein glycosylation, and 0.8 to 0.85 for core cellular functions such as translation, RNA processing, ubiquitination, DNA repair/replication, and chromatin dynamics. This suggests that evolutionary divergence of the two parasites has differentially affected various functional classes (Fig. 2A).
Many differences in ortholog distribution could be attributed to gene loss accompanying mitochondrion or apicoplast organellar degradation, or elimination of metabolic pathways in C. parvum. For example, versions of the tRNA synthetases and DNA repair enzymes, present in Plasmodium but lost in Cryptosporidium, likely represent forms with mitochondrion- and apicoplast-specific functions in Plasmodium. This prediction, based on patterns of gene loss, is corroborated by the presence of long N-terminal extensions mediating organellar targeting only in the Plasmodium versions of these proteins. Qualitative differences between the apicomplexan lineages occur even in the high-OC-value protein sets corresponding to core cellular processes. When the demography of the conserved domains in the two apicomplexan proteomes were compared to each other and against Saccharomyces cerevisiae, a unicellular eukaryote with a roughly comparable number of protein-coding genes, certain interesting large-scale trends were observed (Fig. 2B,C). The two apicomplexans show independent lineage-specific expansions of entirely different protease families, and C. parvum does not share the prominent lineage-specific expansion of RESA-type DnaJ domains that is encountered in Plasmodium falciparum (Fig. 2B,C). Relative to yeast, Cryptosporidium, like Plasmodium (Aravind et al. 2003
Functional Categories With Low OC Values: Surface Proteins and the Glycosylation Machinery
We systematically investigated the affinities of the surface protein domains by searching the C. parvum proteome with a comprehensive library of PSI-BLAST-derived position-specific score matrices and hidden Markov models for surface protein domains. These profiles were previously used to detect such domains in adhesion proteins of Caenorhabditis elegans, Homo sapiens, and P. falciparum (Aravind and Subramanian 1999
The surface protein adhesion domains in the apicomplexan proteomes can be attributed to multiple distinct heritages: those originally derived from bacteria and animals and laterally transferred to Apicomplexa, and those "invented" within the Apicomplexa, typically in a lineage-specific manner. In principle, it is possible that the surface protein domains shared by animals and apicomplexans were present in the ancestral eukaryotes and secondarily lost in other lineages. Although gene loss occurs frequently in eukaryotes, most of these domains shared by both these lineages are often undetectable in the (nearly) completely sequenced genomes of multicellular (filamentous) fungi, plants, and other unicellular eukaryotes such as trypanosomes and parabasalids. Thus, if multiple gene losses were to be invoked, it would imply that the common ancestor of these lineages was probably more diverse in its protein complement than most of the descendents, and this is not consistent with the demography of the protein families encoded by eukaryotic genomes (Lespinet et al. 2002
The plausibility of horizontal transfer from an animal source is also supported by the intracellular location of apicomplexan parasites for most of their life cycle and, in the case of Plasmodium, there is facile uptake and expression of DNA constructs introduced to the erythrocyte cytoplasm prior to parasite infection (Deitsch et al. 2001
For P. falciparum and C. parvum surface proteins, the OC is no more than 0.2 for proteins having conserved "animal-type" and "bacterial-type" domains, with few conserved architectures shared by the two lineages. Furthermore, the set of surface domains in these proteins is overlapping but not identical in the apicomplexans. For example, the MAC/perforin-type domain (Aravind et al. 2003
Comparison of the surface protein glycosylation apparatus reveals dramatic divergence between these two apicomplexans (Fig. 3B). Both possess a well-developed GPI anchor synthesis apparatus that is largely similar to the corresponding pathway in other eukaryotes. Unlike Plasmodium, Cryptosporidium lacks the canonical N-acetylglucosaminylphosphatidylinositol deacetylase that catalyzes the second step in the GPI anchor biosynthetic pathway. However, sequence analysis revealed the presence of an unrelated bacterial-type sugar deacetylase (cgd1_3060) that is likely to catalyze the same reaction. Whereas there is only a rudimentary N-linked glycosylation pathway present in Plasmodium, a more developed pathway is predicted in Cryptosporidium. N-linked glycosylation has been widely detected in other eukaryotes, including Toxoplasma (Odenthal-Schnittler et al. 1993
Low OC of Metabolic Pathway Components Suggest Life-Cycle-Specific Adaptations
The stem of the glycolytic pathway represents the most highly conserved metabolic pathway between Cryptosporidium and Plasmodium. However, unlike Plasmodium, Cryptosporidium also possesses enzymes for the terminal metabolism of pyruvate such as pyruvate:ferredoxin oxidoreductase, pyruvate decarboxylase, and malate dehydrogenase. Phylogenetic analysis of pathway components reveals a mosaic of strong affinities to enzyme versions of plants and bacteria. For example, the apicomplexan phosphoglucomutase, phosphofructokinase, and enolase enzymes grouped with the plant versions, whereas fructose bisphosphate phosphatase and phosphoglucomutase showed bacterial affinities. These affinities suggest displacements of the ancestral eukaryotic enzymes by versions derived from the apicoplast precursor and bacterial sources. Nevertheless, the current state of the data precludes us from determining the temporal point in alveolate evolution at which these displacements occurred. Interestingly, similar to the parasite Leishmania, C. parvum possesses a plant-type 2-phosphoglycerate kinase implicated in archaea in the synthesis of the possible denaturation protectant, 2-3 cyclic phosphoglyceric acid (Matussek et al. 1998
Differences Between Cryptosporidium and Plasmodium in Functional Classes With Moderate to High OCs, and Comparisons With Other Eukaryotes
The ratio of the total number of proteins in the proteome to the predicted specific transcription factors in Cryptosporidium and Plasmodium, 340 and 800, respectively, is in great contrast to the ratio of 29 in S. cerevisiae. The decreased ratio in C. parvum relative to Plasmodium is caused by a greater absolute number of specific transcription factors possessing a variety of conserved DNA-binding domains, such as E2F/DP1, bZip, and GATA DNA-binding domains, in conjunction with a lower overall gene count (Fig. 2B,C). Nevertheless, the numbers of specific transcription factors are far fewer than those encountered in yeast and other eukaryotes, suggesting major differences in the mechanisms of apicomplexan gene regulation. Recent microarray studies on P. falciparum indicate a continuous cascade of gene expression in the course of its intraerythrocytic stage cycle, in which groups of functionally related genes are coexpressed, with those involved in generalized functions being expressed first followed by those for increasingly specialized, lineage-specific functions (Bozdech et al. 2003
Although there is a much higher correspondence between the two apicomplexans in chromatin proteins than specific transcription factors, interesting differences are found in both absolute numbers and architectures of these proteins (Fig. 4). C. parvum has 14 chromatin-remodeling SNF2/SWI2 ATPases, whereas Plasmodium has just 11. Comparisons of the apicomplexan Swi2/Snf2 ATPases with the other eukaryotes suggests that Plasmodium has lost the Rad26 and Swr1 orthologs, whereas Cryptosporidium appears to have lost one of the Rad16/Rad5-like Ring finger-containing forms and the version fused to a C-terminal Endonuclease VII domain. Cryptosporidium possesses a version of the Swi2/Snf2 ATPase, with a unique architecture containing two N-terminal chromo domains and one bromo domain. Likewise, Plasmodium possesses a unique predicted chromatin-associated protein, having an amine oxidase domain fused to a C-terminal PHD finger that is predicted to function as a novel enzyme that might modify histone amino groups or chromatin basic amines (Fig. 4). It may represent a case of an independently derived chromatin-associated oxidase, parallel to the amine oxidase fused to the SWIRM domain that is seen in the crown group eukaryotes (Aravind and Iyer 2002
The pellicle in several protozoans is supported by a distinctive fibrous cytoskeletal structure predominantly composed of low complexity, proline- and valine-rich proteins called articulins (Mann and Beckers 2001
Conclusions However, beyond the core set of genes, the evolution of parasitism has involved lineage-specific adaptation occurring through gene loss, additional lateral transfers, and lineage-specific expansions. Considerable lineage-specific gene loss is indicated by the absence of widespread eukaryotic proteins specifically in either one of the apicomplexan lineages (Fig. 4), suggesting that the common ancestral apicomplexan possessed a more complex genome encoding a greater repertoire of biochemical activities. The streamlining appears to correlate with increasing propensity of the parasite to use its host(s) for most of its metabolic requirements. Some of the losses are not easily explained: for example, in Cryptosporidium, the apparent elimination of Skp1p and Skp2p-like proteins, despite the presence of cullins, suggests potential differences in the cell cycle related ubiquitination complexes relative to other eukaryotes. Thus, as a consequence of gene losses and lineage-specific innovations, apicomplexans possess proteomes quite different from free-living unicellular eukaryotes that have similar overall gene numbers (Fig. 2A-D). Most notably, the apicomplexan proteomes have a large component devoted to pathogenesis, immune evasion, and adhesion rather than transcription, posttranscriptional regulation, or metabolism.
The apicomplexans confirm large-scale trends in the evolution of the eukaryotes, specifically the involvement of lineage-specific expansions in the generation of specific adaptations (Abrahamsen et al. 2004
Sequence Analysis and Phylogenetic Tree Constructions C. parvum genome sequence information and annotation supporting this manuscript are available on an in-house genome browser (http://134.84.110.219/cgi-bin/gbrowse/crypto909) and at (http://www.cryptodb.org). General methodologies supporting genome sequence annotation, including BLAST searches, multiple sequence alignments, protein structure determinations, gene family clustering, and phylogenetic analyses were performed as briefly follows. The nonredundant (NR) database of protein sequences (National Center for Biotechnology Information, NIH, November 21, 2003) was searched using the BLASTP program. Profile searches were conducted using the PSI-BLAST program (Altschul et al. 1997
Phylogenetic analyses were carried out using the maximum-likelihood, neighbor-joining, protein parsimony, and least-squares methods (Felsenstein 1989
Calculation of Orthology Coefficients (OC)
This work was supported in part by the Niarchos Foundation. The Department of Microbiology and Immunology at Weill Medical College acknowledges the support of the William Randolph Hearst Foundation. This study utilized the high-performance computational capabilities of the Biowulf PC/Linux cluster at the National Institutes of Health, Bethesda, MD. (http://biowulf.nih.gov). The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
7 These authors contributed equally to this work.
8 Corresponding authors. [Supplemental material is available online at www.genome.org.] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2615304.
Abrahamsen, M.S., Templeton, T.J., Enomoto, S., Abrahante, J.E., Zhu, G., Lancto, C.A., Deng, M., Liu, C., Widmer, G., Tzipori, Z., et al. 2004. The complete genome sequence of the apicomplexan, Cryptosporidium parvum. Science 304: 441-445.
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402. Aravind, L. and Iyer, L.M. 2002. The SWIRM domain: A conserved module found in chromosomal proteins points to novel chromatin-modifying activities. Genome Biol. 3: RESEARCH0039. Aravind, L. and Subramanian, G. 1999. Origin of multicellular eukaryotesInsights from proteome comparisons. Curr. Opin. Genet. Dev. 9: 688-694.[CrossRef][Medline]
Aravind, L., Watanabe, H., Lipman, D.J., and Koonin, E.V. 2000. Lineage-specific loss and divergence of functionally linked genes in eukaryotes. Proc. Natl. Acad. Sci. 97: 11319-11324. Aravind, L., Iyer, L.M., Wellems, T.E., and Miller, L.H. 2003. Plasmodium biology: Genomic gleanings. Cell 115: 771-785.[CrossRef][Medline] Baker, D.A., Thompson, J., Daramola, O.O., Carlton, J.M., and Targett, G.A. 1995. Sexual-stage-specific RNA expression of a new Plasmodium falciparum gene detected by in situ hybridization. Mol. Biochem. Parasitol. 72: 193-201.[CrossRef][Medline]
Baldauf, S.L., Roger, A.J., Wenk-Siefert, I., and Doolittle, W.F. 2000. A kingdom-level phylogeny of eukaryotes based on combined protein data. Science 290: 972-977. Baruch, D.I., Pasloske, B.L., Singh, H.B., Bi, X., Ma, X.C., Feldman, M., Taraschi, T.F., and Howard, R.J. 1995. Cloning the P. falciparum gene encoding PfEMP1, a malarial antigen and adherence receptor on the surface of parasitized human erythrocytes. Cell 82: 77-87.[CrossRef][Medline] Bateman, A., Birney, E., Cerruti, L., Durbin, R., Etwiller, L., Eddy, S.R., Griffiths-Jones, S., Howe, K.L., Marshall, M., and Sonnhammer, E.L. 2002. The Pfam protein families database. Nucleic Acids Res. 30: 2762-2800. Bozdech, Z., Llinas, M., Pulliam, B.L., Wong, E.D., Zhu, J., and DeRisi, J.L. 2003. The transcriptome of the intraerythrocytic developmental cycle of Plasmodium falciparum. PLoS Biol. 1: E5.[Medline]
Bruno, W.J., Socci, N.D., and Halpern, A.L. 2000. Weighted neighbor joining: A likelihood-based approach to distance-based phylogeny reconstruction. Mol. Biol. Evol. 17: 189-197. Carreno, R.A., Martin, D.S., and Barta, J.R. 1999. Cryptosporidium is more closely related to the gregarines than to coccidia as shown by phylogenetic analysis of Apicomplexan parasites inferred using small-subunit ribosomal RNA gene sequences. Parasitol. Res. 85: 899-904.[CrossRef][Medline] Cheng, Q., Cloonan, N., Fischer, K., Thompson, J., Waine, G., Lanzer, M., and Saul, A. 1998. stevor and rif are Plasmodium falciparum multicopy gene families which potentially encode variant antigens. Mol. Biochem. Parasitol. 97: 161-176.[CrossRef][Medline]
Chervitz, S.A., Aravind, L., Sherlock, G., Ball, C.A., Koonin, E.V., Dwight, S.S., Harris, M.A., Dolinski, K., Mohr, S., Smith, T., et al. 1998. Comparison of the complete protein sets of worm and yeast: Orthology and divergence. Science 282: 2022-2028.
Claros, M.G. and von Heijne, G. 1994. TopPred II: An improved software for membrane protein structure predictions. Comput. Appl. Biosci. 10: 685-686.
Deitsch, K., Driskill, C., and Wellems, T. 2001. Transformation of malaria parasites by the spontaneous uptake and expression of DNA from human erythrocytes. Nucleic Acids Res. 29: 850-853.
Eddy, S.R. 1998. Profile hidden Markov models. Bioinformatics 14: 755-763. Felsenstein, J. 1989. PHYLIPPhylogeny inference package (Version 3.2). Cladistics 5: 164-166. ____. 1996. Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods. Methods Enzymol. 266: 418-427.[Medline]
Gardner, M.J., Tettlin, H., Carucci, D.J., Cummings, L.M., Aravind, L., Koonin, E.V., Shallom, S., Mason, T., Yu, K., Fujii, C., et al. 1998. Chromosome 2 sequence of the human malaria parasite Plasmodium falciparum. Science 282: 1126-1132. Gardner, M.J., Hall, N., Fung, E., White, O., Berriman, M., Hyman, R.W., Carlton, J.M., Pain, A., Nelson, K.E., Bowman, S., et al. 2002. Genome sequence of the human malaria parasite Plasmodium falciparum. Nature 419: 498-511.[CrossRef][Medline] Hall, T.M., Porter, J.A., Young, K.E., Koonin, E.V., Beachy, P.A., and Leahy, D.J. 1997. Crystal structure of a Hedgehog autoprocessing domain: Homology between Hedgehog and self-splicing proteins. Cell 91: 85-97.[CrossRef][Medline] Hasegawa, M., Kishino, H., and Saitou, N. 1991. On the maximum likelihood method in molecular phylogenetics. J. Mol. Evol. 32: 443-445.[CrossRef][Medline] Krogh, A., Larsson, B., von Heijne, G., and Sonnhammer, E. L. 2001. Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J. Mol. Biol. 305: 567-580.[CrossRef][Medline]
Kumar, S., Tamura, K., Jakobsen, I.B., and Nei, M. 2001. MEGA2: Molecular evolutionary genetics analysis software. Bioinformatics 17: 1244-1245. Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., Fitzhugh, W., et al. 2001. Initial sequencing and analysis of the human genome. Nature 409: 860-921.[CrossRef][Medline] Laybourn-Parry, J. 1984. A functional biology of the free-living protozoa. University of California Press, Berkeley, CA.
Le Roch, K.G., Zhou, Y., Blair, P.L., Grainger, M., Moch, J.K., Haynes, J.D., De La Vega, P., Holder, A.A., Batalov, S., Carucci, D.J., et al. 2003. Discovery of gene function by expression profiling of the malaria parasite life cycle. Science 301: 1503-1508.
Lespinet, O., Wolf, Y.I., Koonin, E.V., and Aravind, L. 2002. The role of lineage-specific gene family expansion in the evolution of eukaryotes. Genome Res. 12: 1048-1059. Mann, T. and Beckers, C. 2001. Characterization of the subpellicular network, a filamentous membrane skeletal component in the parasite Toxoplasma gondii. Mol. Biochem. Parasitol. 115: 257-268.[CrossRef][Medline]
Matussek, K., Moritz, P., Brunner, N., Eckerskorn, C., and Hensel, R. 1998. Cloning, sequencing, and expression of the gene encoding cyclic 2,3-diphosphoglycerate synthetase, the key enzyme of cyclic 2,3-diphosphoglycerate metabolism in Methanothermus fervidus. J. Bacteriol. 180: 5997-6004.
Nielsen, H., Engelbrecht, J., Brunak, S., and von Heijne, G. 1997. Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng. 10: 1-6. Notredame, C., Higgins, D.G., and Heringa, J. 2000. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302: 205-217.[CrossRef][Medline] Odenthal-Schnittler, M., Tomavo, S., Becker, D., Dubremetz, J.F., and Schwarz, R.T. 1993. Evidence of N-linked glycosylation in Toxoplasma gondii. Biochem. J. 291: 713-721.
Peterson, D.S., Miller, L.H., and Wellems, T.E. 1995. Isolation of multiple sequences from the Plasmodium falciparum genome that encode conserved domains homologous to those in erythrocyte-binding proteins. Proc. Natl. Acad. Sci. 92: 7100-7104. Pollack, Y., Kogan, N., and Golenser, J. 1991. Plasmodium falciparum: Evidence for a DNA methylation pattern. Exp. Parasitol. 72: 339-344.[CrossRef][Medline] Ponting, C.P., Aravind, L., Schultz, J., Bork, P., and Koonin, E.V. 1999. Eukaryotic signalling domain homologues in archaea and bacteria. Ancient ancestry and horizontal gene transfer. J. Mol. Biol. 289: 729-745.[CrossRef][Medline]
Pradel, G., Hayton, K., Aravind, L., Iyer, L.M., Abrahamsen, M.S., Bonawitz, A., Mejia, C., and Templeton, T.J. 2004. A multidomain adhesion protein family expressed in Plasmodium falciparum is essential for transmission to the mosquito. J. Exp. Med. 199: 1533-1544. Reeves, P.R., Hobbs, M., Valvano, M.A., Skurnik, M., Whitfield, C., Coplin, D., Kido, N., Klena, J., Maskell, D., Raetz, C.R., et al. 1996. Bacterial polysaccharide synthesis and gene nomenclature. Trends Microbiol. 4: 495-503.[CrossRef][Medline]
Ritte, G., Lloyd, J.R., Eckermann, N., Rottmann, A., Kossmann, J., and Steup, M. 2002. The starch-related R1 protein is an
Ronquist, F. and Huelsenbeck, J.P. 2003. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19: 1572-1574. Rost, B., Fariselli, P., and Casadio, R. 1996. Topology prediction for helical transmembrane proteins at 86% accuracy. Protein Sci. 5: 1704-1718.[Abstract]
Schmidt, H.A., Strimmer, K., Vingron, M., and von Haeseler, A. 2002. TREE-PUZZLE: Maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18: 502-504. Smith, J.D., Chitnis, C.E., Craig, A.G., Roberts, D.J., Hudson-Taylor, D.E., Peterson, D.S., Pinches, R., Newbold, C.I., and Miller, L.H. 1995. Switches in expression of Plasmodium falciparum var genes correlate with changes in antigenic and cytoadherent phenotypes of infected erythrocytes. Cell 82: 101-110.[CrossRef][Medline] Su, X.Z., Heatwole, V.M., Wertheimer, S.P., Guinet, F., Herrfeldt, J.A., Peterson, D.S., Ravetch, J.A., and Wellems, T.E. 1995. The large diverse gene family var encodes proteins involved in cytoadherence and antigenic variation of Plasmodium falciparum-infected erythrocytes. Cell 82: 89-100.[CrossRef][Medline]
Subramanian, G., Koonin, E.V., and Aravind, L. 2000. Comparative genome analysis of the pathogenic spirochetes Borrelia burgdorferi and Treponema pallidum. Infect. Immun. 68: 1633-1648.
Tang, L.Y., Reddy, M.N., Rasheva, V., Lee, T.L., Lin, M.J., Hung, M.S., and Shen, C.K. 2003. The eukaryotic DNMT2 genes encode a new class of cytosine-5 DNA methyltransferases. J. Biol. Chem. 278: 33613-33616.
Templeton, T.J., Lancto, C.A., Vigdorovich, V., Liu, C., London, N.R., Hadsell, K.Z., and Abrahamsen, M.S. 2004. The Cryptosporidium oocyst wall protein is a member of a multigene family and has a homolog in Toxoplasma. Infect. Immun. 72: 980-987. Varki, A., Cummings, R., Esko, J., Freeze, H., Hart, G., and Marth, J. 1999. Essentials of glycobiology. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Zhu, G., Keithly, J.S., and Philippe, H. 2000. What is the phylogenetic position of Cryptosporidium? Int. J. Syst. Evol. Microbiol. 50: 1673-1681.[Abstract]
ftp://ftp.ncbi.nih.gov/blast/documents/blastclust.txt; BLASTCLUST. http://134.84.110.219/cgi-bin/gbrowse/crypto909; C. parvum genome sequence information and annotation. http://bioweb.pasteur.fr/seqanal/interfaces/toppred.html; TOPRED1.0. http://www.cbs.dtu.dk/services/SignalP-2.0/; SIGNALP. http://www.cryptodb.org; C. parvum genome sequence information and annotation. http://www.ncbi.nlm.nih.gov/CBBresearch/Walker/SEALS/index.html; SEALS. http://www.sanger.ac.uk/Software/Pfam/index.shtml; PFAM database. http://www.tree-puzzle.de; TreePuzzle 4.02. http://biowulf.nih.gov; Biowulf processing system at the National Institutes of Health.
Received March 23, 2004; accepted in revised format June 14, 2004. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||