Novel Protein Domains and Repeats in Drosophila melanogaster: Insights into Structure, Function, and Evolution

  1. Chris P. Ponting1,4,
  2. Richard Mott2,
  3. Peer Bork3, and
  4. Richard R. Copley3
  1. 1MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, UK; 2Wellcome Trust Centre for Human Genetics, Oxford OX3 7BN, UK; 3European Molecular Biology Laboratory, 69012 Heidelberg, Germany

Abstract

Sequence database searching methods such asBLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and ‘nested’ homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.

Footnotes

  • 4 Corresponding author.

  • E-MAIL Chris.Ponting{at}anat.ox.ac.uk; FAX 44-1865-272175.

  • Article published on-line before print: Genome Res.,10.1101/gr.198701.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.198701.

    • Received May 31, 2001.
    • Accepted September 10, 2001.

Articles citing this article

| Table of Contents

Preprint Server