Generation, Annotation, Evolutionary Analysis, and Database Integration of 20,000 Unique Sea Urchin EST Clusters
- Albert J. Poustka1,5,
- Detlef Groth1,
- Steffen Hennig2,
- Sabine Thamm1,
- Andrew Cameron4,
- Alfred Beck3,
- Richard Reinhardt3,
- Ralf Herwig2,
- Georgia Panopoulou1, and
- Hans Lehrach1,2,3
- 1 Evolution and Development Group, Max Planck Institute for Molecular Genetics, Department of Vertebrate Genomics, 14195 Berlin, Germany
- 2 Bioinformatics Group, Max Planck Institute for Molecular Genetics, Department of Vertebrate Genomics, 14195 Berlin, Germany
- 3 Sequencing Group, Max Planck Institute for Molecular Genetics, Department of Vertebrate Genomics, 14195 Berlin, Germany
- 4 California Institute of Technology, Division of Biology 156-29, Pasadena, California 91125, USA
Abstract
Together with the hemichordates, sea urchins represent basal groups of nonchordate invertebrate deuterostomes that occupy a key position in bilaterian evolution. Because sea urchin embryos are also amenable to functional studies, the sea urchin system has emerged as one of the leading models for the analysis of the function of genomic regulatory networks that control development. We have analyzed a total of 107,283 cDNA clones of libraries that span the development of the sea urchin Strongylocentrotus purpuratus. Normalization by oligonucleotide fingerprinting, EST sequencing and sequence clustering resulted in an EST catalog comprised of 20,000 unique genes or gene fragments. Around 7000 of the unique EST consensus sequences were associated with molecular and developmental functions. Phylogenetic comparison of the identified genes to the genome of the urochordate Ciona intestinalis indicate that at least one quarter of the genes thought to be chordate specific were already present at the base of deuterostome evolution. Comparison of the number of gene copies in sea urchins to those in chordates and vertebrates indicates that the sea urchin genome has not undergone extensive gene or complete genome duplications. The established unique gene set represents an essential tool for the annotation and assembly of the forthcoming sea urchin genome sequence. All cDNA clones and filters of all analyzed libraries are available from the resource center of the German genome project at http://www.rzpd.de.
Footnotes
-
[Supplemental material is available online at www.genome.org. All data described are deposited in a searchable online database available at http://www.molgen.mpg.de/ag_seaurchin/. All sequences described in this study have been submitted to the GenBank EST section under accession nos. CD289359-CD297607, CD303636-CD324638, and CD330178-CD342180.]
-
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1674103.
-
↵5 Corresponding author. E-MAIL poustka{at}molgen.mpg.de; FAX 49-30-84131128.
-
- Accepted September 11, 2003.
- Received July 19, 2003.
- Cold Spring Harbor Laboratory Press











