An Oligonucleotide Fingerprint Normalized and Expressed Sequence Tag Characterized Zebrafish cDNA Library

  1. Matthew D. Clark1,5,
  2. Steffen Hennig1,
  3. Ralf Herwig1,
  4. Sandy W. Clifton3,
  5. Marco A. Marra3,
  6. Hans Lehrach1,
  7. Stephen L. Johnson2, and
  8. the WU-GSC EST Group3,4
  1. 1Max-Planck-Institut für Molekulare Genetik, 14195 Berlin, Germany; 2Department of Genetics, Washington University, St. Louis, Missouri 63110, USA; 3Washington University Genome Sequencing Center EST Lab, St. Louis, Missouri 63110, USA

Abstract

The zebrafish is a powerful system for understanding the vertebrate genome, allowing the combination of genetic, molecular, and embryological analysis. Expressed sequence tags (ESTs) provide a rapid means of identifying an organism's genes for further analysis, but any EST project is limited by the availability of suitable libraries. Such cDNA libraries must be of high quality and provide a high rate of gene discovery. However, commonly used normalization and subtraction procedures tend to select for shorter, truncated, and internally primed inserts, seriously affecting library quality. An alternative procedure is to use oligonucleotide fingerprinting (OFP) to precluster clones before EST sequencing, thereby reducing the re-sequencing of common transcripts. Here, we describe the use of OFP to normalize and subtract 75,000 clones from two cDNA libraries, to a minimal set of 25,102 clones. We generated 25,788 ESTs (11,380 3′ and 14,408 5′) from over 16,000 of these clones. Clustering of 10,654 high-quality 3′ ESTs from this set identified 7232 clusters (likely genes), corresponding to a 68% gene diversity rate, comparable to what has been reported for the best normalized human cDNA libraries, and indicating that the complete set of 25,102 clones contains as many as 17,000 genes. Yet, the library quality remains high. The complete set of 25,102 clones is available for researchers as glycerol stocks, filters sets, and as individual EST clones. These resources have been used for radiation hybrid, genetic, and physical mapping of the zebrafish genome, as well as positional cloning and candidate gene identification, molecular marker, and microarray development.

[The sequence data described in this paper have been submitted to the dbEST/GenBank data library under accession nos. AA497144AA497369, AA542435AA542678,AA545709AA545724, AI384176AI384205, AI384761AI384796,AI396646AI396663, AI396733AI396777, AI396895AI396938,AI397015AI397130, AI397219AI397252, AI397388AI397484,AI415743AI416403, AI436854AI437493, AI444118AI444540,AI461280AI461395, AI476823AI478024, AI496677AI497576,AI522337AI522810, AI544445AI546083, AI558267AI558995,AI584192AI585023, AI585025AI585238, AI588088AI588836,AI601277AI601868, AI626134AI626875, AI629052AI629398,AI641018AI641780, AI657549AI658347, AI666929AI667197,AI667264AI667414, AI667488AI667567, AI721460AI721747,AI721839AI721978, and AI722283AI722483.]

Footnotes

  • 4 The WU-GSC Group is Deana Pape, Tammie Kucaba, Jennifer Bennett, Yvette E. Ritter, Irina Ronko, Brenda Theising, Rusudan Tsagearishivili, Todd Wylie, Rick Waterman, and Dan Fischer.

  • 5 Corresponding author.

  • E-MAIL clark{at}molgen.mpg.de; FAX ;49 30 84131380.

  • Article published on-line before print: Genome Res.,10.1101/gr.186901.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.186901.

    • Received March 1, 2001.
    • Accepted June 12, 2001.
| Table of Contents

Preprint Server