LETTER

Identification, Characterization, and Mapping of Expressed Sequence Tags from an Embryonic Zebrafish Heart cDNA Library

Published December 1, 2000. Vol 10 Issue 12, pp. 1915-1927. https://doi.org/10.1101/gr.154000
Download PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 6
Current Issue:

Abstract

The generation of expressed sequence tags (ESTs) has proven to be a rapid and economical approach by which to identify and characterize expressed genes. We generated 5102 ESTs from a 3-d-old embryonic zebrafish heart cDNA library. Of these, 57.6% matched to known genes, 14.2% matched only to other ESTs, and 27.8% showed no match to any ESTs or known genes. Clustering of all ESTs identified 359 unique clusters comprising 1771 ESTs, whereas the remaining 3331 ESTs did not cluster. This estimates the number of unique genes identified in the data set to be approximately 3690. A total of 1242 unique known genes were used to analyze the gene expression patterns in the zebrafish embryonic heart. These were categorized into seven categories on the basis of gene function. The largest class of genes represented those involved in gene/protein expression (25.9% of known transcripts). This class was followed by genes involved in metabolism (18.7%), cell structure/motility (16.4%), cell signaling and communication (9.6%), cell/organism defense (7.1%), and cell division (4.4%). Unclassified genes constituted the remaining 17.91%. Radiation hybrid mapping was performed for 102 ESTs and comparison of map positions between zebrafish and human identified new synteny groups. Continued comparative analysis will be useful in defining the boundaries of conserved chromosome segments between zebrafish and humans, which will facilitate the transfer of genetic information between the two organisms and improve our understanding of vertebrate evolution.

[The sequence data described in this paper have been submitted to the GenBank data library under accession nos.BE693120BE693210 and BE704450.]


The Human Genome Project (HGP) has amassed a vast quantity of sequencing data; over 90% of the human genes have been deposited into GenBank (June 2000). However, functional interpretation of this sequence data has proven more challenging. Much of this work has involved the study of model organisms because functional inferences based on interspecies comparison of sequences have identified implied function of many orthologous human sequences (Makalowski and Boguski 1998).

Recently the zebrafish, Danio rerio, has been recognized as a useful model for the study of development biology and genetics (for review, see Driever and Fishman 1996). One significant advantage of using the zebrafish as a model organism for developmental study is the external development and transparency of the zebrafish embryo. This permits the study of subtle developmental phenotypes in vivo. The zebrafish is also well suited for studies in cardiovascular development because a beating heart is formed and functions within 1 d of fertilization. In addition, the zebrafish embryo does not require blood flow for survival during the first 2 d of development. Thus, zebrafish mutants lacking a circulatory system can still develop normally in the first 2 d (Warren and Fishman 1998) and this allows for studies of mutations that affect the development of the zebrafish heart. Despite all these advantages, the zebrafish suffers from the major drawback of being a new model organism. For example, the number of genes that have been characterized from this species is small compared with other model organisms such as mouse, Drosophila, and Caenorhabditis elegans.

Expressed sequence tags (ESTs) have proven to be a powerful and rapid approach to identify new genes that are preferentially expressed in certain tissue or cell types (Hwang et al. 1997; Liew et al. 1994;Adams et al. 1995). ESTs have also been used for physical mapping, as has been demonstrated in the development of the human and mouse gene maps (Hayes et al. 1996; McCarthy et al. 1997; Deloukas et al. 1998). Currently, the number of zebrafish ESTs in the public databases is still small compared with mammalian sequences, and there are relatively few tissue-specific cDNA libraries.

Mutational screens in the zebrafish have identified several thousand mutations that affect normal development of the embryo (Development, Dec. 1996), including many with essential functions during embryonic heart development (Chen et al. 1996;Stainier et al. 1996). However, the usefulness of these mutations remains limited until the genes responsible for the observed phenotypes are cloned. This is limited in part by a paucity of ordered genes on the zebrafish gene map. Linkage maps based on rapid amplified polymorphic DNAs and microsatellite markers have been produced (Postlethwait et al. 1994, 1999; Johnson et al. 1996; Knapik et al. 1996, 1998; Shimoda et al. 1999). Because linkage mapping requires polymorphic markers for map construction, radiation hybrid (RH) mapping provides a complementary approach to rapidly assign genes and ESTs on the zebrafish map because RH mapping is able to map virtually any marker. Two recent RH maps (Geisler et al. 1999; Hukriede et al. 1999) of more than 3000 markers, genes, and ESTs have dramatically increased the density of the zebrafish gene map and should facilitate the cloning of many identified mutants.

Given the potential and importance of the zebrafish as a model organism for the studies of cardiac development, there is a need for development of EST resources from zebrafish heart cDNA libraries. Here we report the characterization of 5102 ESTs from a 3-d-old zebrafish embryonic heart cDNA library. We also report new map positions for 98 zebrafish ESTs identified in this cDNA library by RH mapping (Table 1) and identification of new synteny groups between zebrafish and human. This EST database represents a new genomic tool for studying aspects of cardiovascular development and disease in the zebrafish and a resource of genes for novel candidate gene discovery.

Table 1.

List of Mapped Zebrafish ESTs

EST identity Clone names Accession no. Primers Product size (bp) LG
SDF5Zeh0225 BE693123 F-ACGTGTAGTTAATGCAGCCGR-GCTGCACTGTTACAGCAATG1821
Actin, alpha cardiacZeh0293 BE693127 F-GAACGTATGGCACTGGAATCR-GACAGCAGAATTACAAGCG1381
Eph-related receptor Tyrosine kinase ligand 5 (HTK-L)Zeh0344 BE693134 F-CTTGTCAGCCATCTGGAATGR-ATGAATCTGGACATTGCTCG2001
Myosin light chain 1, alkali; skeletal fast muscle (MLC1SA)Zeh0637 BE693148 F-ACAGCTAAGAGGTGCTGTCGR-AAGTCACATCGTCCTCATGC1701
Zinc finger DNA binding protein (fZic)Zeh0655 BE693149 F-TTCTGTACACATTTTCGTCGR-GCCCAAGTCAAATGTTGTAC2041
E2F-related transcription factor (DP-1)Zeh1183 BE693163 F-GCTTGCTCAGAGCTGTGAAGR-GTGCTACTGATTCACGCCAG1221
Zinc finger factor (cysteine-rich protein)Zeh1201 BE693166 F-TAAACTGCCAGCACATTTCCR-TTATGGGATGTTGATCTCGG1631
Prolifin IIZeh0341 BE693133 F-CGCAAATGGAGCTGAATATCR-TGTTGCTACTGTGAGATGGG1842
Collagen pro alpha-I (III)Zeh0853 BE693156 F-ACACAGACATTGCATTCCACR-TTGATTACGCCGTAGCTATG1702
Natural killer cell enhancer factorZeh10637 BE693184 F-ATGCTGCAGAGTCTAGTGCCR-GTTCACCGATAAGCATGGAG1392
GTP-binding proteinZehn0822 BE693205 F-CTGCACTGACGTTACACTGCR-TCACGTATTGCATGCATCTG1062
RAG cohort 1 (RCH1)Zeh0389 BE693142 F-GTACAACTTGGAGCACGAGGR-GAATGTGTCGCACTTGAAGC1853
Homeobox transcription factor (hoxb2a)Zehn0229 BE693187 F-TATTCAATAGGGACAACGCCR-TGCCCATGTCGAAGTATCAG1943
GTP-binding protein (GST1-HS)Zehn1143 BE693210 F-GTGAACACGCTCATGCACTTR-ATAATGGCAGGCGGATACAG2013
Rabin 3Zehn0379 BE693189 F-GTGTTTCATCCGACAAGAACGR-GAACAGAGCCGTCACAGATG1824
Carnitine acetyltransferaseZeh0248 BE693124 F-ACAAAAGCATTCCAGGTGACR-ACTGCCACAATCACCAGTTC1125
Myosin heavy chain, betaZeh0269 BE693126 F-TGTGACTCTGCAATGTCAGCR-TCTGGTTGACAAGCTTCAGC1505
Myosin alkali light chain, atrialZeh0374 BE693138 F-ACGTGTAGTTAATGCAGCCGR-GAACAGTGATGGGTGCTGAG1235
Phosphoribosylpyrophosphate synthetase isoform (PRSP1)Zeh0682 BE693153 F-GCTCCAGTGTAAGCTGTTGGR-ACACAGGTCTGTGAAGTGCC2375
Ran/TC4 binding protein (RANBP1)Zeh1094 BE693158 F-TGATGACTGACGACTGGTCCR-CTTCACAAGACCTTGGTGCC1365
LIM domain transcription factorZeh10169 BE693181 F-TGGCAGACACTGAATAGCAGR-GTTGGTCTCATGAGGAAACG1155
Apolipoprotein A-I protein precursorZehn0309 BE693188 F-ACATCTGTGCGAATGTGGTCR-TTGAGGACTTGAGGACCATG1655
DEAD-box protein 72 (P72)Zeh0176 BE693119 F-CACTTAATCGGTCCGTGATCR-GTTGTGTCAATCTGCCAACA1966
High density lipoprotein binding protein (HDLBP)Zeh0409 BE693144 F-ACCAACCTGCATAGCAACTGR-GGCAGCAGAAGTCCTAAGGT2456
Gap modifying protein 1 (GMP1)Zeh0670 BE693152 F-TCGATGGCAGAGGTATGTTCR-TTCCGATTTGAGTGAACTGC1426
S-adenine homocystein hydrolaseZeh1173 BE693162 F-TTCATCAAATGCTTTCCTCGR-GTAAATGGCGCATTGAATTG1186
S-adenosylhomocystein hydrolase (AHCY)Zeh1364 BE693175 F-CTACCCAGACTCACAGCCTGR-TCGGCTCTTCCATGTCTTAC1536
PB1 (Polybromo)Zewp0130 BE693203 F-GCGTGTCTTTCATCATCAGGR-AAGACTGCCGGCTTAGTAGG1266
Myosin regulatory light chain, smooth muscle (MLCB)Zeh0157 BE693117 F-CAAATGAGATCGAATGCATGR-CTCGTGGATCATGTGTTCAC1557
Kruppel related zinc finger protein (HTF10)Zeh0353 BE693137 F-AGTCAACATGAAACACCAGGR-TTTGAACATATGCATGTTGG1597
Ferritin heavy subunit (FTH1)Zeh1145 BE693159 F-ATATCCAGCCACACGTGATGR-ACATGTTCGACAAGCTCACG1587
Troponin-T fast muscleZeh1249 BE693169 F-AACAGATAAAGCTGGCGAGCR-TCACTCGTGGTCAAGACATG2497
Homeobox transcription factor iriquois 3 (Xiro3)Zehn0543 BE693191 F-ATATCGTATCGACGCATTGGR-AATGTTCATGCATGGCTGTC1297
Myosin binding protein C, cardiac (MYBP-C)Zehn0716 BE693192 F-CGAACTTCCAGTTTGCATTCR-ACGAAGCCAAGTACAGGATG1467
Tumor necrosis factor receptor type I associated protein (TRADD)Zehn0873 BE693193 F-TTAATCTCGTGGCTGGATCCR-ACAGGCCTATCAACTGCTGG1387
NovelZehn1157 BE693198 F-CACATCTGGCAGACATCAGAR-TGGTTCATGCACTGACTGAC1507
Arrestin TRCarr (ARRB2)Zeh0294 BE693128 F-GGACGACTGAAGGATTCATGR-ATCATCATCGCTGACTGTGG1568
Y box protein 1Zeh0308 BE693131 F-TCTGCATAGAGTCTGCAGGCR-CAACATCCAACATCTGAGCA1098
Mitogen-activated protein kinase 14 (CSBP1)Zeh1243 BE693168 F-GATGCTAAAGCGGACAGATGR-CCTGAGGTTGCTACTGTGAA1748
Atrial natriuretic factor (ANF)Zeh1304 BE693172 F-CGGGATATGCTGTATGTATTTCAACR-TCGAATGTATATTGACACTGCGTAG1658
Death-Associated Protein 1 (DAP)Zeh0189 BE693120 F-TCATGGCCATCACTTACTCGR-CAAATGCCAAGCACATTCAG1799
PINCH protein (PINCH)Zeh0381 BE693141 F-GTTTCCTTGTCCTCACAGGCR-ACACTGCTATGAGCGAATGC1099
Transcription repressor (GCF2)Zeh1367 BE693176 FACACGTCTCCAGCAACATTCR-GACATGACATCCCCATCTTC1799
Parvalbumin beta (PVALB)Zehn1044 BE693194 F-ACGATACAGTGCCACGACTGR-ATCTGATGCCATCGCTGTC1449
Ws-3Zeh0038 BE693112 F-ATCCCTCATAGAGCCAATGGR-GCAAGGTTTCGAGGTAGAGG11310
Rac protein kinase betaZeh0582 BE693147 F-GCCCATGTCTGACTGTGATCR-TTCGAGAGTGACGCCTTATC18110
Nonhistone Chromosomal Protein (HMG17)Zeh0767 BE693154 F-ATCCCTCATAGAGCCAATGGR-ATCGTAAATGTTGACAGGCG17010
Actin-related proteinZehn1110 BE693209 F-AGGCGGATCTTAGTCAGGACR-TTCTGAGCTCTTCTGGCACT9910
Collagen type I, alpha-I (COL1A1)Zeh0348 BE693135 F-AGAGATGTGCATTGCATTCGR-TTGCCAGTTCGTCTAACGTC11512
Autoantigen annexin XI (ANX11)Zeh0376 BE693139 F-GATGAACAGGCTGAACCTCCR-TTCACTGAGGTTTGACCCTG13512
Creatine Kinase MZeh0657 BE693151 F-GAAACGAGCCAACAGTAGCCR-TTGAAATGATTCTGCACGTG16512
NovelZeh0008 BE693110 F-TCAATTATTGCATGCAGCACR-TATCCTCATGAAGCCTGGAC14513
Hypothetical protein (K04G7.12)Zeh0031 BE693111 F-GGTTCTGCTTGATCTCTGCCR-ACAATGACGACGCTGACATC10513
Calcium-Binding protein (EF-Hand)Aeh1186 BE693164 F-TTGAAATGCACAACAGACCCR-TCATTGACCTGTGCATGTTC15213
BMP5Zeh10669 BE693185 F-GCATATCCACCCACTGACATR-ATCAATTCATCAGCGACCAC25813
Vinculin (VCL)Zehn2160 BE693199 F-AACTTTCACAACCAGGCACTR-ACCTTTAGCTGAGATCCGTG16013
CArG box binding factorZeh1271 BE693171 F-ACACGATGGGAGGAAGTCTCR-TGAAATCTGTTAGCGGCAAG10314
Receptor for activated protein kinase C (RACK1)Zehp0047 BE693201 F-GCCACACTCTGATCAGGTTGR-CATTGTTGATGAGCTGAGGC13714
Nonhistone Chromosomal Protein (HMG-14A)Zeh0993 BE693157 F-ACTGCTGGCATGTTCACAAGR-AAGCTAATGGCAGAGCTGTG10215
TBX2 Protein (T-Box protein 2)Zeh1581 BE693179 F-CACTCTAATCATCCATGCGCR-AGTAAGCGGCCTAGAGAGCC16415
neurofibromatosis protein type 1 (NF1)Zehn0874 BE693206 F-TCAGACGAACACGCATCTTCR-GAAGGCACAGTCTTGACTGC15515
Notch homologue 2Zehs0146 BE693202 F-TGCATGTCGGATAGTTACCGR-GCCATGTGATTGGCTAATTG21115
pregnancy-specific beta 1-glycoprotein 4 precursor (PSBG4)Zeh0068F-CAGTGAGGCACAAAGGTAGCR-TGAACTTTAGAGAGGCTGGC12316
NovelZeh0082 BE693114 F-TGCCATTGCTGTATCTCACAR-CGTCTGAATCTGTTGCATTG18116
NovelZeh0312 BE693132 F-TCAGCTGATGAAGTTCCAGAR-ACATGTGTGCTTGTAGCAGG12216
Peanut (pnut)Zeh0351 BE693136 F-AGATCTGCCTGTGTCCGAACR-ATGTTCATCCAGCAGACTGG11116
NovelZeh0402 BE693143 F-GAGTTGCAGAGCTGGAGAACGTATTGTTGCCTAGTGGCCA21716
Rab 13Zeh0455 BE693145 F-CTCACACCACTCATCTGACCR-TACATTCCAGTCTGTCAGCC12916
PlectinZeh0535 BE693146 F-ATCAAGCTTGCCAGATGAAGR-GCACAAGCAAGACATGAGC17216
Apolipoprotein E precursor (APOE)Zeh1311 BE693174 F-TTCATTTCAGCAGCTGAAGGR-AATGCCATGTACTCACCACG19916
Protein-tyrosine-phosphotase nonreceptor type 2Zeh1546 BE693177 F-ACTCGCTGAGCTTTAACCTGR-ACCGTCGTGGTAAGTTGTTG18716
S-100 ProteinZehn1116 BE693208 F-TGCATTGTAACTGCAGTTGCR-CCTGCGAACAACTTTACCAG16916
NovelZeh0377 BE693140 F-TGCATGTCTGTGAGTGTTGAR-CGCAGTGAGTGTTTATGCTC22317
IL-13 receptor alpha chainZewp0171 BE693204 F-GCTCGGATAGAAAGCAGACAR-AGTACGTGATTGCGGTTCTG11117
Serine/Theonine protein kinaseZeh1150 BE693161 F-GCTTGTGAAGCGAGTCTCAGR-CTTGTGCACCAGGTCACTGT18418
Death-Associated Protein 5Zeh1307 BE693173 F-GGCAAATGCAAGTCAGGTACR-ATCTGGTCCCATTGATCTGC20318
Frizzled proteinZeh10603 BE693183 F-CTGATCGATGCCAACTCTTGR-GCAATTGCTCTAGCATGGAG15118
Tropomyosin, alpha non-muscleZeh0298 BE693129 F-CAGTGCCACTGCTTTGAACTR-GAGCAGAATGAGCCCAAGTC13819
hCDC10 (CDC10 homolog)Zeh0656 BE693150 F-GTGGTATTGGAGAAGGCCAGR-CCAGTTCACTGCTTGCTGAA31919
TitinZeh1256 BE693170 F-AAGAGCTGGCACAGTTTCTGR-GGCTTGCACACTGAGTTCAT14619
TGF-beta receptor interacting protein 1Zehn0464 BE693190 F-CTCCGTGCAGCTGAGTTAGGR-GTTACAGCAGCGTTGGAGAG14319
Zinc finger protein 45 (BRC1744)Zehn1068 BE693195 F-CTCTGTAAGCTGACCGATCCR-GGCAGCAGTCTCAGTAATGC24519
Rab5c-like proteinZehn1144 BE693197 F-AGTGCAAGGCATGGAGTAAGR-CTAAGTGAATATGCGGCTGC15119
Regulator of G-protein signaling 7(RGS7)Zeh0300 BE693130 F-GCAGTGATCACAATACCCTGR-TCCTTCAGAACGCAGATAGA17520
Apolipoprotein B (APOB)Zehl207 BE693167 F-GGATGACAATAGGTTGCAGGR-GAAGCCAATGGACACTTCAC16720
Connective tissue growth factor XCTGFZehl559 BE693178 F-TGACAGGGATACTGGCTCTTR-ACAGGACCTAGTCGAGTTAG11220
Deep Orange proteinZehl0587 BE693182 F-ATGCACATCCGGTTACATGTR-CGCAGAAGTTCGATCAAGAG12020
NovelZeh0115 BE693115 FATAGGCTATTGGCGTTGACAR-GACGCGTGAATGAAGTGAGT16722
Zinc finger protein 37 (DNA binding protein) (ZFP37)Zeh0174 BE693118 F-CTACATGCTGAATCTGGCCAR-CACGAGAGGACTCACACTGG16422
Similar to yeast SSU72Zeh1122 BE693160 F-GGCTGCGTCAGGTACAATTAR-TACTGACCGCAGCAGAGTGT26322
NovelZeh0124 BE693116 F-GCCACTCTCAGTGCTGTAGCR-GAGGATCATGGTCACCTGTG14023
TwistZeh0190 BE693121 F-GTTACCCGTCACTGAAGCAGR-CTGACCTGATGGATCAAGGC12323
ARD-1 N-acetyltransferase homolog (TE2)Zeh0223 BE693122 F-TAACTCCATGGGTGAGAACCR-ACGGACGTCAAAGACTCATC14823
Neural cell adhesion moleculeZeh0266 BE693125 F-AGAACGGATTCCTGGACTCAR-CACAAGTGTAACCGCTCTGT13223
Carboxyl terminal LIM domain protein (CLIM1)Zeh1190 BE693165 F-TACAGGGCTGTGAACTCCACR-AATACAGTTTCGCACATGCC24123
TGB-b superfamily receptor 1Zehn1109 BE693196 F-ACTTGGTGCGAGCTGTAATGR-TTGTGGACTTCCTAACTGCG17123
P1-Cdc21Zeh1616 BE693180 F-CCTGCAGGATAATACGCAGTR-TATGCAAAGCATGTGCTCTC15924
Eph-like receptor tyrosine kinase hEphB1b (EphB1)Zehn0206 BE693186 F-CATGAGCCTCAGGAGTGAAGR-AACACGGCAAGACTGTGATG1983,12
RanBP7Zeh0048 BE693113 F-GTTGCGATATCCTGAAGCTGR-CACGACCTTAGTGGACGATG156
Prostaglandin D SynthaseZeh0800 BE693155 F-ACACATCGGTCCAGAACATGR-TGAACAGTCATGGTGTGCTC143
Calpain 2Zehn1036 BE693207 F-GTCTTCATCCAGGTCTGCTGR-TCGAACTGGATATCCTGCAG12722
p47Zehn2383 BE693200 F-TCTCCAACTCCAGAGTGCAGR-AGCCTGACACTGAAGGAAGC101

[i] Listed are the putative identities of mapped ESTs as determined by matches to known sequences in GenBank, the accession no. of the ESTs, the names of the ESTs, primer sequences, PCR product sizes, and linkage group assignment.

A total of 5102 EST sequences were processed with the TIGR Assembler to estimate the number of unique transcripts represented in the EST set. A total of 359 clusters composed of 1771 ESTs were generated, whereas the remaining 3331 ESTs did not cluster. The number of unique transcripts identified from the zebrafish embryonic heart EST set was therefore estimated at up to 3690.

RESULTS

Overview of ESTs from the Zebrafish Embryonic Heart cDNA Library

A unidirectional cDNA library was constructed from 3-d-old zebrafish embryonic hearts. A total of 5102 random clones were partially sequenced from this cDNA library to generate ESTs. In total, 2937 (57.6%) showed significant identity to known sequences in the nonredundant nucleotide and peptide databases; of these, 946 were zebrafish entries. Another 722 (14.1%) ESTs matched to other ESTs in dbEST but not to any known sequences. The remaining 1418 (27.8%) showed no match to any known sequences and were designated as novel genes (Table2).

Table 2.

Summary of ESTs from the Zebrafish Embryonic Heart

Unmatched–novel1418 (27.8%)
ESTs matching to known sequences
 Matched to other ESTs722 (14.1%)
 Matched to known genes2242 (43.9%)
 Mitochondrial DNA237 (4.6%)
 Ribosomal proteins & RNA447 (8.8%)
 Repetitive elements11 (0.2%)
 Vector25 (0.5%)
Total5102 (100.0%)

A total of 5102 EST sequences were processed with the TIGR Assembler to estimate the number of unique transcripts represented in the EST set. A total of 359 clusters composed of 1771 ESTs were generated, whereas the remaining 3331 ESTs did not cluster. The number of unique transcripts identified from the zebrafish embryonic heart EST set was therefore estimated at up to 3690.

Known Gene Expression Profile in Zebrafish Embryonic Heart

ESTs matching to known genes were categorized into seven categories on the basis of general functions of the genes (cell division, cell signaling/communication, cell structure/motility, cell organism/defense, gene/protein expression, metabolism, and unclassified) (Adams et al. 1995; Hwang et al. 1997). In total, 1242 unique known genes were represented and the percentage of transcripts in each category was calculated. The largest class of genes represented those involved in gene/protein expression (25.9%). This class was followed by genes involved in metabolism (18.7%), cell structure/motility (16.4%), cell signaling and communication (9.6%), cell/organism defense (7.1%), and cell division (4.4%). Genes lacking enough information to be classified constituted the remaining 17.9% (Table3).

Table 3.

Functional Distribution of Known Genes

Functional Category No. of Unique Genes, %
Cell division55 (4.4%)
Cell signaling/communication119 (9.6%)
Cell structure/motility204 (16.4%)
Cell/organism defense88 (7.1%)
Gene/protein expression322 (25.9%)
Metabolism232 (18.7%)
Unclassified222 (17.9%)
Total1242 (100%)

Consistent with the high proportions of ESTs involved in gene/protein expression, ribosomal proteins were some of the most abundantly expressed (Table 4). Among other abundantly expressed genes, nine copies of the bone morphogenetic protein 4 (BMP4) were identified. Within the category cell structure/motility, the largest groups of ESTs represented contractile proteins, cytoskeletal proteins, and components of extracellular matrix. The high frequency of these transcripts was not unexpected for the heart, on the basis of our previous experience. However, an unusually high number of keratin proteins (75 clones) and cytokeratin proteins (77 clones) were identified, perhaps due to inclusion of some noncardiac tissues during the isolation of the embryonic hearts.

Table 4.

Most Abundant Genes Expressed in the Embryonic Zebrafish Heart

Identity Frequency (%) Identity Frequency (%)
Cell division (n = 2) Ribosomal protein S810
Nonhistone chromosomal protein HMG-176Ribosomal protein L1710
Prothymosin alpha6Ribosomal protein L810
Cell signaling/communication (n = 4)Ribosomal protein L4110
Parvalbumin, beta26Ribosomal protein L199
Calmodulin11Ribosomal protein L69
Bone morphogenetic protein 4 precursor (BMP4)9Ribosomal protein L118
Receptor for activated protein kinase C (RACK1)6Elongation factor 28
Cell structure/motility (n = 22) Ribosomal protein L277
Myosin heavy chain, fast skeletal muscle62Ribosomal protein L37
Actin, alpha skeletal53Ribosomal protein S27
Actin, beta42Ribosomal protein S187
Keratin37Ribosomal protein L137
Cytokeratin S35Ribosomal protein L13A7
Myosin light chain 2, fast skeletal muscle (mlc2f)16Ribosomal protein S37
Tropomyosin, alpha skeletal muscle14Homeobox protein LIM-36
Cytokeratin II14Ubiquitin6
Cytokeratin 811Ribosomal protein L18a6
Myosin light chain 1a, fast skeletal10Ribosomal RNA large subunit6
Cytokeratin type I (cytl)10Ribosomal protein L106
Collagen alpha-2 type I9Ribosomal protein S176
Myosin light chain 3, fast skeletal9Ribosomal protein S196
Actin, alpha cardiac9Acidic ribosomal protein P26
Tubulin, alpha8Ribosomal protein SA (P40)6
Keratin, type II7Ribosomal protein S206
Myosin regulartory light chain 2A, atrial muscle6Ribosomal protein L226
Desmin5Ribosomal protein S96
Fibronectin5Ribosomal protein S106
Keratin, type II (58 kD)5Ribosomal protein L326
Myosin heavy chain, alpha cardiac5Ribosomal protein L1a6
Myosin light chain 20-kD (MLC-2)5Ribosomal protein S116
Cell/organism defense (n = 10) Ribosomal large subunit 26S6
Globin, beta embryonic 1 (bE1)31Ribosomal protein L185
Heat shock cognate (hsc70)31Ribosomal protein S145
Globin 2, alpha-type embryonic15Ribosomal protein L1 (L4)5
zfY1–A cold shock protein11Ribosomal protein L55
Heat shock protein hsp90beta10Ribosomal protein L145
Creatine kinase M2-CK6Ribosomal protein L95
Globin, alpha6Ribosomal protein S125
Globin, alpha-type embryonic6 Metabolism (n = 9)
Globin, beta6ADP/ATP carrier protein19
Glutathione S-transferase5Cytochrome b18
Gene/protein expression (n = 50) NADH ubiquinone oxidoreductase subunit 4L12
Elongation factor 1 alpha43Apolipoprotein A-I precursor protein11
Acidic ribosomal phosphoprotein P016Cytochrome C oxidase subunit III8
Cathepsin L15Apolipoprotein E precursor protein7
Elongation factor l-gamma14NADH dehydrogenase subunit I7
Ribosomal protein S713ATP synthetase beta-subunit5
Ribosomal protein L7A13ATPase, calcium, sarcoplasmic/endoplasmic  reticulum 1 B5
Polyadenylate-binding protein12Isocitrate dehydrogenase5
Ribosomal protein S4 isoform11 Unclassified (n = 3)
Ribosomal protein S611Translationally controlled tumor protein P23 (TCTP)12
Ribosomal protein L3011Ependymin beta and gamma chains (Epd)7
Ribosomal protein S3A10SMT3A protein7
Ribosomal protein L410

[i] Genes are categorized in seven different functional categories and are listed in descending order according to their frequencies.

Comparative Analysis of Gene Expression Profile between Human Fetal Heart and Zebrafish Embryonic Heart

To determine similarities and differences between the two-chambered zebrafish and the four-chambered human heart, we compared proportions of genes in each functional category by using human fetal data fromHwang et al. (1997). Significant differences were detected in five different functional categories. It was found that in the zebrafish embryonic heart, there were significantly fewer transcripts encoding proteins that function in cell division (P < .005), cell signaling/communication (P < .001), and gene/protein expression (P < .001), whereas those involved in cell structure/motility and cell/organism defense were significantly increased (P < .001) relative to human fetal heart (Fig.1; Table 5). Detailed analysis of subcategories found that the decrease in cell division-related transcripts in zebrafish was due to a lower proportion of transcripts representing the general factors of cell division, whereas the decrease in cell/signaling communication was a result of the relative scarcity of identifiable growth factors and hormones in the zebrafish (Table 6). However, the number of transcripts representing effectors/modulators was significantly higher in the zebrafish. This increase could be attributed to a large number transcripts for parvalbumin, a calcium sequesterer detected in fish cardiac muscle (Laforet et al. 1991). Analysis of the cell structure/motility category revealed that extracellular matrix was the only subcategory that showed a significant decrease. However, the number of transcripts representing cytoskeletal proteins was much higher in the zebrafish. This increase was due to the large number of keratin and cytokeratin transcripts present. In the gene/protein expression category, the transcription factors, postranslational modification, ribosomal proteins, and translation factors subcategories all decreased significantly in the zebrafish.

Figure 1.

Comparison of relative levels of gene expression between embryonic zebrafish and human fetal hearts. Represented are levels of gene expression in the embryonic zebrafish and fetal human hearts in seven different functional categories. The χ2 test was used to determine statistical significance (*P = .005;+ P = .001).

13f1_F1TT_rev1
Table 5.

Relative Levels of Gene Expression in the Embryonic Zebrafish and Fetal Human Hearts

No. of ESTs Proportion of ESTs
Z H Z H EXP OBS/EXP χ2
Cell division
 General171540.65%1.42%36.970.4610.95
 DNA synthesis/replication8240.31%0.22%5.761.390.87
 Apoptosis6110.23%0.10%2.642.274.28
 Cell cycle20920.77%0.85%22.090.910.20
 Chromosome structure231490.88%1.37%35.770.644.63
Category subtotal744302.84%3.96%103.240.728.63[i]
Cell signalling/communication
 Cell adhesion11930.42%0.86%22.330.495.80
 Channel/transport proteins10780.38%0.72%18.730.534.10
 Effectors/modulators601562.30%1.44%37.451.6013.77
 Hormones/growth factors272971.04%2.74%71.310.3828.32
 Intracellular transducers272421.04%2.23%58.100.4617.04
 Metabolism0280.00%0.26%6.720.006.74
 Protein modification251660.96%1.53%39.860.635.62
 Receptors29971.11%0.89%23.291.251.41
Category subtotal18911577.25%10.66%277.790.6831.83
Cell structure/motility
 General12480.46%0.44%11.521.040.02
 Contractile proteins2298688.79%8.00%208.401.102.22
 Cytoskeletal32453712.43%4.95%128.932.51310.75
 Extracellular matrix694102.65%3.78%98.440.709.16[i]
 Microtubule-associated/motors300.12%0.00%0.00n/an/a
 Vesicular transport4330.15%0.30%7.920.501.95
Category subtotal641189624.60%17.47%455.221.4192.17
Cell/organism defense
 General521002.00%0.92%24.012.1730.63[i]
 DNA repair18640.69%0.59%15.371.170.45
 Carrier protein/membrane transport963033.68%2.79%72.751.327.65
 Stress response621462.38%1.35%35.051.7721.00
 Immunology7540.27%0.50%12.970.542.76
Category subtotal2356679.02%6.15%160.141.4738.74
Gene/protein expression
 RNA synthesis
  RNA polymerases3280.12%0.26%6.720.452.07
  RNA processing613352.34%3.09%80.430.764.85
  Transcription factors794583.03%4.22%109.960.728.33[i]
 Protein synthesis
  Posttranslational modification/targetting563412.15%3.14%81.870.688.45[i]
  Protein turnover541512.07%1.39%36.251.498.81[i]
  Ribosomal proteins449223217.23%20.56%535.890.8417.81
  tRNA synthesis/metabolism6330.23%0.30%7.920.760.47
  Translation factors1036853.95%6.31%164.470.6324.54
Category subtotal811426331.12%39.28%1023.530.7973.41
Metabolism
 General10280.38%0.26%6.721.521.6
 Amino acid22790.84%0.73%18.971.140.49
 Cofactors0120.00%0.11%2.880.002.88
 Energy/TCA cycle1445565.53%5.12%133.491.100.87
 Lipid511771.96%1.63%42.501.231.73
 Nucleotide32781.23%0.72%18.731.709.48[i]
 Protein modification9640.35%0.59%15.370.602.65
 Sugar/glycolysis503631.92%3.34%87.150.5616.40[i]
 Transport751462.88%1.35%35.052.1646.15
Category subtotal393150315.08%13.85%360.861.103.3
Unclassified26393610.09%8.62%224.731.157.14
Total260610854

[i] P = .005; †P = .001

[ii] (Z) Embryonic zebrafish; (H) Fetal human; (EXP) expected no. of transcripts; (OBS) observed no. of transcripts; (χ2) chi square result.

Table 6.

Zebrafish-Human Syntenies

LG EST name Gene Reference Human location
1Zeh0637MLC1SAa2q33–34
OTX3b2p13
DLX5b2q32
3Zehn0229HOXB2Aa17q21–q22
Zeh0389RCH1a17q23.1–q23.3
PARA2Bb17q12
CDC27b17q12–q23.2
HOXBb17q21–q22
7Zehn0716MYBPC2a11p11.2
CCND1b11q13
FGF3b11q13
Zehn873TRADDa16q22
VNCb16
CK2A2b16q13
12Zeh0348COL1A1a17q21.31–q22.05
HOXBBb17q21–q22
RARA2Ab17q12
DLX3b17q21.3–q22
13Zehn2160VCLa10q22–q23
RETb10q11.2
PAX2b10q24.3–q25.1
16Zeh0068PSG4a19q13.2
Zeh1311APOEa19q13.2

[i] This paper.

Significantly more ESTs were detected in the cell/organism defense category in the zebrafish, due largely to increases in three subcategories: general homeostasis, carrier proteins, and stress response. Although significant change was not detected in overall levels of transcripts devoted to metabolism, some subcategories exhibited significant changes. Specifically, the nucleotide and transport subcategories showed significant increases, but the sugar/glycolysis subcategory showed decreases. There were also significantly more ADP/ATP carrier proteins and ion-transporting ATPases identified in the zebrafish than in the human heart.

RH Mapping of Embryonic Heart ESTs

Primers were designed for 127 selected ESTs. Of these, 101 (79%) successfully amplified a zebrafish PCR product. Eleven of the primer pairs (9%) failed to amplify a detectable PCR product from zebrafish DNA, and primers for another 8 (6%) ESTs produced Hamster PCR products that could not be clearly distinguished from Zebrafish PCR products. Two primer pairs (2%) were designed for ESTs that are not covered in the hybrid panel (retentions frequency 0%) and primers for 5 (3%) other ESTs produced wrong size PCR products and were discarded. In total, mapping reactions were reproducibly scored for 102 genes represented in the EST set. Of these, 98 (96%) were successfully assigned to single linkage groups (LG), with 23 of 25 groups represented (Table 1). Linkage group 16 contained the most genes (n = 10), followed by LG 7 (n = 8), LG1 (n = 7), and LG6 (n = 6). No genes demonstrated significant linkage to LG21 or LG25 in this analysis (Table 1).

Synteny Analysis

To further analyze the conservation of synteny between zebrafish and humans, we compared positions of the mapped zebrafish ESTs and their human counterparts. Following the method described by Gates et al. (1999), we have identified one new conserved syntenic group between zebrafish and human and added more genes to the previously identified groups. Comparing map positions of zebrafish ESTs and human orthologs identified a new syntenic group belongs to linkage group 16 in zebrafish and chromosome 19 in human and added one to two extra genes to each of five previously identified groups (Table 6).

DISCUSSION

The generation of ESTs has proven to be a useful and rapid means to identify and isolate large numbers of expressed sequences (Adams et al. 1992, 1993; Hwang et al. 1994, 1995; Liew et al. 1994). Although extensive EST-based resources exist for human and other mammalian models such as mouse and rat, the EST database for the zebrafish presently contains approximately 100,000 ESTs and is still being developed (Gong et al. 1997; Gong 1999). In this report, we characterized the transcriptional profile of 3-d-old embryonic zebrafish hearts by generation of 5102 ESTs. Clustering of 5102 ESTs estimated the maximum number of unique genes represented in this set at 3690. Because this analysis was performed on 5′ end sequences that may arise from multiple nonoverlapping segments of the same gene, the true number of unique genes is almost certainly lower.

Of known gene matches, a number of genes thought to be involved in cardiogenesis were identified in the data set. These included nine copies of BMP4, which has been found to be involved in the regulation of left-right asymmetries of the zebrafish heart (Chen et al. 1997;Schilling et al. 1999). Other important factors known to regulate cardiogenesis were also identified, including homeobox transcription factors Nkx2.3/2.5, Mef2A/2C, and atrial natriuretic factor.

Although comparative analyses of DNA sequences have been performed between model organisms and humans (Koop 1995; Makalowski et al. 1996;Makalowski and Boguski 1998), little attention has been paid to studying the patterns of gene expression variations between model organisms and humans on a global scale. Understanding similarities and differences between identical tissues in different species is essential in establishing “synexpression” data sets, defining groups of genes that share a similar functional pathway (Niehrs and Pollet 1999). To investigate similarities and differences in gene expression profiles in the developing heart between zebrafish and humans, we analyzed relative levels of expression of genes with related functions. Despite limitations of comparing these two data sets at different stages of development, these findings provide us with a first look at global differences in overall physiological status between the two-chambered zebrafish and the four-chambered human heart, though for the most part, the analysis was too small to reliably reveal differences in the transcription of specific genes. Nevertheless, the results of this analysis suggest several interesting differences in patterns of expression. For example, the high frequency of transcripts detected in the cell/organism defense category in the zebrafish may indicate differences in homeostatic requirements between zebrafish and human hearts. A proportionally high number of heat shock cognate 70 transcripts (hsc70) was detected in the zebrafish heart, with 31 ESTs representing this gene (0.6% of all ESTs). This represents a significant increase in proportion of hsc70 expression over human fetal heart (0.1% of all ESTs; Hwang et al. 1997). Heat shock cognate 70 functions as a chaperone and is known to protect cells against apoptosis (Hohfeld 1998). Heat shock proteins can also be induced by environmental stress. Unlike human fetuses that develop in a stable environment in utero, fish embryos develop externally and it is plausible that the increased levels of hsc70 in the zebrafish embryonic heart may serve a protective role during embryonic development in the face of a potentially changing environment.

Beyond analysis of expression profiles, one immediate application of this EST resource is as a substrate for RH mapping. Recent reports have dramatically increased the number of mapped zebrafish markers, genes, and ESTs (Geisler et al. 1999; Hukriede et al. 1999). Here, we present mapping results for an additional 102 ESTs identified from our library that should further facilitate the identification of zebrafish mutant genes with essential functions during zebrafish embryonic development (Chen et al. 1996; Stainier et al. 1996).

Comparative analysis of map positions between zebrafish and human has identified that gene orthologs that are syntenic in mammals are also syntenic in zebrafish (Postlethwait et al. 1998). This discovery of extensive sharing of chromosome segments between zebrafish and humans has practical significance to the HGP. For example, synteny between zebrafish and humans will enable researchers to identify human ortholog from a gene's position in the zebrafish genome. Reciprocally, and more importantly, the phenotype of a zebrafish mutation can suggest function for the human gene (Postlethwait and Talbot 1997). However, before any conclusive characterization can be made about this conservation, more detailed analyses of these conservations are needed to further define the boundaries of conserved chromosome segments and the extent to which gene order is maintained between zebrafish and human. This information would be particularly useful in identifying candidate genes for positional cloning analyses. It is anticipated that the continuing development of a dense zebrafish map will markedly increase its utility and facilitate the transfer of genetics information between the zebrafish and human.

This collection of 5102 ESTs provides us with a preliminary view into the gene expression profile of the zebrafish embryonic heart. The identification of many genes known to be involved in cardiogenesis suggests that the generation of ESTs is an excellent method for identifying additional genes with essential roles in heart development. Further integration with mapping data of these zebrafish ESTs will provide a richer resource for identifying candidate genes for the several thousand mutants that affect zebrafish development. Construction and characterization of cDNA libraries from additional stages of development, with comparison of gene expression profiles between libraries, should provide further valuable insights into the molecular mechanisms of heart development and disease.

METHODS

RNA Isolation

Total RNA was isolated from 3-d-old zebrafish embryonic heart samples by the method described by Chomczynski and Sacchi (1987). Tissues were homogenized and extracted twice with acidic guanidinium isothiocyanate-phenol-chloroform. The poly(A)+ RNA fraction was isolated by oligo-dT cellulose chromatography (Pharmacia). Purity and RNA integrity were assessed by absorbance at 260/280 nm and agarose gel electrophoresis.

cDNA Library Construction

Libraries were constructed in the λZAP Express vector (Stratagene) according to the manufacturer's protocols. First-strand cDNA was synthesized with an XhoI-oligo(dT) adapter-primer. After second-strand synthesis and ligation of EcoRI adapters, cDNA was digested with XhoI, generating cDNA flanked byEcoRI sites at 5′ ends and XhoI sites at the 3′ ends. Digested cDNAs were size-fractionated with Sephacryl S-500 spin columns and ligated into the λZAP Express vector predigested with EcoRI and XhoI. The resulting concatomers were packaged by using Gigapack Gold packaging extracts. After titration, aliquots of primary packaging mix were stored in 7% DMSO at –80°C as primary library stocks, and the remainder was amplified to establish stable library stocks.

Partial Sequencing of 5′ Ends of cDNA Inserts

Plaques were picked randomly and eluted into SM buffer. Phage eluates (5 μL) were directly used for PCR reactions (50-μL final volume). Reaction mixtures contain 5 μL of 10X Taq buffer, 125 μL of each dNTP, 10 pmol each of forward primer (5′-GCCAAGCTCGAAATTAACCCTCACTAAAGGG-3′) and reverse primer (5′-CCAGTGAATTGTAATACGACTCACTAT AGGGCG-3′) and 1 U of Taq polymerase. The thermal cycle profile consisted of an initial denaturation at 94°C for 5 min, followed by 30 cycles of 94°C for 45 sec, 57°C for 30 sec, and 72°C for 3 min, and a final extension step of 72°C for 3 min. After agarose gel electrophoresis to determine the purity and concentration, 2 μL of PCR products were used directly for cycle sequencing by using the AmpliCycle Sequencing Kit (Perkin-Elmer) and 5 pmol of Cy5 labeled modified T3 primer (5′-GAAATTAACCCTCACTAAAGG-3′). The conditions for cycle sequencing were as follows: 94°C for 2 min, followed by 35 cycles of linear amplification (94°C, 30 sec; 50°C, 15 sec; 72°C, 1 min for 20 cycles and 94°C, 30 sec; 72°C, 1 min for 15 cycles). The reactions were stopped by addition of 0.5 v/v loading buffer (95% formamide, 20 mmol/L EDTA, 10 mg/mL blue dextran). Sequencing reactions were loaded onto 6% acrylamide gels and electrophoresed with A.L.F. and A.L.F. Express DNA sequencers (Pharmacia) (Hwang et al. 1995,1997).

Bioinformatics

Sequence search analysis of all ESTs against the nonredundant GenBank/EMBL/DDBJ nucleotide, nonredundant GenBank CDS translation/PDB/SwissProt/PIR/PRF peptide, and dbEST databases were performed with the BLAST algorithm (Altschul et al. 1990; Gish and States 1993) on a Unix platform (Sun Microsystems). Assignment of putative identities for ESTs required a minimum P value of 10–10. ESTs with known gene matches were categorized into different functional groups according to categories described in Hwang et al. (1997). Relative levels of gene expression were computed by summing the number of ESTs matching to that particular gene and dividing the sum by the total of ESTs that match to known genes (Hwang et al. 1997). The combined 5102 ESTs were clustered on the basis of sequence similarity by using TIGR Assembler (Fleischmann et al. 1995). Parameters were set so that ESTs were connected together only with a minimum of 95% nucleotide identity in an overlap region of 40 nucleotides. GenBank accession nos. of the Zebrafish Embryonic heart ESTs are AI353073-AI354214; AI616386-AI618739; AI618836-AI618858;AW453485-AW455194. Further clone information can be found on the Internet at URL www.tcgu.med.utoronto.ca.

Preparation of DNA Templates for 3′ End Sequencing

The cDNA clones were excised in vivo from the λZAP Express vector by using ExAssist/XLOLR helper phage system (Stratagene) before sequencing. Phagemid particles were excised by coinfectingEscherichia coli XL1-BLUE MRF′ cells with ExAssist helper phage. Excised pBluescript phagemids were used to infect E. coli XLOLR cells and selected by using kanamycin resistance. Single colonies were grown overnight in LB-kanamycin and DNA purified by using Qiagen plasmid purification kits. Purified DNA was then used for sequencing of 3′ ends.

Radiation Hybrid (RH) Mapping of cDNA Clones

A 94-hybrid zebrafish RH panel was purchased from Research Genetics. 3′-end sequences of each EST were used to design PCR primers with the assistance of the Williamstone Enterprises Primer Design program (http://www.williamstone.com). Primers were generally 20-bp long and were chosen to generate PCR products of 100–300 bp and a Tm range of 58–60°C. Primer pairs that showed high complementarity to each other or similarity to repeat sequences were discarded. ESTs for which no satisfactory primer pair was found were not used. Names, symbols, and primer sequences are summarized in Table1. Each primer pair was pretested for specificity with zebrafish and hamster genomic DNA (Research Genetics). Primer pairs that gave a specific zebrafish product were used to screen the RH panel.

PCR amplification was performed in 10-μL reaction mixtures containing reaction buffer, 2mM each dNTP, 0.05 U Taq polymerase, 4 pM each primer, and 5 ng each hybrid. The thermal cycle profile consisted of an initial denaturation at 94°C for 5 min, followed by 35 cycles of 94°C for 1 min, 58°C for 1 min, and 72°C for 1 min, and a final extension step of 72°C for 10 min. PCR products were separated by gel electrophoresis in 2% agarose with 0.5X TBE, and photographed on a UV transilluminator.

Each primer pair was tested in duplicate and positive products were scored. In case of discrepancies (positive on one plate but negative on the other), the band(s) were rescored. Retention profiles were submitted to the Max Planck Institute (Tubingen, Germany) for analysis by SAMapper 1.0 (Geisler et al. 1999).

Statistical Analysis

Analysis of differences in expression levels between zebrafish and human genes was performed by using 2606 and 10,854 unique genes respectively, with ESTs from the mitochondrial genome excluded from calculations. The expected number of zebrafish ESTs present in each functional category/subcategory was calculated based on the frequency of the observed number of ESTs in the fetal human heart cDNA library. By using the same method for identifying differentially expressed genes from EST-based expression profiles as described in Hwang et al. (2000), the statistical significance of the deviation of observed EST profiles from expected was tested with the χ2 test. For each category, the χ2 value was calculated by summing the χ2 value for that category with the χ2value calculated from the sum of the remaining category/subcategories. Statistical significance of the deviation from expectations was tested by the χ2 value with one d.f. The thresholds of significance were established at *P = .005 and+ P = .001. The statistical significance of deviation between the two sample sizes was confirmed by using another method for assessing significance of gene expression profiles as described in Audic and Claverie (1997) (http://igs-server.cnrs-mrs.fr).

Phylogenetic Analysis

Following the method described by Gates et al. (1999), each EST sequence was searched against the protein database at NCBI by using the BLASTX program (Altschul et al. 1990). Mammalian sequences that showed significant similarity to the zebrafish EST were retrieved. These sequences were then multiply aligned and neighbor-joining trees were constructed by using CLUSTALX (Thompson et al. 1997). A zebrafish EST is orthologous to a human gene if it appears as a sister group on the dendrogram. The locations of human gene loci were taken from Online Mendelian Inheritance in Man (OMIM) (http://www.ncbi.nlm.nih.gov/omim/); the Genome Database (http://www.gdb.org/gdb), and The Human Gene Map (http://www.ncbi.nlm.nih.gov/genemap99/).

We are grateful to Jack Liew for oligonucleotide synthesis, Wei Wei for assistance with automated sequencing, Robert Geisler and Gerd-Jörg Rauch for calculating map positions on the RH map and to everyone at the Cardiac Gene Unit for technical assistance. This work was supported by the Medical Research Council of Canada. C.T. is a recipient of a Heart and Stroke Foundation of Canada Traineeship. D.M.H. is a recipient of a Hunt Estate M.D./Ph.D. Studentship. A.A.D. is a recipient of a Heart and Stroke Foundation of Canada Traineeship. J. Y. is a recipient of a Heart and Stroke Foundation of Ontario Summer Student Scholarship.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.

Notes

[7] Corresponding author.

Notes

[8] E-MAIL [email protected]; FAX (617) 975-0995.

[9] Article and publication are at www.genome.org/cgi/doi/10.1101/gr.154000.

REFERENCES

  1. M.D. AdamsM. DubnickA.R. KerlavageR. MorenoK.T. KelleyT.R. UtterbackJ.W. NagleC. FieldsJ.C. Venter(1992) Sequence identification of 2,375 human brain genes. Nature 355:632–634.
  2. M.D. AdamsM.B. SoaresA.R. KerlavageC. FieldsJ.C. Venter(1993) Rapid cDNA sequencing (expressed sequence tags) from a directionally cloned human infant brain cDNA library. Nat. Genet. 4:373–380.
  3. M.D. AdamsA.R. KerlavageR.D. FleischmannR.A. FuldnerC.J. BultN.H. LeeE.F. KirknessK.G. WeinstockJ.D. GocayneO. White(1995) Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence. Nature (suppl) 377:3–174.
  4. S.F. AltschulW. GishW. MillerE.W. MyersD.J. Lipman(1990) Basic local alignment search tool. J. Mol. Biol. 215:403–410.
  5. S. AudicJ-M. Claverie(1997) The significance of digital gene expression profiles. Genome Res. 7:986–995.
  6. J.N. ChenP. HaffterJ. OdenthalE. VogelsangM. BrandF.J. van EedenM. Furutani-SeikiM. GranatoM. HammerschmidtC.P. Heisenberg(1996) Mutations affecting the cardiovascular system and other internal organs in zebrafish. Development 123:293–302.
  7. J.N. ChenF.J. van EedenK.S. WarrenA. ChinC. Nusslein-VolhardP. HaffterM.C. Fishman(1997) Left-right pattern of cardiac BMP4 may drive asymmetry of the heart in zebrafish. Development 21:4373–4382.
  8. P. ChomczynskiN. Sacchi(1987) Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal. Biochem. 162:156–159.
  9. P. DeloukasG.D. SchulerG. GyapayE.M. BeasleyC. SoderlundP. Rodriguez-ToméL. HuiT.C. MatiseK.B. McKusickJ.S. Beckmann(1998) A physical map of 30,000 human genes. Science 282:744–746.
  10. W. DrieverM.C. Fishman(1996) Heritable disorders in transparent embryos. J. Clin. Invest. 97:1788–1794.
  11. R.D. FleischmannM.D. AdamsO. WhiteR.A. ClaytonE.F. KirknessA.R. KerlavageC.J. BultJ.F. TombB.A. DoughertyJ.M. Merrick(1995) Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269:496–512.
  12. M.A. GatesL. KimE.S. EganT. CardozoH.I. SirtokinS.T. DouganD. LashkariR. AbagyanA.F. SchierW.S. Talbot(1999) A genetic linkage map for zebrafish: Comparative analysis and localization of genes and expressed sequences. Genome Res. 9:334–347.
  13. R. GeislerG.J. RauchH. BaierF. van BebberL. BrobetaM.P. DekensK. FingerC. FrickeM.A. GatesH. Geiger(1999) A radiation hybrid map of the zebrafish genome. Nat. Genet. 23:86–89.
  14. W. GishD.J. States(1993) Identification of protein coding regions by database similarity search. Nat. Genet. 3:266–272.
  15. Z. Gong(1999) Zebrafish expressed sequence tags and their applications. Methods Cell. Biol. 60:213–233.
  16. Z. GongT. YanJ. LiaoS.E. LeeJ. HeC.L. Hew(1997) Rapid identification and isolation of zebrafish cDNA clones. Gene 201:87–98.
  17. P.D. HayesK. SchmittH.B. JonesG. GyapayJ. WeissenbachP.N. Goodfellow(1996) Regional assignment of human ESTs by whole-genome radiation hybrid mapping. Mamm. Genome 7:446–450.
  18. J. Hohfeld(1998) Regulation of the heat shock conjugate Hsc70 in the mammalian cell: The characterization of the anti-apoptotic protein BAG-1 provides novel insights. Biol. Chem. 3:269–274.
  19. N.A. HukriedeL. JolyM. TsangJ. MilesP. TellisJ.A. EpsteinW.B. BarbazukF.N. LiB. PawJ.H. Postlethwait(1999) Radiation hybrid mapping of the zebrafish genome. Proc. Natl. Acad. Sci. 96:9745–9750.
  20. D.M. HwangW.S. HwangC.C. Liew(1994) Single pass sequencing of a unidirectional human fetal heart cDNA library to discover novel genes of the cardiovascular system. J. Mol. Cell. Cardiol. 26:1329–1333.
  21. D.M. HwangY.W. FungR.X. WangC. LaurenssenE. CukermanS. TsuiK.P. FungM. WayeC.Y. LeeC.C. Liew(1995) Analysis of expressed sequence tags from a fetal heart cDNA library. Genomics 30:293–298.
  22. D.M. HwangA.A. DempseyR.X. WangM. RezvaniJ.D. BarransK.S. DaiH.Y. WangH. MaE. CukermanY.Q. Liu(1997) A genome-based resource for molecular cardiovascular medicine: Towards a compendium of cardiovascular genes. Circulation 96:4146–4203.
  23. D.M. HwangA.A. DempseyC.Y. LeeC.C. Liew(2000) Identification of differentially expressed genes in cardiac hypertrophy by analysis of expressed sequence tags. Genomics 66:1–14.
  24. S.L. JohnsonS.L. GatesM. JohnsonW.S. TalbotS. HorneK. BaikS. RudeJ.R. WongJ.H. Postlethwait(1996) Centromere-linkage analysis and consolidation of the zebrafish genetic map. Genetics 142:1277–1288.
  25. E.W. KnapikA. GoodmanO.S. AtkinsonC.T. RobertsM. ShiozawaC.U. SimS. Weksler-ZangenM.R. TrollietC. FutrellB.A. Innes(1996) A reference cross DNA panel for zebrafish (Danio rerio) anchored with simple sequence length polymorphisms. Development 123:451–460.
  26. E.W. KnapikA. GoodmanM. EkkerM. ChevretteJ. DelgadoS. NeuhaussN. ShimodaW. DrieverM.C. FishmanH.J. Jacob(1998) A microsatellite genetic linkage map for zebrafish. Nat. Genet. 18:338–343.
  27. B.F. Koop(1995) Human and rodent DNA sequence comparisons: A mosaic model of genomic evolution. Trends Genet. 11:367–371.
  28. C. LaforetG. FellerE. NarinxC. Gerday(1991) Parvalbumin in the cardiac muscle of normal and haemoglobin-myoglobin-free antarctic fish. J. Muscle Res. Cell Motil. 5:472–478.
  29. C.C. LiewD.M. HwangY.W. FungC. LaurenssenE. CukermanS. TsuiC.Y. Lee(1994) A catalogue of genes in the cardiovascular system as identified by expressed sequence tags (ESTs). Proc. Natl. Acad. Sci. 91:10645–10649.
  30. W. MakalowskiM. Boguski(1998) Evolutionary parameters of the transcribed mammalian genome: An analysis of 2,820 orthologous rodent and human sequences. Proc. Natl. Acad. Sci. 95:9407–9421.
  31. W. MakalowskiJ. ZhangM.S. Boguski(1996) Comparative analysis of 1196 orthologous mouse and human full-length mRNA and protein sequences. Genome Res. 6:846–857.
  32. L.C. McCarthyJ. TerrettM.E. DavisC.J. KnightsA.L. SmithR. CritcherK. SchmittJ. HudsonN.K. SpurrP.N. Goodfellow(1997) A first-generation whole genome-radiation hybrid map spanning the mouse genome. Genome Res. 7:1153–1161.
  33. C. NiehrsN. Pollet(1999) Synexpression groups in eukaryotes. Nature 402:483–487.
  34. J.H. PostlethwaitW.S. Talbot(1997) Zebrafish genomics: From mutants to genes. TIG 13:183–190.
  35. J.H. PostlethwaitS.L. JohnsonC.N. MidsonW.S. TalbotM. GatesE.W. BallingerD. AfricaR. AndrewsT. CarlJ.S. Eisen(1994) A genetic linkage map for the zebrafish. Science 264:699–703.
  36. J.H. PostlethwaitY.-L. YanM.A. GatesS. HorneA. AmoresA. BrownlieA. DonovanE.S. EganA. ForceZ. Gong(1998) Vertebrate genome evolution and the zebrafish gene map. Nat. Genet. 18:345–349.
  37. J.H. PostlethwaitY.-L. YanM.A. Gates(1999) Using random amplified polymorphic DNAs in zebrafish genomic analysis. Methods Cell Biol. 60:165–179.
  38. T.F. SchillingJ.P. ConcordetP.W. Ingham(1999) Regulation of left-right asymmetries in the zebrafish by Shh and BMP4. Dev. Biol. 2:277–287.
  39. N. ShimodaE.W. KnapikJ. ZinitiC. SimE. YamadaS. KaplanD. JacksonF. de SauvageH. JacobM.C. Fishman(1999) Zebrafish genetic map with 2000 microsatellite markers. Genomics 58:219–232.
  40. D.Y. StainierB. FouquetJ.N. ChenK.S. WarrenB.M. WeinsteinS.E. MeilerM.A. MohideenS.C. NeuhaussL. Solnica-KrezelA.F. Schier(1996) Mutations affecting the formation and function of the cardiovascular system in the zebrafish embryo. Development 123:285–292.
  41. J.D. ThompsonT.J. GibsonF. PlewniakF. JeanmouginD.G. Higgins(1997) The CLUSTALX windows interface: Flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25:4876–4882.
  42. K.S. WarrenM.C. Fishman(1998) “Physiological genomics”: Mutants screens in zebrafish. Am. J. Physiol. 275:H1–H7.
Loading
Loading
Loading
Back to top