A Random Sequencing Approach for the Analysis of the Trypanosoma cruzi Genome: General Structure, Large Gene and Repetitive DNA Families, and Gene Discovery

Table 2.

Identification of New T. cruzi Genes

dbGSS Description Score Expect
11462 ref-NP_011059.1-GLC7-protein phosphatase type I [Saccharomyces cerevisiae] 474 0.00E+00
10993 sp-P22679-elongation factor TU (EF-TU) [Mycoplasma hominis] 312 0.00E+00
00184 sp-P46794-cystathionine beta-synthase [Dictyostelium discoideum] 303 0.00E+00
11438 sp-O76767-lumen protein retaining receptor [Drosophila melanogaster] 208 0.00E+00
11472 emb-CAB56598.1-alpha dynein heavy chain [Chlamydomonas reinhardtii] 202 0.00E+00
11026 gi-2425121-Spalten [Dictyostelium discoideum] 116 0.00E+00
12036 gb-AAF19802.1-N-myristoyl transferase [Brassica oleracea] 107 0.00E+00
01761 gi-3004644-trypanothione synthetase [Crithidia fasciculata] 428 5.00E-42
11137 gb-AAB67249.1-T-complex protein 1, Beta subunit [Homo sapiens] 424 9.00E-42
11193 ref-NP_014850.1-RET1-second-largest subunit of RNA polymerase III [Saccharomyces cerevisiae] 422 1.00E-41
11590 gb-AAF08387.1-26S proteasome regulatory complex subunit p48A [Drosophila melanogaster] 419 2.00E-41
11120 emb-CAA65384-malate dehydrogenase [Mesembryanthemum crystallinum] 371 2.00E-35
11285 gi-1931649-DNA helicase isolog [Arabidopsis thaliana] 353 2.00E-33
11563 ref-NP_013458.1-transaldolase [Saccharomyces cerevisiae] 348 7.00E-33
09938 gi-2246458-S-adenosyl-methionine-sterol-C-methyltransferase [Ricinus communis] 348 8.00E-33
0705 pir-T1017324-sterol C-methyltransferase-castor bean [Ricinus communis] 348 9.00E-33
11338 gb-AAC02737.1-3-hydroxyisobutyryl-coenzyme A hydrolase [Arabidopsis thaliana] 341 5.00E-32
11467 gb-AAF04493.1-acetyl-CoA carboxylase 1 [Toxoplasma gondii] 330 7.00E-31
10956 dbj-BAA84364.1-DEIH-box RNA/DNA helicase [Arabidopsis thaliana] 328 2.00E-30
11575 gb-AAC73040.1-putative AAA-type ATPase [Arabidopsis thaliana] 323 5.00E-30
11516 sp-P05439-ATP synthase alpha chain [Rhodobacter blasticus] 319 2.00E-29
11606 sp-P51044-citrate synthase, mitochondrial precursor [Aspergillus niger] 321 2.00E-28
11417 gb-AAF21464.1-proline oxidase 2 [Homo sapiens] 310 2.00E-28
11502 gi-2654103-MAPKK kinase [Neurospora crassa] 304 9.00E-28
11523 gb-AAD26855.1-phenylalanyl tRNA synthetase beta subunit [Mus musculus] 301 2.00E-27
11568 gi-4101722-histone deacetylase mHDA1 [Mus musculus] 301 2.00E-27
11480 gi-2462752-phosphatidylinositol 3-kinase [Arabidopsis thaliana] 299 4.00E-27
11463 sp-P32826-serine carboxypeptidase precursor [Arabidopsis thaliana] 299 4.00E-27
10965 gi-687208-dynein heavy chain isotype 5C [Tripneustes gratilla] 289 5.00E-26
11328 pir-A56220-protein kinase aurora-fruit fly [Drosophila melanogaster] 287 1.00E-25
11446 sp-O15228-dihydroxyacetone phosphate acyltransferase (DAP-AT) [Homo sapiens] 280 7.00E-25
11433 gb-AAF11511.1-acetyl-CoA acetyltransferase [Deinococcus radiodurans] 279 9.00E-25
11601 sp-P30575-enolase 1 (2-phosphoglycerate dehydratase) [Candida albicans] 278 1.00E-24
01810 sp-O94476-eukaryotic translation initiation factor 6 (EIF-6) [Schizosaccharomyces pombe] 275 3.00E-24
0740 gb-AAF62506.1-ribosomal protein L5 [Trypanoplasma borreli] 273 5.00E-24
11274 pir-S70896-aminomethyltransferase [Saccharomyces cerevisiae] 271 5.00E-24
11279 gi-780410-helicase [African swine fever virus] 272 6.00E-24
11882 sp-Q07405-ATP synthase alpha chain [Myxococcus xanthus] 269 1.00E-23
11432 emb-CAB40791.1-centrin [Euplotes octocarinatus] 265 3.00E-23
11654 gi-1872473-delta-24-sterol methyltransferase [Triticum aestivum] 258 7.00E-23
01015 pir-A56492-protein kinase ERK2 [Dictyostelium discoideum] 280 8.00E-23
11584 ref-NP_005678.1-phenylalanyl-tRNA synthetase beta-subunit [Homo sapiens] 259 1.00E-22
11434 sp-O05593-methionyl-tRNA synthetase [Mycobacterium tuberculosis] 254 6.00E-22
11038 gi-1354084-axonemal dynein light chain p33 [Strongylocentrotus purpuratus] 251 2.00E-21
11440 gi-2665637-mismatch repair protein MSH6 [Mus musculus] 248 4.00E-21
11248 gb-AAF22155.1-ARD-1 N-acetyltransferase homologue [Mus musculus] 244 1.00E-20
11852 gb-AAC32590.1-sperm flagellar protein Repro-SA-1 [Homo sapiens] 239 5.00E-20
01825 dbj-BAA20996-kinesin-like protein [Caenorhabditis elegans] 239 6.00E-20
11210 pir-A35630-regulatory protein algR3 [Pseudomonas aeruginosa] 237 7.00E-20
  • GSS sequences were used to search NCBI's non-redundant database using BLASTX. The first 50 of the 947 GSSs showing matches against non-T. cruzi sequences are listed. Detailed information about the homologies found for the GSSs is available at http://www.iib.unsam.edu.ar/genomelab/tcruzi/gss.html.

  • GSS names in dbGSS are the numbers given here preceded by GSSTc (e.g., GSSTc11210).

  • Descriptions are taken directly from the BLAST reports.
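
The row layout and the naming rule described in the notes above can be illustrated with a minimal Python sketch. It assumes each row has the form "number, description, [organism], score, expect" as printed in the table. The names `Hit`, `ROW`, and `parse_row` are hypothetical helpers introduced here for illustration and are not part of the original analysis.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the table (hypothetical container, for illustration only)."""
    gss_name: str      # dbGSS name: the table number preceded by "GSSTc"
    description: str   # description as taken from the BLAST report
    organism: str      # organism given in square brackets
    score: int         # BLASTX score
    expect: float      # BLASTX expect value


# Assumed row layout: number, description, [organism], score, expect.
ROW = re.compile(r"^(\d+)\s+(.+?)\s+\[([^\]]+)\]\s+(\d+)\s+(\S+)$")


def parse_row(line: str) -> Hit:
    """Split one table row into its fields and build the dbGSS name."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"row does not match the assumed layout: {line!r}")
    number, description, organism, score, expect = match.groups()
    # Keep the number as text so that leading zeros (e.g. 00184) survive.
    return Hit("GSSTc" + number, description, organism, int(score), float(expect))


hit = parse_row(
    "11210 pir-A35630-regulatory protein algR3 [Pseudomonas aeruginosa] 237 7.00E-20"
)
print(hit.gss_name, hit.organism, hit.score, hit.expect)
```

Rows whose organism bracket is malformed in the printed table would not match this pattern and would raise a `ValueError` rather than being silently misparsed.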

This Article

  1. Genome Res. 10: 1996-2005
