Table 3.

Fungal-Specific Genes Present in the NGP Data Set

Discontig Identification SC E-value[i] Fungal distribution[ii]
III. Cell structure/cytoskeleton[iii]
   34[iv] YOL030w[iv] 3 × 10−54 Sp, Ca, En
  439[iv] Gas1p (YMR307wd)[iv] 3 × 10−32 Sp, Ca, En
 1133Pir1p (YKL164c)4 × 10−11 Ca[v]
V. Metabolism
 1003Fth1p (YBR207w)3 × 10−26 Sp, Ca
VII. RNA synthesis
  231Ecm22p (YLR228c)5 × 10−7 Sp, Ca, En
  489[vi] Sok2p (YMR016c)4 × 10−23 Sp, Ca, En
Unclassified
  469YGR033c2 × 10−14 Sp, Ca
  812YOL048c4 × 10−8 Ca
  828YOR081c3 × 10−7 Sp, Ca

[i] E-value of best BLAST hit against S. cerevisiae data set. E-value of best BLAST hit against nonfungal datasets is >0.1.

[ii] Distribution of homologs within the fungi based on BLAST searches. (Sp) S. pombe; (Ca) C. albicans; (En)A. nidulans. None of these sequences have clear homologs in the sequences available from nonascomycete fungi.

[iii] Functional categories are as described in Nelson et al. (1997), based on a modification of the system described by White and Kerlavage (1996).

[iv] These sequences [N. crassa discontigs 34 and 439 and S. cerevisiae Gas1p (YMR307w) and YOL030w] correspond to paralogous fungal-specific genes encoding glycophospholipid-anchored surface proteins. Gas1p has a weak nonfungal hit, corresponding to a putative A. thaliana β-1,3-glucanase (for details, seePopolo and Vai 1999).

[v] The absence of a S. pombe Pir1p homolog may be more than incomplete sampling, because previous studies were unable to identify fragments hybridizing to the PIR1 gene in S. pombe (Toh-e et al. 1993).

[vi] Discontig 489 corresponded to the Asm-1 + gene, which has been shown to encode an APSES-domain transcription factor homologous to the S. cerevisiae Sok2p protein (Aramayo et al. 1996).