Fungal-Specific Genes Present in the NGP Data Set
| Discontig | Identification | SC E-value[i] | Fungal distribution[ii] |
| III. Cell structure/cytoskeleton[iii] | |||
| 34[iv] | YOL030w[iv] | 3 × 10−54 | Sp, Ca, En |
| 439[iv] | Gas1p (YMR307wd)[iv] | 3 × 10−32 | Sp, Ca, En |
| 1133 | Pir1p (YKL164c) | 4 × 10−11 | Ca[v] |
| V. Metabolism | |||
| 1003 | Fth1p (YBR207w) | 3 × 10−26 | Sp, Ca |
| VII. RNA synthesis | |||
| 231 | Ecm22p (YLR228c) | 5 × 10−7 | Sp, Ca, En |
| 489[vi] | Sok2p (YMR016c) | 4 × 10−23 | Sp, Ca, En |
| Unclassified | |||
| 469 | YGR033c | 2 × 10−14 | Sp, Ca |
| 812 | YOL048c | 4 × 10−8 | Ca |
| 828 | YOR081c | 3 × 10−7 | Sp, Ca |
[i] E-value of best BLAST hit against S. cerevisiae data set. E-value of best BLAST hit against nonfungal datasets is >0.1.
[ii] Distribution of homologs within the fungi based on BLAST searches. (Sp) S. pombe; (Ca) C. albicans; (En)A. nidulans. None of these sequences have clear homologs in the sequences available from nonascomycete fungi.
[iii] Functional categories are as described in Nelson et al. (1997), based on a modification of the system described by White and Kerlavage (1996).
[iv] These sequences [N. crassa discontigs 34 and 439 and S. cerevisiae Gas1p (YMR307w) and YOL030w] correspond to paralogous fungal-specific genes encoding glycophospholipid-anchored surface proteins. Gas1p has a weak nonfungal hit, corresponding to a putative A. thaliana β-1,3-glucanase (for details, seePopolo and Vai 1999).
[v] The absence of a S. pombe Pir1p homolog may be more than incomplete sampling, because previous studies were unable to identify fragments hybridizing to the PIR1 gene in S. pombe (Toh-e et al. 1993).
[vi] Discontig 489 corresponded to the Asm-1 + gene, which has been shown to encode an APSES-domain transcription factor homologous to the S. cerevisiae Sok2p protein (Aramayo et al. 1996).