Table 1.

Summary of BLASTN Comparisons ofSaccharomyces Species' DNA Sequences to S. cerevisiae DNA Sequence

Species # of sequences % identity CODING % identity NON-CODING % of sequences aligning CODING/NON
S. paradoxus 72888.381.677.9/36.3
S. cariacanus 86788.080.577.5/39.1
S. mikatae 113683.674.879.9/27.9
S. bayanus 85179.773.665.7/19.5
S. castellii 229070.362.5[ii] 48.9/1.7[ii]
S. kluyveri 214570.272.8[ii] 38.6/1.8[ii]
S. unisporus 235769.267.6[ii] 40.4/2.2[ii]

[i] Coding sequences are all ORFs known or predicted to encode protein. The remaining yeast genomic sequences (minus the sequences encoding RNA genes, transposable elements, and CEN sequences) were considered non-coding. BLASTN comparisons of sensu stricto yeast species used MATCH/MISMATCH parameters M = 5, N = 5 with the default gap penalties. This change was made to more accurately reflect the similarity of the sequences and equals a PAM distance of ∼30 (States et al. 1991). Using these strict MATCH/MISMATCH parameters, only significant alignments were obtained. Default parameters were used for sensu lato and petite-negative comparisons toS. cerevisiae using a wordlength (W) of 5. The dust filter was used to mask low complexity nucleotide sequences. Alignments with aP-value <5.0e-06 were counted as significant.

[ii] Many of the alignments are to repetitive sequences (ribosomal rDNA repeat alignments were not counted).