Surveying Saccharomyces Genomes to Identify Functional Elements by Comparative DNA Sequence Analysis

Table 1.

Summary of BLASTN Comparisons ofSaccharomyces Species' DNA Sequences to S. cerevisiae DNA Sequence

Species # of sequences % identity CODING % identity NON-CODING % of sequences aligning CODING/NON
S. paradoxus 728 88.3 81.6 77.9/36.3
S. cariacanus 867 88.0 80.5 77.5/39.1
S. mikatae 1136 83.6 74.8 79.9/27.9
S. bayanus 851 79.7 73.6 65.7/19.5
S. castellii 2290 70.3 62.5 48.9/1.7
S. kluyveri 2145 70.2 72.8 38.6/1.8
S. unisporus 2357 69.2 67.6 40.4/2.2
  • Coding sequences are all ORFs known or predicted to encode protein. The remaining yeast genomic sequences (minus the sequences encoding RNA genes, transposable elements, and CEN sequences) were considered non-coding. BLASTN comparisons of sensu stricto yeast species used MATCH/MISMATCH parameters M = 5, N = 5 with the default gap penalties. This change was made to more accurately reflect the similarity of the sequences and equals a PAM distance of ∼30 (States et al. 1991). Using these strict MATCH/MISMATCH parameters, only significant alignments were obtained. Default parameters were used for sensu lato and petite-negative comparisons toS. cerevisiae using a wordlength (W) of 5. The dust filter was used to mask low complexity nucleotide sequences. Alignments with aP-value <5.0e-06 were counted as significant.

  • Many of the alignments are to repetitive sequences (ribosomal rDNA repeat alignments were not counted).

This Article

  1. Genome Res. 11: 1175-1186

Preprint Server