Repetitive sequence variation and dynamics in the ribosomal DNA array of Saccharomyces cerevisiae as revealed by whole-genome resequencing
- Stephen A. James1,4,
- Michael J.T. O'Kelly2,4,5,
- David M. Carter3,6,
- Robert P. Davey1,
- Alexander van Oudenaarden2 and
- Ian N. Roberts1,7
- 1National Collection of Yeast Cultures, Institute of Food Research, Norwich Research Park, Colney, Norwich NR4 7UA, United Kingdom;
- 2Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA;
- 3Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, United Kingdom
-
↵4 These authors contributed equally to this work.
Abstract
Ribosomal DNA (rDNA) plays a key role in ribosome biogenesis, encoding genes for the structural RNA components of this important cellular organelle. These genes are vital for efficient functioning of the cellular protein synthesis machinery and as such are highly conserved and normally present in high copy numbers. In the baker's yeast Saccharomyces cerevisiae, there are more than 100 rDNA repeats located at a single locus on chromosome XII. Stability and sequence homogeneity of the rDNA array is essential for function, and this is achieved primarily by the mechanism of gene conversion. Detecting variation within these arrays is extremely problematic due to their large size and repetitive structure. In an attempt to address this, we have analyzed over 35 Mbp of rDNA sequence obtained from whole-genome shotgun sequencing (WGSS) of 34 strains of S. cerevisiae. Contrary to expectation, we find significant rDNA sequence variation exists within individual genomes. Many of the detected polymorphisms are not fully resolved. For this type of sequence variation, we introduce the term partial single nucleotide polymorphism, or pSNP. Comparative analysis of the complete data set reveals that different S. cerevisiae genomes possess different patterns of rDNA polymorphism, with much of the variation located within the rapidly evolving nontranscribed intergenic spacer (IGS) region. Furthermore, we find that strains known to have either structured or mosaic/hybrid genomes can be distinguished from one another based on rDNA pSNP number, indicating that pSNP dynamics may provide a reliable new measure of genome origin and stability.
Footnotes
-
↵5 Present addresses: Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
-
6 Autonomy Corporation, Cambridge Business Park, Cambridge CB4 0WZ, UK.
-
↵7 Corresponding author.
↵E-mail ian.roberts{at}bbsrc.ac.uk; fax 44-(0)1603-458414.
-
[Supplemental material is available online at www.genome.org.]
-
Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.084517.108.
-
- Received August 15, 2008.
- Accepted December 23, 2008.
- Copyright © 2009 by Cold Spring Harbor Laboratory Press











