Unusual composition of a yeast chromosome arm is associated with its delayed replication

  1. Célia Payen1,
  2. Gilles Fischer1,
  3. Christian Marck2,
  4. Caroline Proux3,
  5. David James Sherman4,
  6. Jean-Yves Coppée3,
  7. Mark Johnston5,
  8. Bernard Dujon1 and
  9. Cécile Neuvéglise6,7
  1. 1 Institut Pasteur, Unité de Génétique Moléculaire des Levures, CNRS (URA 2171), Université Pierre et Marie Curie, 75724 Paris Cedex 15, France;
  2. 2 Institut de Biologie et Technologies de Saclay (iBiTec-S), 91191 Gif-sur-Yvette Cedex, France;
  3. 3 Institut Pasteur, 75724 Paris Cedex 15, France;
  4. 4 INRIA Bordeaux Sud-Ouest (MAGNOME) and CNRS UMR 5800 (LaBRI), Université Bordeaux 1, 33405 Talence Cedex, France;
  5. 5 Department of Genetics, Washington University Medical School, St. Louis, Missouri 63110, USA;
  6. 6 INRA (UMR 1238) CNRS (UMR 2585) AgroParisTech, Laboratoire de Microbiologie et Génétique Moléculaire, 78850 Thiverval-Grignon, France

    Abstract

    The 11.3-Mb genome of the yeast Lachancea (Saccharomyces) kluyveri displays an intriguing compositional heterogeneity: a region of 1 Mb, covering almost the whole left arm of chromosome C (C-left), has an average GC content of 52.9%, which is significantly higher than the 40.4% global GC content of the rest of the genome. This region contains the MAT locus, which remains normal in composition. The excess of GC base pairs affects both coding and noncoding sequences, and thus is not due to selective pressure acting on protein sequences. It leads to a strong codon usage bias and alters the amino acid composition of the 457 proteins encoded on C-left that do not show obvious bias for functional categories, or the presence of paralogs or orthologs of essential genes of Saccharomyces cerevisiae. They share significant synteny conservation with other species of the Saccharomycetaceae, and phylogenetic analysis indicates that C-left originates from a Lachancea species. In contrast, there is a complete absence of transposable elements in C-left, whereas 18 elements per megabase are distributed across the rest of the genome. Comparative hybridization of synchronized cells using high-density genome arrays reveals that C-left is replicated later during S phase than the rest of the genome. Two possible primary causes of this major compositional heterogeneity are discussed: an ancient hybridization of two related species with very distinct GC composition, or an intrinsic mechanism, possibly associated with the loss of the silent cassettes from C-left that progressively increased the GC content and generated the delayed replication of this chromosomal arm.

    Footnotes

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server