Widespread genome duplications throughout the history of flowering plants

  1. Liying Cui1,2,3,
  2. P. Kerr Wall1,2,3,
  3. James H. Leebens-Mack1,2,3,
  4. Bruce G. Lindsay5,
  5. Douglas E. Soltis6,
  6. Jeff J. Doyle8,
  7. Pamela S. Soltis7,
  8. John E. Carlson2,3,4,
  9. Kathiravetpilla Arumuganathan9,
  10. Abdelali Barakat1,2,3,
  11. Victor A. Albert10,
  12. Hong Ma1,2,3, and
  13. Claude W. dePamphilis1,2,3,11
  1. 1 Department of Biology,
  2. 2 Institute of Molecular Evolutionary Genetics,
  3. 3 Huck Institutes of the Life Sciences,
  4. 4 School of Forest Resources
  5. 5 Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
  6. 6 Department of Botany
  7. 7 Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611, USA;
  8. 8 Department of Plant Biology, Cornell University, Ithaca, New York 14853, USA;
  9. 9 Virginia Mason Research Center, Benaroya Research Institute, Seattle, Washington 98101, USA;
  10. 10 Natural History Museum, University of Oslo, NO-0318 Oslo, Norway

Abstract

Genomic comparisons provide evidence for ancient genome-wide duplications in a diverse array of animals and plants. We developed a birth–death model to identify evidence for genome duplication in EST data, and applied a mixture model to estimate the age distribution of paralogous pairs identified in EST sets for species representing the basal-most extant flowering plant lineages. We found evidence for episodes of ancient genome-wide duplications in the basal angiosperm lineages including Nuphar advena (yellow water lily: Nymphaeaceae) and the magnoliids Persea americana (avocado: Lauraceae), Liriodendron tulipifera (tulip poplar: Magnoliaceae), and Saruma henryi (Aristolochiaceae). In addition, we detected independent genome duplications in the basal eudicot Eschscholzia californica (California poppy: Papaveraceae) and the basal monocot Acorus americanus (Acoraceae), both of which were distinct from duplications documented for ancestral grass (Poaceae) and core eudicot lineages. Among gymnosperms, we found equivocal evidence for ancient polyploidy in Welwitschia mirabilis (Gnetales) and no evidence for polyploidy in pine, although gymnosperms generally have much larger genomes than the angiosperms investigated. Cross-species sequence divergence estimates suggest that synonymous substitution rates in the basal angiosperms are less than half those previously reported for core eudicots and members of Poaceae. These lower substitution rates permit inference of older duplication events. We hypothesize that evidence of an ancient duplication observed in the Nuphar data may represent a genome duplication in the common ancestor of all or most extant angiosperms, except Amborella.

Footnotes

  • 11

    11 Corresponding author.

    11 E-mail cwd3{at}psu.edu; fax (814) 865-9131.

  • [Supplemental material is available online at www.genome.org. ]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4825606

    • Received October 20, 2005.
    • Accepted March 27, 2006.
| Table of Contents

Preprint Server