Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function

  1. Monica A Grobei1,
  2. Ermir Qeli2,
  3. Erich Brunner2,
  4. Hubert Rehrauer3,
  5. Runxuan Zhang2,
  6. Bernd Roschitzki3,
  7. Konrad Basler2,
  8. Christian H Ahrens2 and
  9. Ueli Grossniklaus1,4
  1. 1 Institute of Plant Biology, University of Zurich;
  2. 2 Institute for Molecular Biology, University of Zurich;
  3. 3 Functional Genomics Center Zurich, ETH and University of Zurich
  1. 4 E-mail: grossnik{at}botinst.uzh.ch

Abstract

Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch requires relevant proteins to be stored in the mature pollen, where they have to retain functionality in a desiccated environment. Using a shotgun proteomics approach, we unambiguously identified about ~3,500 proteins in Arabidopsis pollen, including 537 proteins that were not identified in genetic or transcriptomic studies. To generate this comprehensive reference dataset, which extends the previously reported pollen proteome by a factor of 13, we developed a novel deterministic peptide classification scheme for protein inference. This generally applicable approach considers the gene model - protein sequence - protein accession relationships. It allowed us to classify and eliminate ambiguities inherently associated with any shotgun proteomics dataset, to report a conservative list of protein identifications, and to seamlessly integrate data from previous transcriptomics studies. Manual validation of proteins unambiguously identified by a single, information-rich peptide enabled us to significantly reduce the false discovery rate, while keeping valuable identifications of shorter and lower abundant proteins. Bioinformatic analyses revealed a higher stability of pollen proteins compared to those of other tissues and implied a protein family of previously unknown function in vesicle trafficking. Interestingly, the pollen proteome is most similar to that of seeds, indicating physiological similarities between these developmentally distinct tissues.

Footnotes

    • Received November 10, 2008.
    • Accepted June 16, 2009.
ACCEPTED MANUSCRIPT

Preprint Server