Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomes
- Nitin Gupta1,5,
- Jamal Benhamida1,
- Vipul Bhargava1,
- Daniel Goodman1,
- Elisabeth Kain1,
- Ian Kerman2,
- Ngan Nguyen1,
- Noah Ollikainen1,
- Jesse Rodriguez1,
- Jian Wang1,
- Mary S. Lipton3,
- Margaret Romine3,
- Vineet Bafna1,4,
- Richard D. Smith3, and
- Pavel A. Pevzner1,4
- 1 Bioinformatics Program, University of California San Diego, La Jolla, California 92093, USA;
- 2 Division of Biology, University of California San Diego, La Jolla, California 92093, USA;
- 3 Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, USA;
- 4 Department of Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, USA
Abstract
Recent proliferation of low-cost DNA sequencing techniques will soon lead to an explosive growth in the number of sequenced genomes and will turn manual annotations into a luxury. Mass spectrometry recently emerged as a valuable technique for proteogenomic annotations that improves on the state-of-the-art in predicting genes and other features. However, previous proteogenomic approaches were limited to a single genome and did not take advantage of analyzing mass spectrometry data from multiple genomes at once. We show that such a comparative proteogenomics approach (like comparative genomics) allows one to address the problems that remained beyond the reach of the traditional “single proteome” approach in mass spectrometry. In particular, we show how comparative proteogenomics addresses the notoriously difficult problem of “one-hit-wonders” in proteomics, improves on the existing gene prediction tools in genomics, and allows identification of rare post-translational modifications. We therefore argue that complementing DNA sequencing projects by comparative proteogenomics projects can be a viable approach to improve both genomic and proteomic annotations.
Footnotes
-
↵5 Corresponding author.
↵5 E-mail ngupta{at}ucsd.edu; fax (858) 534-8499.
-
[Supplemental material is available online at www.genome.org.]
-
Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.074344.107.
-
- Received November 12, 2007.
- Accepted April 2, 2008.
- Copyright © 2008, Cold Spring Harbor Laboratory Press











