Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Published online before print January 2, 2007, 10.1101/gr.5646507
Genome Res. 17:231-239, 2007
©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow All Versions of this Article:
gr.5646507v1
gr.5646507v2
17/2/231    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Tanner, S.
Right arrow Articles by Bafna, V.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tanner, S.
Right arrow Articles by Bafna, V.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Methods

Improving gene annotation using peptide mass spectrometry

Stephen Tanner1,6, Zhouxin Shen2, Julio Ng1, Liliana Florea3, Roderic Guigó4, Steven P. Briggs2, and Vineet Bafna5

1 Bioinformatics Program, University of California, San Diego, La Jolla, California 92093-0419, USA; 2 Department of Biology, University of California, San Diego, La Jolla, California 92093-0346, USA; 3 Department of Computer Science, George Washington University, Washington, DC 20052, USA; 4 Centre de Regualció Genòmica, 08003 Barcelona, Spain; 5 Department of Computer Science and Engineering, University of California, San Diego, La Jolla, California 92093-0404, USA

Annotation of protein-coding genes is a key goal of genome sequencing projects. In spite of tremendous recent advances in computational gene finding, comprehensive annotation remains a challenge. Peptide mass spectrometry is a powerful tool for researching the dynamic proteome and suggests an attractive approach to discover and validate protein-coding genes. We present algorithms to construct and efficiently search spectra against a genomic database, with no prior knowledge of encoded proteins. By searching a corpus of 18.5 million tandem mass spectra (MS/MS) from human proteomic samples, we validate 39,000 exons and 11,000 introns at the level of translation. We present translation-level evidence for novel or extended exons in 16 genes, confirm translation of 224 hypothetical proteins, and discover or confirm over 40 alternative splicing events. Polymorphisms are efficiently encoded in our database, allowing us to observe variant alleles for 308 coding SNPs. Finally, we demonstrate the use of mass spectrometry to improve automated gene prediction, adding 800 correct exons to our predictions using a simple rescoring strategy. Our results demonstrate that proteomic profiling should play a role in any genome sequencing project.


6 Corresponding author.

E-mail stanner{at}ucsd.edu; fax (858) 534-7029

[Supplemental material is available online at www.genome.org.]

Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5646507


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Physiol. GenomicsHome page
E. W. Deutsch, H. Lam, and R. Aebersold
Data analysis and bioinformatics tools for tandem mass spectrometry in proteomics
Physiol Genomics, October 8, 2008; 33(1): 18 - 25.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
N. Gupta, J. Benhamida, V. Bhargava, D. Goodman, E. Kain, I. Kerman, N. Nguyen, N. Ollikainen, J. Rodriguez, J. Wang, et al.
Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomes
Genome Res., July 1, 2008; 18(7): 1133 - 1142.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Bandeira, J. V. Olsen, M. Mann, and P. A. Pevzner
Multi-spectra peptide sequencing and its applications to multistage mass spectrometry
Bioinformatics, July 1, 2008; 24(13): i416 - i423.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
K. Baerenfaller, J. Grossmann, M. A. Grobei, R. Hull, M. Hirsch-Hoffmann, S. Yalovsky, P. Zimmermann, U. Grossniklaus, W. Gruissem, and S. Baginsky
Genome-Scale Proteomics Reveals Arabidopsis thaliana Gene Models and Proteome Dynamics
Science, May 16, 2008; 320(5878): 938 - 941.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
M. B. Lucitt, T. S. Price, A. Pizarro, W. Wu, A. K. Yocum, C. Seiler, M. A. Pack, I. A. Blair, G. A. FitzGerald, and T. Grosser
Analysis of the Zebrafish Proteome during Embryonic Development
Mol. Cell. Proteomics, May 1, 2008; 7(5): 981 - 994.
[Abstract] [Full Text] [PDF]


Home page
Brief Funct Genomic ProteomicHome page
C. Ansong, S. O. Purvine, J. N. Adkins, M. S. Lipton, and R. D. Smith
Proteogenomics: needs and roles to be filled by proteomics in genome annotation
Brief Funct Genomic Proteomic, March 10, 2008; (2008) eln010v1.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2007 by Cold Spring Harbor Laboratory Press.