Use of shotgun proteomics for the identification, confirmation, and correction of C. elegans gene annotations

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 3.
Figure 3.

Classification of proteins identified from existing or new coding sequences. From the total 6779 proteins identified, 6350 were identified based on the protein-coding genes from WormBase WS150, and 429 proteins were identified using either new GeneFinder predictions, the conserved intergenic data set, or both. From the 429 new proteins, 33 mapped to predicted pseudogenes in WS150. Of the 33 predicted pseudogenes, 18.2% have been confirmed by RT-PCR. We have identified 151 misannotated protein sequences, and 56.9% of these new coding sequences have RT-PCR confirmation. The last category represents 245 novel or unknown coding sequences of which 40.8% have RT-PCR confirmation.

This Article

  1. Genome Res. 18: 1660-1669

Preprint Server