Discovery and annotation of small proteins using genomics, proteomics, and computational approaches

Figure 2.

P. deltoides small protein-coding candidate genes enriched from transcription units. (A) Number of genes in different sORF candidate subsets. (B) Proportion of each sORF subset with known protein domains detected by InterProScan. Subset A contains the sORF candidates with high protein-coding potential, predicted using known proteins as training sequences. Subset B contains sORF candidates conserved between P. deltoides and at least one other plant species. Subset C contains sORF candidates clustered into families. (Initial) The initial sORF candidate set (Fig. 1). (AB) The intersection of Subsets A and B. (ABC) The intersection of Subsets A, B, and C, i.e., the high-confidence sORF candidate set. The value in parentheses represents the number of sORFs in each individual subset.
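
The AB and ABC sets described in the caption are plain set intersections. The following is a minimal sketch of that bookkeeping, not the authors' pipeline: the function name `intersect_subsets` and all candidate IDs are hypothetical, invented purely for illustration, and the sketch assumes each subset is available as a collection of unique sORF identifiers.

```python
def intersect_subsets(subset_a, subset_b, subset_c):
    """Return the AB and ABC intersections of three sORF candidate ID sets."""
    # AB: candidates present in both Subset A and Subset B.
    ab = set(subset_a) & set(subset_b)
    # ABC: candidates present in all three subsets.
    abc = ab & set(subset_c)
    return ab, abc


# Hypothetical candidate IDs (assumption: not taken from the article).
subset_a = {"sORF_001", "sORF_002", "sORF_003", "sORF_004"}
subset_b = {"sORF_002", "sORF_003", "sORF_004", "sORF_005"}
subset_c = {"sORF_003", "sORF_004", "sORF_006"}

ab, abc = intersect_subsets(subset_a, subset_b, subset_c)
print(sorted(ab))
print(sorted(abc))
```

With real data, the counts reported per subset would correspond to `len(subset_a)`, `len(ab)`, and `len(abc)`.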

This Article

  1. Genome Res. 21: 634-641
