Table 1.

Position-Specific Amino Acid Frequencies (Expressed as a Percentage) at the Three C-Terminal Positions for Each of the Genomes Studied

AA M. jannaschii E. coli S. cerevisiae
−1 −2 −3 ORF −1 −2 −3 ORF −1 −2 −3 ORF
A2.41.92.75.410.29.47.69.55.24.24.85.5
C0.80.51.01.31.21.21.11.21.71.51.41.3
D4.03.24.35.54.13.74.15.15.15.25.45.8
E13.38.69.08.78.26.48.35.76.55.86.26.5
F4.55.43.94.33.62.83.43.95.35.85.54.5
G4.96.74.66.36.76.55.77.42.64.54.65.0
H1.41.11.21.44.32.62.72.32.72.32.32.2
I10.09.611.210.53.74.05.26.07.16.36.66.6
K17.217.614.810.410.98.77.04.411.510.98.87.3
L11.711.212.39.57.99.610.110.610.59.310.39.6
M0.62.62.32.31.32.11.72.82.21.92.32.1
N4.26.15.45.34.04.44.24.07.15.55.86.1
P1.51.42.03.42.93.24.74.42.32.93.64.3
Q3.71.91.31.46.65.44.74.44.44.04.13.9
R5.66.56.33.88.77.57.95.54.76.54.94.5
S3.84.65.24.56.27.05.95.86.78.78.39.0
T1.93.22.74.01.15.34.75.43.85.34.65.9
V2.94.25.56.84.86.36.57.15.34.65.75.6
W1.50.80.90.71.61.11.51.51.71.01.31.0
Y4.12.83.64.42.22.83.02.93.53.83.73.4
AA C. elegans A. thaliana H. sapiens
−1 −2 −3 ORF −1 −2 −3 ORF −1 −2 −3 ORF
A5.24.14.86.36.24.95.56.35.36.06.47.0
C2.72.22.02.12.62.22.01.83.22.72.42.2
D4.34.74.65.34.54.84.95.54.55.04.44.9
E5.75.55.46.55.15.85.16.85.16.96.87.1
F8.55.75.74.95.75.05.04.34.63.43.83.7
G3.04.44.25.33.75.54.96.43.76.26.16.8
H3.42.62.22.32.62.52.42.33.52.82.72.5
I6.46.16.36.25.45.15.15.34.53.73.64.4
K8.810.18.76.56.27.77.16.47.57.97.25.7
L9.38.18.78.710.28.69.99.511.18.59.29.9
M1.92.22.12.62.12.02.12.42.11.92.12.2
N7.46.65.14.95.14.54.24.44.03.83.33.7
P2.63.84.64.94.24.35.04.85.66.16.26.2
Q5.04.03.94.13.23.53.93.54.84.84.64.7
R4.56.75.95.26.67.26.35.45.26.45.65.6
S6.78.69.08.010.010.810.79.09.59.410.08.0
T3.24.86.55.84.65.65.45.14.95.26.45.3
V6.15.35.86.26.95.66.06.76.54.94.96.1
W1.11.41.11.11.51.51.41.31.41.51.61.2
Y4.33.13.43.23.73.13.22.93.12.82.62.7

[i] The fourth column for each organism (ORF) shows the amino acid frequencies across the entire ORF array.