The genetic code is nearly optimal for allowing additional information within protein-coding sequences

Click on image to view larger version.

Table 2.

Significance of the genetic code in representing arbitrary sequences

Click on table to view larger version.

Table 2.

Shown are the fractions of alternative codes for which the average of the logarithm of the probabilities of all n-mers is equal or higher to that of the real code. Also shown are the fraction of alternative genetic codes for which the average probability of the 20% most-difficult n-mer sequences is equal or higher than in the real genetic code. Similar results are obtained for larger fractions of the most difficult n-mer sequences. Results for n > 8 are based on 105 randomly sampled n-mers.

This Article

  1. Genome Res. 17: 405-412

Preprint Server