Functional insights from the distribution and role of homopeptide repeat-containing proteins

Table 1.

Number of homopeptide repeats and RCPs in GENPEPT, Eukaryotes, and Prokaryotes



               GENPEPT             Eukaryote           Prokaryote          Other (viruses/environmental sequences)
Amino acid     Repeats   Proteins  Repeats   Proteins  Repeats   Proteins  Repeats   Proteins
Alanine        6132      5045      5465      4425      251       250       416       370
Valine         149       117       94        83        9         9         46        25
Leucine        1638      1602      1446      1426      70        70        122       106
Isoleucine     57        56        34        33        3         3         20        20
Proline        4837      3931      4157      3333      217       184       463       414
Methionine     27        22        19        18        0         0         8         4
Phenylalanine  196       186       175       172       1         1         20        13
Tryptophan     3         3         3         3         0         0         0         0
Glycine        5981      5020      5002      4168      310       281       669       571
Serine         6383      5463      5424      4742      378       258       581       463
Threonine      2997      2415      2492      1984      63        59        442       372
Cysteine       64        52        38        38        0         0         26        14
Asparagine     7126      3731      6962      3597      31        29        133       105
Glutamine      8334      5699      8022      5464      52        51        260       184
Tyrosine       56        51        39        38        4         4         13        9
Aspartic Acid  1835      1707      1554      1451      34        34        247       222
Glutamic Acid  4779      4302      4334      3912      67        61        378       329
Lysine         2081      1926      1920      1774      25        25        136       127
Arginine       751       714       462       443       60        57        229       214
Histidine      1140      1061      1049      971       32        32        59        58
Total          54566     37355     48691     32628     1607      1388      4268      3339
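
The counts in Table 1 lend themselves to a quick arithmetic check. The sketch below is illustrative only and is not part of the original analysis: it transcribes the GENPEPT columns, confirms that the per-residue repeat counts add up to the listed total, and derives the mean number of repeats per repeat-containing protein for each residue. The names `GENPEPT`, `LISTED_TOTAL_REPEATS`, and `repeats_per_protein` are hypothetical. The Proteins column is deliberately not summed: its per-residue counts add up to 43,103, more than the listed 37,355, which would be expected if a protein carrying repeats of several different residues is counted once in the total (an assumption; the table itself does not say).

```python
# Counts transcribed from the GENPEPT columns of Table 1:
# residue -> (number of repeats, number of proteins containing them).
GENPEPT = {
    "Alanine": (6132, 5045),
    "Valine": (149, 117),
    "Leucine": (1638, 1602),
    "Isoleucine": (57, 56),
    "Proline": (4837, 3931),
    "Methionine": (27, 22),
    "Phenylalanine": (196, 186),
    "Tryptophan": (3, 3),
    "Glycine": (5981, 5020),
    "Serine": (6383, 5463),
    "Threonine": (2997, 2415),
    "Cysteine": (64, 52),
    "Asparagine": (7126, 3731),
    "Glutamine": (8334, 5699),
    "Tyrosine": (56, 51),
    "Aspartic Acid": (1835, 1707),
    "Glutamic Acid": (4779, 4302),
    "Lysine": (2081, 1926),
    "Arginine": (751, 714),
    "Histidine": (1140, 1061),
}

# "Total" row of Table 1, GENPEPT repeats column.
LISTED_TOTAL_REPEATS = 54566


def repeats_per_protein(counts):
    """Mean number of repeats per repeat-containing protein, by residue."""
    return {
        residue: repeats / proteins
        for residue, (repeats, proteins) in counts.items()
        if proteins > 0  # skip rows with no proteins (none in this column)
    }


total_repeats = sum(repeats for repeats, _ in GENPEPT.values())
ratios = repeats_per_protein(GENPEPT)

print("repeat total matches Table 1:", total_repeats == LISTED_TOTAL_REPEATS)
# Show the three residues with the most repeats per protein.
for residue in sorted(ratios, key=ratios.get, reverse=True)[:3]:
    print(f"{residue}: {ratios[residue]:.2f} repeats per protein")
```

The same dictionary layout could be reused for the Eukaryote, Prokaryote, and Other columns; the `proteins > 0` guard matters there, since several of those cells are zero.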

Genome Res. 15: 537-551
