Table 4.

Expression/Solubility Data for the Most Common Pfam Domains (a)


Count (b) %Exp %Sol Domain length Pfam Description
60 78.3 23.3 104 Motile-Sperm Major sperm protein domain
34 70.6 17.6 111 Histone Core histone H2A/H2B/H3/H4
72 69.4 25.0 71 RRM-1 RNA recognition motif
38 68.4 2.6 70 GST-N GST, N-terminal
44 65.9 18.2 247 Adh-short Short chain dehydrogenase
48 64.6 4.2 163 Ras Ras family
63 57.1 3.2 38 WD40 WD domain, G-beta repeat
49 44.9 18.4 80 Helicase-C Helicase conserved C-terminal
38 44.7 10.5 56 Homeobox Homeobox domain
35 40.0 2.9 208 Metallophos Calcineurin-like phosphoesterase
56 39.3 5.4 106 BTB BTB/POZ domain
236 39.0 6.4 275 Pkinase Protein kinase domain
47 31.9 4.3 114 MATH MATH domain
35 31.4 2.9 117 DUF290 Transthyretin-like family
65 27.7 10.8 77 zf-C4 Zinc finger, C4 type
68 26.5 0 45 F-box F-box domain
56 25.0 0 142 FTH FTH domain
48 22.9 10.4 232 Y-phosphatase Protein-tyrosine phosphatase
41 22.0 9.8 195 Hormone-recep Ligand-binding domain of nuclear hormone receptor
46 10.9 0 212 Neur-chan-LBD Neurotransmitter-gated ion-channel
39 7.7 0 294 7tm-5 7TM chemoreceptor
41 7.3 0 202 Neur-chan-memb Neurotransmitter-gated ion-channel
105 6.7 1.9 59 Collagen Collagen triple helix repeat
70 2.9 1.4 61 Col-cuticle-N Nematode cuticle collagen N-terminal
57 1.8 0 37 ShTK ShTK domain

(a) The list is sorted by %Expressed in descending order. %Exp stands for %Expressed, and %Sol for %Soluble.

(b) A count of 33 corresponds to about 2% of the 1689 distinct Pfams in the studied sequences. The list contains all Pfams with a count above this 2% cutoff.
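
The cutoff arithmetic in the footnote can be checked with a minimal sketch. It assumes the cutoff is obtained by truncating 2% of 1689 to a whole number; the function name `count_cutoff` is hypothetical, and the counts are copied from the Count column of the table.

```python
# Minimal sketch: reproduce the 2% cutoff from the footnote and confirm
# that every count listed in the table lies above it.
# Assumption: the cutoff is 2% of 1689, truncated to a whole number.

TOTAL_DISTINCT_PFAMS = 1689  # from the footnote
CUTOFF_FRACTION = 0.02       # the 2% cutoff from the footnote


def count_cutoff(total: int, fraction: float) -> int:
    """Return the largest whole count not exceeding fraction * total."""
    return int(total * fraction)


# Count column of the table, in row order.
counts = [60, 34, 72, 38, 44, 48, 63, 49, 38, 35, 56, 236, 47,
          35, 65, 68, 56, 48, 41, 46, 39, 41, 105, 70, 57]

cutoff = count_cutoff(TOTAL_DISTINCT_PFAMS, CUTOFF_FRACTION)
all_above = all(c > cutoff for c in counts)

print(cutoff, min(counts), all_above)  # prints: 33 34 True
```

The smallest count in the table is 34, one above the truncated cutoff of 33, which agrees with the statement that the list contains every Pfam above the cutoff.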