Annotation Transfer for Genomics: Measuring Functional Divergence in Multi-Domain Proteins

Table 2.

Most Versatile Single-Domain Superfamilies

No. func  No. prot  Sfam     Function comb   SWISS-PROT ID  SWISS-PROT function
11        69        3.38.1   E1.11.1         GSHP_RAT       Plasma Glutathione Peroxidase (1.11.1.9)
                             263#            DYL5_CHLRE     Dynein, Flagellar Outer Arm–C. reinhardtii
                             D260#           BSAA_BACSU     Glutathione Peroxidase Homolog Bsaa
                             268#            REHY_TORRU     Rehydrin–Tortula ruralis (Moss)
                             266#            PHOS_HUMAN     Phosducin (33 Kd Phototransducing Protein)
                             269#            REHY_ORYSA     Rad24 Protein–Oryza sativa (Rice)
                             272#            THIO_BPT4      Thioredoxin (Bacteriophage T4)
                             D271#272#       TDX2_BRUMA     Thioredoxin Peroxidase 2
                             261#            BTUE_ECOLI     Vitamin B12 Transport Periplasmic Protein Btue

10        28        7.3.6    342#            BRAZ_PENBA     Brazzein–Pentadiplandra brazzeana
                             376#336#        SCKK_TITSE     Neurotoxin Ts-Kapa (Tsk)–(Brazilian scorpion)
                             341#356#        AF2B_SINAL     Cysteine-Rich Antifungal Protein 2b (Afp2b)
                             343#            DEFA_ZOPAT     Defensin, Isoforms B And C–Zophobas atratus
                             361#            DMYC_DROME     Drosomycin Precursor (Cysteine-Rich Peptide)
                             361#376#        SCX5_BUTEU     Insectotoxin I5a–(Lesser Asian scorpion)
                             336#            SCX3_LEIQH     Leiuropeptide Iii–(Scorpion)
                             203#            SIA1_SORBI     Small-Pr Inhibitor Of Insect Alpha-Amylases

7         34        4.79.3   310#            AB18_PEA       Aba-Responsive Protein Abr18–Garden Pea
                             311#            DRR3_PEA       Disease Resistance Response Protein Pi49
                             231#            MPAA_CORAV     Major Pollen Allergen Cor A 1,–Eu. Hazel
                             312#            L18B_LUPLU     Protein L1r18b (LIpr10.1b)
                             E3.1.–          RNS2_PANGI     Ribonuclease 2 (3.1.–/–)–Panax Ginseng
                             314#            SAM2_SOYBN     Stress-Induced Protein Sam22

7         43        1.26.1   184#            CSF2_SHEEP     Colony-Stimulating Factor
                             381#564#184#    IL4_RAT        Interleukin-4 (B-Cell Igg Diff. Factor)
                             185#            LIF_HUMAN      Leukemia Inhibitory Factor (Lif)
                             187#            PRL_ANGAN      Prolactin Precursor (Prl)–
                             186#            PLF3_MOUSE     Proliferin 3 Mitogen-Regulated
                             188#            SOMA_PAROL     Somatotropin (Growth Hormone)
  • The most versatile superfamilies in single-domain proteins, as determined from their functional descriptions in SWISS-PROT, with some representatives. The keyword combinations in the fourth column were based either on the first three components of their EC numbers (for enzymes) or derived automatically by comparing the DE description line of SWISS-PROT entries to a list of synonymous keywords at http://bioinfo.mbb.yale.edu/partslist/func. A keyword number starting with a D indicates an enzyme that does not have an assigned EC number in its description in SWISS-PROT.
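The keyword-combination scheme described in the caption can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the `SYNONYMS` table, its keyword numbers, the `function_key` name, and the `is_enzyme` flag are all hypothetical, and matching is done by simple substring search. The `E` prefix for EC-based keys and the `#` separator follow the notation seen in the fourth column of the table.

```python
# Hypothetical synonym list: keyword number -> synonymous terms.
# The real list is the one referenced in the caption; these entries
# and numbers are invented purely for illustration.
SYNONYMS = {
    901: ("thioredoxin",),
    902: ("peroxidase", "peroxiredoxin"),
    903: ("defensin",),
}


def function_key(description, ec=None, is_enzyme=False):
    """Build a function key for one SWISS-PROT entry (illustrative sketch).

    description: text of the DE line.
    ec: EC number string such as "1.11.1.9", or None if not assigned.
    is_enzyme: assumed to be supplied by the caller for enzymes lacking an EC number.
    """
    if ec:
        # Enzymes with an EC number: keep only the first three components.
        return "E" + ".".join(ec.split(".")[:3])

    # Otherwise collect every keyword number whose synonyms occur in the DE line.
    text = description.lower()
    hits = [num for num, terms in SYNONYMS.items()
            if any(term in text for term in terms)]
    key = "".join(f"{num}#" for num in hits)

    # A leading D marks an enzyme without an assigned EC number.
    if is_enzyme and key:
        key = "D" + key
    return key


if __name__ == "__main__":
    print(function_key("Plasma Glutathione Peroxidase", ec="1.11.1.9"))
    print(function_key("Thioredoxin Peroxidase 2", is_enzyme=True))
```

Under this scheme, two proteins in the same superfamily count as functionally distinct when their keys differ, which is how a per-superfamily count of functions like the first column of the table could be obtained.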

This Article

  1. Genome Res. 11: 1632-1640
