Large-Scale Proteomic Analysis of the Human Spliceosome

Table 2. Novel Proteins

Acc. No. Name Comments
Novel proteins and proteins with unclear functions with sequence similarities implicating them in splicing/mRNA processing
ENSP00000295270 Hypothetical protein Similar to U5 snRNP 200 kDa
ENSP00000272417 CDNA FLJ13778 fis Similar to U5 snRNP 200 kDa
ENSP00000301345 Hypothetical protein Similar to U5 snRNP 220 kDa
TREMBL: Q9NUY0 CDNA FLJ11063 fis Similar to arginine/serine-rich 4
SWISS-PROT: Q13523 Serine/threonine-protein kinase Ser/Thr protein kinase family, similar to S. pombe PRP4
ENSP00000296630 Hypothetical protein RRM domain, bipartite NLS, similar to arginine/serine-rich 11
ENSP00000266057 CDNA FLJ10998 fis Similar to RNA lariat debranching enzyme
ENSP00000273541 Hypothetical protein Similar to Isy1p, a potential splicing factor in yeast
XP_013029 Hypothetical protein Similar to U2 snRNP A′
ENSP00000286032 Hypothetical protein Similar to hnRNP A3
ENSP00000301786 Hypothetical protein Similar to hnRNP U
ENSP00000301784 Hypothetical protein Similar to hnRNP U
ENSP00000261832 Hypothetical protein DKFZp434E2220 BASIC, basic domain in HLH proteins of MYOD family, PSP, proline-rich domain in spliceosome-associated proteins, zinc finger CCHC, zinc knuckle
ENSP00000244367 CGI-124 protein Cyclophilin-type peptidyl-prolyl cis-trans isomerase
ENSP00000215824 CYP-60 Cyclophilin-type peptidyl-prolyl cis-trans isomerase
ENSP00000234288 PPIL3b Cyclophilin-type peptidyl-prolyl cis-trans isomerase
ENSP00000282972 Serologically defined colon cancer antigen 10 Cyclophilin-type peptidyl-prolyl cis-trans isomerase, bipartite NLS
SWISS-PROT: Q9UNP9 Cyclophilin E Cyclophilin-type peptidyl-prolyl cis-trans isomerase, RRM domain
ENSP00000261308 KIAA0073 protein Cyclophilin-type peptidyl-prolyl cis-trans isomerase, G-protein beta WD-40 repeats
SWISS-PROT: Q92841 Probable RNA-dependent helicase p72 DEAD/DEAH-box helicase
ENSP00000274514 RNA helicase DEAD/DEAH-box helicase
ENSP00000242776 Hypothetical protein Similar to nuclear RNA helicase, DECD variant of DEAD-box helicase family
SWISS-PROT: Q92499 DDX1 DEAD/DEAH-box helicase, SPRY domain
SWISS-PROT: Q9NR30 DDX21 DEAD/DEAH-box helicase, bipartite NLS
SWISS-PROT: Q9UJV9 DEAD-box protein abstract homolog DEAD/DEAH-box helicase, zinc finger CCHC type
ENSP00000218971 DDX26 DEAD-box, von Willebrand factor type A domain
SWISS-PROT: P38919 Eukaryotic initiation factor 4A-like NUK-34 DEAD-box helicase
ENSP00000297920 Hypothetical protein FLJ11307 Double-stranded RNA-binding domain (DsRBD)
ENSP00000263115 Hypothetical protein G-patch domain
ENSP00000277477 Far upstream element (FUSE) binding protein 3 KH domain
ENSP00000295749 KIAA1604 protein MIF4G, middle domain of eukaryotic initiation factor 4G and MA3 domain, bipartite NLS
ENSP00000298643 PRO1777 PWI domain
SWISS-PROT: Q9Y580 RNA-binding protein 7 RRM domain
SWISS-PROT: O43251 RNA-binding protein 9 RRM domain
ENSP00000295971 Hypothetical protein FLJ20273 RRM domain
ENSP00000266301 KIAA1649 protein RRM domain
SWISS-PROT: Q9Y388 Hypothetical protein CGI-79.B RRM domain
SWISS-PROT: Q02040 B-lymphocyte antigen precursor RRM domain
ENSP00000262632 Hypothetical 47.4 kDa RRM domain, ATP/GTP-binding site motif A (P-loop)
ENSP00000293677 Hypothetical protein RRM domain, bipartite NLS
SWISS-PROT: Q9BXP5 Arsenite-resistance protein 2 RRM domain, bipartite NLS
ENSP00000220496 Hypothetical protein FLJ10634 RRM domain, DNAJ heat shock protein, bipartite NLS
TREMBL: O00425 Putative RNA-binding protein KOC RRM domain, KH domain
ENSP00000262710 KIAA0670 protein RRM domain, SAP domain
TREMBL: Q96SC6 OTT-MAL RRM domain, SAP domain
ENSP00000295996 KIAA0332 protein RRM domain, Surp domain, bipartite NLS
ENSP00000199814 Hypothetical protein FLJ10290 RRM domain, zinc finger C-x8-C-x5-C-x3-H type
SWISS-PROT: P98175 RNA-binding protein 10 RRM domain, C2H2 type zinc finger, bipartite NLS
ENSP00000261972 (+ENSP00000261973) Hypothetical protein S164 (+N-terminal extension: CDNA: FLJ22454 fis, clone HRC09703) RRM domain, PWI domain, bipartite NLS, Spectrin repeat (ENSP00000261973 encodes the N-terminal extension of ENSP00000261972)
TREMBL: Q9UQ35 RNA-binding protein RS domain
ENSP00000247001 F23858_1 Surp domain, G-patch domain
ENSP00000299951 Hypothetical protein U1-like zinc finger, bipartite NLS
ENSP00000281372 HsKin17 protein C2H2 zinc finger
TREMBL: Q96KR1 Putative Zinc finger protein C2H2 zinc finger
ENSP00000239893 OPA-interacting protein OIP2 3′ exoribonuclease family
Novel proteins without similarities implicating them in splicing/mRNA processing
SWISS-PROT: Q9C0J8 WDC146 G-protein beta WD-40 repeats
ENSP00000253952 Hypothetical 34.8 kDa protein G-protein beta WD-40 repeats
ENSP00000263222 Hypothetical 57.5 kDa protein G-protein beta WD-40 repeats
ENSP00000156471 KIAA0560 protein ATP/GTP-binding site motif A (P-loop)
SWISS-PROT: Q9UH06, ENSP00000216252 Hypothetical 12.4 kDa protein, BK223H9 PHD-finger (C4HC3 zinc finger); belongs to the UPF0123 family of hypothetical proteins
ENSP00000260210 Hypothetical protein MGC13125 Bipartite NLS, ankyrin similarity
ENSP00000257181 Hypothetical protein FLJ14936 Bipartite NLS, similar to unknowns
ENSP00000290008 Hypothetical protein Bipartite NLS
SWISS-PROT: Q9NZB2 C9orf10 protein Bipartite NLS, similar to unknowns
ENSP00000247026 Hypothetical 66.4 kDa protein Bipartite NLS
ENSP00000236273 GCIP-interacting protein p29 Bipartite NLS, similar to unknowns
ENSP00000292314 Hypothetical protein Bipartite NLS, similar to unknowns
ENSP00000266923 C21orf70 Bipartite NLS, similar to unknowns
ENSP00000221899 NY-REN-24 antigen Bipartite NLS, Ezrin/radixin/moesin family; similar to Drosophila cactin
SWISS-PROT: Q14331 FRG1 protein (FSHD region gene 1 protein) Bipartite NLS, Lipocalin-related protein and Bos/Can/Equ allergen domain
SWISS-PROT: P42285 KIAA0052 protein SKI2 helicase family
ENSP00000221413 CGI-46 protein DnaB helicase family
ENSP00000222969 G10 protein homolog (EDG-2) G10 protein family
ENSP00000279839 Adrenal gland protein AD-002 GTP-binding signal recognition particle (SRP54) G-domain
ENSP00000278702 Similar to nuclear mitotic apparatus protein 1 Involucrin repeat, G-protein gamma subunit, DNA gyrase/topoisomerase IV, subunit A, M protein repeat, bZIP (Basic-leucine zipper) transcription factor family
SWISS-PROT: Q92733 Proline-rich protein PRCC Proline-rich extension
ENSP00000263905 KIAA1461 protein PWWP domain, Methyl-CpG binding domain
XP_089514 Hypothetical protein Similar to nucleophosmin
ENSP00000258457 Hypothetical 25.9 kDa protein Similar to Xenopus ashwin
TREMBL: Q8WYA6 Nuclear associated protein Similar to Bos taurus P14
TREMBL: Q13769 Hypothetical protein Similarity to intermediate filament b [Dugesia japonica]
SWISS-PROT: Q9Y5B6 GC-rich sequence DNA-binding factor homolog Similar to the C-terminal region of GCF/TCF9 and other putative transcription factors
SWISS-PROT: Q9Y224 Hypothetical protein CGI-99 Similarity to putative transcription factors
ENSP00000216038 Hypothetical 55.2 kDa protein Uncharacterized protein family UPF0027
ENSP00000289509 Hypothetical 80.5 kDa protein Similar to unknowns
ENSP00000245838 Hypothetical protein LOC57187 Similar to unknowns
ENSP00000289996 Hypothetical protein Similar to unknowns
ENSP00000252137 DiGeorge syndrome critical region gene DGSI protein Similar to unknowns
ENSP00000256579 Hypothetical protein FLJ10330 Similar to unknowns
ENSP00000245651 C20orf158 protein Similar to unknowns
SWISS-PROT: Q9BWJ5 Hypothetical protein MGC3133 Similar to unknowns
ENSP00000272091 Hypothetical protein XP_089191 Similar to unknowns
ENSP00000297526 KIAA1440 protein Similar to unknowns
ENSP00000271942 Hypothetical protein FLJ21919 Similar to unknowns
TREMBL: Q9BTU2 Hypothetical 31.5 kDa protein Similar to unknowns
TREMBL: Q8WVN3 Hypothetical protein Similar to unknowns
  • SWISS-PROT or ENSEMBL accession numbers are given at http://srs.embl-heidelberg.de:8000 and http://www.ensembl.org.

  • Domains: RRM: RNA recognition motif; Bipartite NLS: bipartite nuclear localization signal; SPRY: SPla/ryanodine receptor SPRY domain; G-patch: named after seven highly conserved glycines; KH: hnRNP K homology domain; PWI: proline-tryptophan-isoleucine motif; SAP: SAF-A/B, Acinus and PIAS motif; RS: arginine-serine repeats; Surp: suppressor-of-white-apricot splicing regulator domain.
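
For readers who want to process the table programmatically, the following is a minimal sketch of how a row's leading accession could be separated from the rest of the row. It assumes the four accession styles seen in the rows above (ENSP identifiers, SWISS-PROT and TREMBL prefixes with or without a space after the colon, and XP_ identifiers). The names ACCESSION_PATTERN and split_row are hypothetical and are not part of the original article; splitting the remainder into the Name and Comments columns is deliberately not attempted, because the flattened text does not mark that boundary.

```python
import re

# Accession styles as they appear in the table rows (an assumption drawn
# from this table only, not a general specification of these databases).
ACCESSION_PATTERN = re.compile(
    r"^(?:(?P<ensembl>ENSP\d+)"
    r"|SWISS-PROT:\s*(?P<swissprot>[A-Z0-9]+)"
    r"|TREMBL:\s*(?P<trembl>[A-Z0-9]+)"
    r"|(?P<refseq>XP_\d+))"
    r"\s+(?P<rest>.+)$"
)


def split_row(row):
    """Return (database, accession, remainder) for a table row.

    Returns None for lines that do not start with a recognized accession,
    such as the two section headings inside the table.
    """
    match = ACCESSION_PATTERN.match(row.strip())
    if match is None:
        return None
    for database in ("ensembl", "swissprot", "trembl", "refseq"):
        accession = match.group(database)
        if accession is not None:
            return database, accession, match.group("rest")
    return None


if __name__ == "__main__":
    for line in (
        "ENSP00000295270 Hypothetical protein Similar to U5 snRNP 200 kDa",
        "TREMBL: Q9NUY0 CDNA FLJ11063 fis Similar to arginine/serine-rich 4",
        "Novel proteins without similarities implicating them in splicing/mRNA processing",
    ):
        print(split_row(line))
```

Rows that list more than one accession would yield only the first accession, with the second left in the remainder, so they need separate handling.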

This Article

  1. Genome Res. 12: 1231-1245
