Table 2.

Novel Proteins

Acc. No.[i] Name Comments[ii]
Novel proteins and proteins with unclear functions with sequence similarities implicating them in splicing/mRNA processing
ENSP00000295270Hypothetical proteinSimilar to U5 snRNP 200 kDa
ENSP00000272417CDNA FLJ13778 fisSimilar to U5 snRNP 200 kDa
ENSP00000301345Hypothetical proteinSimilar to U5 snRNP 220 kDa
TREMBL: Q9NUY0CDNA FLJ11063 fisSimilar to arginine/serine-rich 4
SWISS-PROT:Q13523 Serine/threonine-protein kinaseSer/Thr protein kinase family, similar to S. pombePRP4
ENSP00000296630Hypothetical proteinRRM domain, bipartite NLS, similar to arginine/serine-rich 11
ENSP00000266057CDNA FLJ10998 fisSimilar to RNA lariat debranching enzyme
ENSP00000273541Hypothetical proteinSimilar to Isy 1p, a potential splice factor in yeast
XP_013029Hypothetical proteinSimilar to U2 snRNP A′
ENSP00000286032Hypothetical proteinSimilar to hnRNP A3
ENSP00000301786Hypothetical proteinSimilar to hnRNP U
ENSP000000301784Hypothetical proteinSimilar to hnRNP U
ENSP00000261832Hypothetical protein DKFZp434E2220BASIC, basic domain in HLH proteins of MYOD family, PSP, proline-rich domain in spliceosome-associated proteins, zinc finger CCHC, zinc knuckle
ENSP00000244367CGI-124 proteinCyclophilin-type peptidyl-prolyl cis-transisomerase
ENSP00000215824CYP-60Cyclophilin-type peptidyl-prolyl cis-transisomerase
ENSP00000234288PPIL3bCyclophilin-type peptidyl-prolyl cis-transisomerase
ENSP00000282972Serologically defined colon cancer antigen 10Cyclophilin-type peptidyl-prolyl cis-transisomerase, bipartite NLS
SWISS-PROT: Q9UNP9Cyclophilin ECyclophilin-type peptidyl-prolyl cis-trans isomerase, RRM domain
ENSP00000261308KIAA0073 proteinCyclophilin-type peptidyl-prolyl cis-trans isomerase, G-protein beta WD-40 repeats
SWISS-PROT: Q92841 Probable RNA-dependent helicase p72DEAD/DEAH-box helicase
ENSP00000274514RNA helicaseDEAD/DEAH-box helicase
ENSP00000242776Hypothetical proteinSimilar to nuclear RNA helicase, DECD variant of DEAD-box helicase family
SWISS-PROT: Q92499 DDX1DEAD/DEAH-box helicase, SPRY domain
SWISS-PROT: Q9NR30DDX21DEAD/DEAH-box helicase, bipartite NLS
SWISS-PROT: Q9UJV9DEAD-box protein abstract homologDEAD/DEAH-box helicase, zinc finger CCHC type
ENSP00000218971DDX26DEAD-box, von Willebrand factor type A domain
SWISS-PROT: P38919 Eukaryotic initiation factor 4A-like NUK-34DEAD-box helicase
ENSP00000297920Hypothetical protein FLJ11307Double-stranded RNA-binding domain (DsRBD)
ENSP00000263115Hypothetical proteinG-patch domain
ENSP00000277477Far upstream element (FUSE) binding protein 3KH domain
ENSP00000295749KIAA 1604 proteinMIF4G, middle domain of eukaryotic initiation factor 4G and MA3 domain, bipartite NLS
ENSP00000298643PRO1777PWI domain
SWISS-PROT: Q9Y580RNA-binding protein 7RRM domain
SWISS-PROT: O43251 RNA-binding protein 9RRM domain
ENSP00000295971Hypothetical protein FLJ20273RRM domain
ENSP00000266301KIAA 1649 proteinRRM domain
SWISS-PROT: Q9Y388Hypothetical protein CGI-79.BRRM domain
SWISS-PROT: Q02040 B-lymphocyte antigen precursorRRM domain
ENSP00000262632Hypothetical 47.4 kDaRRM domain, ATP/GTP-binding site motif A (P-loop)
ENSP00000293677Hypothetical proteinRRM domain, Bipartite NLS
SWISS-PROT: Q9BXP5Arsenite-resistance protein 2RRM domain, Bipartite NLS
ENSP00000220496Hypothetical protein FLJ10634RRM domain, DNAJ heat shock protein, bipartite NLS
TREMBL: O00425 Putative RNA-binding protein KOCRRM domain, KH domain
ENSP00000262710KIAA0670 proteinRRM domain, SAP domain
TREMBL: Q96SC6OTT-MALRRM domain, SAP domain
ENSP00000295996KIAA0332 proteinRRM domain, Surp domain, Bipartite NLS
ENSP00000199814Hypothetical protein FLJ10290RRM domain, Zinc finger C-x8-C-x5-C-x3-H type
SWISS-PROT: P98175 RNA-binding protein 10RRM domain, C2H2 type zinc finger, bipartite NLS
ENSP00000261972 (+ENSP00000261973)Hypothetical protein S164 (+N-terminal extension: CDNA: FLJ22454 fis, clone HRC09703)RRM domain, PWI domain, bipartite NLS, Spectrin repeat (ENSP00000261973 encodes the N-terminal extension of ENSP00000261972)
TREMBL: Q9UQ35RNA-binding proteinRS domain
ENSP00000247001 F23858_1Surp domain, G-patch domain
ENSP00000299951Hypothetical proteinU1-like zinc finger, bipartite NLS
ENSP00000281372HsKin17 proteinC2H2 zinc finger
TREMBL: Q96KR1Putative Zinc finger proteinC2H2 zinc finger
ENSP00000239893OPA-interacting protein OIP23′ exoribonuclease family
Novel proteins without similarities implicating them in splicing/mRNA processing
SWISS-PROT: Q9C0J8WDC146G-protein beta WD-40 repeats
ENSP00000253952Hypothetical 34.8 kDa proteinG-protein beta WD-40 repeats
ENSP00000263222Hypothetical 57.5 kDa proteinG-protein beta WD-40 repeats
ENSP00000156471KIAA0560 proteinATP/GTP-binding site motif A (P-loop)
SWISS-PROT: Q9UH06   ENSP00000216252Hypothetical 12.4 kDa protein   BK223H9PHD-finger (C4HC3 zinc finger) belongs to the UPF0123 family of hypothetical proteins
ENSP00000260210Hypothetical protein MGC13125Bipartite NLS, ankyrin similarity
ENSP00000257181Hypothetical protein FLJ14936Bipartite NLS, similar to unknowns
ENSP00000290008Hypothetical proteinBipartite NLS
SWISS-PROT: Q9NZB2C9orf10 proteinBipartite NLS, similar to unknowns
ENSP00000247026Hypothetical 66.4 kDa proteinBipartite NLS
ENSP00000236273GCIP-interacting protein p29Bipartite NLS, similar to unknowns
ENSP00000292314Hypothetical proteinBipartite NLS, similar to unknowns
ENSP00000266923C21orf70Bipartite NLS, similar to unknowns
ENSP00000221899NY-REN-24 antigenBipartite NLS, Ezrin/radixin/moesin family; similar to Drosophila cactin
SWISS-PROT: Q14331 FRG1 protein (FSHD region gene 1 protein)Bipartite NLS, Lipocalin-related protein and Bos/Can/Equ allergen domain
SWISS-PROT: P42285 KIAA0052 proteinSKI2 helicase family
ENSP00000221413CGI-46 proteinDnaB helicase family
ENSP00000222969G10 protein homolog (EDG-2)G10 protein family
ENSP00000279839Adrenal gland protein AD-002GTP-binding signal recognition particle (SRP54) G-domain
ENSP00000278702Similar to nuclear mitotic apparatus protein 1Involucrin repeat, G-protein gamma subunit, DNA gyrase/topoisomerase IV, subunit A, M protein repeat, bZIP (Basic-leucine zipper) transcription factor family
SWISS-PROT:Q92733 Proline-rich protein PRCCProline-rich extension
ENSP00000263905KIAA1461 proteinPWWP domain, Methyl-CpG binding domain
XP_089514Hypothetical proteinSimilar to nucleophosmin
ENSP00000258457Hypothetical 25.9 kDa proteinSimilar to Xenopus ashwin
TREMBL: Q8WYA6Nuclear associated proteinSimilar to Bos taurus P14
TREMBL: Q13769 Hypothetical proteinSimilarity to intermediate filament b [Dugesia japonica]
SWISS-PROT: Q9Y5B6GC-rich sequence DNA-binding factor homologSimilar to C-TERMINAL OF GCF/TCF9 and other putative transcription factors
SWISS-PROT: Q9Y224Hypothetical protein CGI-99Similarity to putative transcription factors
ENSP00000216038Hypothetical 55.2 kDa proteinUncharacterized protein family UPF0027
ENSP00000289509Hypothetical 80.5 kDa proteinSimilar to unknowns
ENSP00000245838Hypothetical protein LOC57187Similar to unknowns
ENSP00000289996Hypothetical proteinSimilar to unknowns
ENSP00000252137DiGeorge syndrome critical region gene DGSI proteinSimilar to unknowns
ENSP00000256579Hypothetical protein FLJ10330Similar to unknowns
ENSP00000245651C20orf158 proteinSimilar to unknowns
SWISS-PROT: Q9BWJ5Hypothetical protein MGC3133Similar to unknowns
ENSP00000272091Hypothetical protein XP_089191Similar to unknowns
ENSP00000297526KIAA1440 proteinSimilar to unknowns
ENSP00000271942Hypothetical protein FLJ21919Similar to unknowns
TREMBL: Q9BTU2Hypothetical 31.5 kDa proteinSimilar to unknowns
TREMBL: Q8WVN3Hypothetical proteinSimilar to unknowns

[i] SWISS-PROT or ENSEMBL accession numbers are given athttp://srs.embl-heidelberg.de:8000 and http://www.ensembl.org.

[ii] Domains: RRM: RNA recognition motive; Bipartite NLS: Bipartite Nuclear Localization Signal; SPRY: SP1a/RY anodine receptor SPRY domain; G-patch: named after seven highly conserved glycines; KH: hnRNP K homology domain; PWI: proline-tryptophan-isoleucine motifs; SAP: SAF-A/B, Acinus and PIAS motif; RS: Arginine-Serine repeats; Surp: Suppressor-of-white-apricot splicing regulator domain.