Potential Regulatory Sequences in Coregulated Genes
| Gene pair[i] | Homology Blocks | ||
| SIAT9 | t[ttgatatgt] [iii] cacttgattgggaaaf(26/26) | ccttt[gccaatacaatgca][iv]gcaaatgctt (29/29) | |
| (6 exons) | (689 bp 3′ downstream exon 2) | (4358 bp 3′ downstream stop codon) | |
| HIVEP2+ | t[ttgatatat] [iii] caattgattaggaaac(23/26) | cctct[gccaatcacatgca] [iv] gcaaatgctt(25/29) | |
| (6 exons) | (8331 bp 5′ upstream start codon) | (7558 bp 3′ downstream stop codon) | |
| HIVEP2 | ttttta{(t[ttcatctga) [v] c] [vi] a} [vii] a(19/19) | gataattttacttggtattt[ctttttcttga] [viii](31/31) | |
| (6 exons) | (16087 bp 5′ upstream start codon) | (823 bp 3′ downstream exon 3) | |
| SEPP1+ | tttttc({t[ttcatctga) [v] c] [vi] a} [vii] a(18/19) | gattatttcattttgttttt[ctttttcttga] [viii](26/31) | |
| (5 exons) | (16520 bp 5′ upstream start codon) | (3784 bp 3′ downstream stop codon) | |
| HIVEP2 | [agaaaatctcttccttttaa] [xii] atttct(26/26) | [atgaa] [xii] agctctcagtatattggc(23/23) | [tttttttttttt] [xi] gcttgtaaaa(22/22) |
| (6 exons) | (7281 bp 5′ upstream start codon) | (802 bp 5′ upstream start codon) | (17498 bp 3′ downstream stop codon) |
| NEFL mm+ | [agaaaatcttttcctattaa] [ix] aattct(23/26) | [atgaa] [x] ggctctcagtgtattggc(21/23) | [tttttttttttt] [xi] tcttctaaaa(20/22) |
| (4 exons) | (19520 bp 5′ upstream start codon) | (1611 bp 3′ downstream stop codon) | (9527 bp 5′ upstream start codon) |
| HIVEP2 | aa[aaagaatgaatctgttttaa] [xii](22/22) | ||
| (6 exons) | (17972 bp 3′ downstream stop codon) | ||
| SCYD1+ | aa[aaaaaacgaatctgttttaa] [xii](20/22) | ||
| (3 exons) | (1816 bp 3′ downstream exon 1) | ||
| SEPP1 | gaatctgcaaa[gcctttcct] [xiii](20/20) | [cttctgtttactctca] [xiv] ctct(20/20) | |
| (5 exons) | (15110 bp 5′ upstream start codon) | (14403 bp 5′ upstream start codon) | |
| SCYD1+ | gaaactgtaaa[gcctttcct] [xiii](18/20) | [cttcttgttactctca] [xiv] ctct(18/20) | |
| (3 exons) | (9285 bp 3′ downstream stop codon) | (19725 bp 3′ downstream stop codon) | |
| SCYD1 | atgcata[aatatttacatata] [xv] t(22/22) | aaattttcttctgcttagct [xxxix](20/20) | |
| (3 exons) | (5003 bp 3′ downstream 3′ exon 1) | (5235 bp 3′ downstream exon 1) | |
| NEFL+ | atgcata[tatattcacatata] [xv] t(20/22) | aaatttttttctgcttagct [xxxix](19/20) | |
| (4 exons) | (13633 bp 3′ downstream stop codon) | (6321 bp 5′ upstream start codon) | |
| CUGBP2 | at[ggaaaatgcaaagg] [xvi] agaaaacag(25/25) | gcccccct[aggagaggcaggtg] [xvii] ctgc(26/26) | |
| (13 exons) | (65490 bp 3′ downstream exon 1) | (89115 bp 3′ downstream exon 1) | |
| MBP+ | at[ggaaaaagcaaagg] [vii] agaaatcag(23/25) | gcccacca[aggagaggcagggg] [xvii] ctgc(23/26) | |
| (7 exons) | (33 bp 3′ downstream exon 1) | (17744 bp 5′ upstream) | |
| MBP | atgcaaa[tacataaaat] [xviii] aa(19/19) | tcagtaacatttat[tttaacaactt] [xix](25/25) | |
| (7 exons) | (10021 bp 5′ upstream start codon) | (6019 bp 3′ downstream stop codon) | |
| CBLN1+ | atgtaaa[tacataaaat] [xviii] aa(18/19) | tcattaaaatttat[tttaaaaactt] [xix](22/25) | |
| (3 exons) | (12012 bp 5′ upstream start codon) | (9085 bp 5′ upstream start codon) | |
| MBP | catgtttgcagtggagt [xxxix](17/17) | tttcttt[cttgctttctgg] [xx](19/19) | |
| (7 exons) | (1122 bp 5′ upstream exon 6) | (3434 bp 3′ downstream end of exon 6) | |
| NEFL− | catgtttgcagtggagt [xxxix](17/17) | tttcttt[ctagctttctgg] [xx](18/19) | |
| (4 exons) | (1045 bp 3′ downstream codon) | (623 bp 5′ upstream start codon) | |
| NEFL | t[gttgt{tgttgtt] u gtttt} [xxi] g(19/19) | gatac[tttcaaagcatctgg] [xxii](20/20) | |
| (4 exons) | (17091 bp 5′ upstream start codon) | (5611 bp 5′ upstream start codon) | |
| CBLN1− | t[gttgt{tgttgtt] u gttt} [xxi] g(19/19) | gattc[tttcaaagcatctgg] [xxii](19/20) | |
| (3 exons) | (461 bp 3′ downstream stop codon) | (18945 bp 5′ upstream start codon) | |
| NEFL | a{(a[aaaaaaaaaaaa] [xxiii])[xxiv] a} [xxv] aaagcacttca(26/26) | aa[t(aaataaataaat) aa ] bb,cc gtcc(19/19) | |
| (4 exons) | (9536 bp 5′ upstream start codon) | (4175 bp 3′ downstream stop codon) | |
| PNUTL2 nn | a{(a[aaaaaaaaaaaa] [xxiii])[xxiv] a} [xxv] agagagcttca(23/26) | aa[t(aaataaataaat) aa ] bb,cc ttcc(18/19) | |
| (13 exons) | (15553 bp 5′ upstream start codon) | (2140 bp 3′ downstream exon 5) | |
| NEFL | atctttgttagtt[tttttttttttt] [xi] t(26/26) | gaggttttggtagcattct [xxxix] (1919) | |
| (4 exons) | (9120 bp 5′ upstream start codon) | (412 bp 3′ downstream exon 3) | |
| USP9X− | atctttgttagga[tttttttttttt] [xi] t(24/26) | gaggatttggtagcattct [xxxix](18/19) | |
| (44 exons) | (963 bp 3′ downstream exon 22) | (1737 bp 5′ upstream start codon) | |
| NEFL | aaacaaaa[aaaagaaaaaaat] dd tatttt(27/27) | ||
| (4 exons) | (745 bp 3′ downstream exon 1) | ||
| INA+ | aaagaaag[aaaagaaaaaaat] dd tagttt(24/27) | ||
| (3 exons) | (12338 bp 5′ upstream start codon) | ||
| CBLN1 | tagactacactccaaaattt[ggacattc] ee(28/28) | ttttggtga[aagttaagatattt] ff c(25/25) | g[gc(c{ctcatctgca} gg g) hh g] ii c(17/17) |
| (3 exons) | (10346 bp 5′ upstream start codon) | (4822 bp 5′ upstream start codon) | (6168 bp 3′ downstream stop codon) |
| USP9X+ | tagactacacttcaaaagta[ggatattc] ee(24/28) | tttttggtgt[aatttaagaaattt] ff c(22/25) | g[gc(c{ctcatctgca} gg g) hh g] ii c(17/17) |
| (44 exons) | (3010 bp 3′ downstream exon 34) | (4799 bp 3′ downstream stop codon) | (18415 bp 3′ downstream stop codon) |
| [ctttgttttt] jj gttttttttgg(22/22) | |||
| (8167 bp 3′ downstream stop codon) | |||
| [ctttgttttt] jj gctttttattgg(20/22) | |||
| (13160 bp 3′ downstream stop codon) | |||
| USP9X | ttttttg(tttttt[t{ttttt) kk } ll(23/23) | ||
| (44 exons) | (1572 bp 3′ downstream exon 11) | ||
| PNUTL2+ | tttgttt(tttttt[t{ttttt) [xi] cctg] kk } ll(21/23) | ||
| (13 exons) | (625 bp 5′ upstream start codon) | ||
[i] (+) correlated pair, (−) anti-correlated pair.
[ii] Nucleotide matches shown in parentheses, mismatches in bold. Potential binding sites (core in capitals).
[iii] LMO2COM ttGATAtat.
[iv] OCT1 gccaatcacATGCa.
[v] GATA2 & GATA3 tcaGATGaaa(anti-sense strand).
[vi] MYOD ttCATCgac.
[vii] LMO2COMtgtCAGAtgaaa (anti-sense strand).
[viii] NFAT ntcaaGAAAaag (anti-sense strand).
[ix] GFI1 nnnnagaaAATCttttcctattaa.
[x] TCF11 TTCAtnnnnnnnn (anti-sense strand).
[xi] HFH2 tttTTTTttttt.
[xii] GFI1 aaagaatgAATCtgttttaannnn.
[xiii] NFAT nnnagGAAAggc (anti-sense strand).
[xiv] FREAC2 tgagagTAAAcagaag (anti-sense strand).
[xv] ICT1 tatattcacATATa.
[xvi] GKLF ggaaaaagcaAAGG.
[xvii] GKLF aggagaggcaGGGG.
[xviii] TATA taCATAAAAt.
[xix] SRY tttaACAActtn.
[xx] TH1E47 cttgctttCTGGnnnn.uSRYaacaACAAcaac anti-sense strand).
[xxi] HFH2 tgtTGTTgtttt.
[xxii] BARBIE tttcAAAGcatctgg.
[xxiii] HFH2 tttTTTTttttt (anti-sense strand).
[xxiv] HFH3 tttTTTTtttttt (anti-sense strand).
[xxv] HNF3B tttttTTTTtttttt (anti-sense strand).
[xxvi] aaHFH2 attTATTtattt (anti-sense strand).
[xxvii] bb,ccHFH8 & HFH3 attTATTtattta (anti-sense strand).
[xxviii] ddHFH-3 attTTTTtctttt (anti-sense strand).
[xxix] eeGATA gGATATtcnnn.
[xxx] ffCEBPB aatttaaGAAAttt.
[xxxi] ggMYOD ctCATCtgca.
[xxxii] hhLMO2COM ctgCAGAtgagg (anti-sense strand).
[xxxiii] iiE47 cctGCAGatgagggc (anti-sense strand).
[xxxiv] jjSRY aaaaACAAagnn (anti-sense strand).
[xxxv] kkNFAT nncagGAAAaaa (anti-sense strand).
[xxxvi] llCETS1P54 ncAGGAaaaa (anti-sense strand).
[xxxvii] mmhuman NEFL = mouse Nfl.
[xxxviii] nnhuman PNUTL2 = mouse Sept4.
[xxxix] No known transcription factor binding site.