Eukaryotic Promoter Recognition

Table 1.

Mapped and Predicted Transcription Start Sites

Seq. Citation TSS Audic Autogene GeneID NNPP P’Find P’Scan TATA TSSG TSSW
L47615  3321  bp Barbeau  et al.  (1996) 2078/  2108  (e) 2800 3000 r2572 r522 r672 r2422  1681 2080  2511  r920 r1623 r2012 300 1900 2200 2600 r221 r721 r1521 r1921 560 1690 2087 r2440 r2010 r1612 r909 2810 r2502 270 2707 2081 2510 r1302 r1304 r1703 262 2578 r166 r2023 r2556 246 549 2510 2811 r162 r701 r2023 r2614
U54701  5663  bp Bernier  et al.  (1996) 935(efp);  2002(ef)   715  2803  3117  3352  5250  5366  5683  r626 r1308 r2909 r5299 300 800 2000 2500 3200 3500 4200 r563 r1163 r1663 r2963 r5263 234 1095 2995 3121 3129 r4605 r3222 r1808 r528 4400 r4904 3113 119 228 716 2979 2981 3118 3368 5369 r4356 r5619 3124 2260 3115
None  2003  bp Chu et al.  (1996) 1483–1554  (ef);  1756– 1783  (efp) 1482 400 800 1500 r1403 1493 1060 r864 r186
U10577  1649  bp Grande  et al.  (1996) 1169–1171  (ef);  r1040–1045  (ef) 1150 1350 1550 r700 r800 500 800 362 787 1342 1170 r870 349 354 352 1165 r1271 352 1177 1078 r1432
U30245  1093  bp Kao et al.  (1996) 850–  961(efp)   314   472    680   r251   r386   r569   r620 740 r554 446 475 690 r528 r680 r818 r574
U69634  1515  bp Le et al.  (1997) 1450(ef) 300, 1500 , r16, r1016 300, 1200, r715 305, 605, 1376 r864 1490 r1186 280, 1336 , r54 306, 1336 , r61, r1336
U29912  565  bp Mahnke-Zizelman  et al.  (1996) 143–166  (efp) 550 r16 395 370 r66 379  383 r306 382 r330
U29927  2562  bp Mahnke-Zizelman  et al.  (1996) 738–  803(efp);  1553–  1717(efp) 521 700 1800 2400 r162 r762 r1262 r2062 528 577 1539 2120 r897 1280 r983 332 1529
Y10100  1066  bp Naville  et al.  (1997) 1018–  1033(ef) 161 769 r102 300 r466 167 787 r712 r88 400 r347 159 781 r181 r349 282 r183 166 r183
None  2191  bp Nomoto  et al.  (1996) 1793–  1812(ef) 2150 r1542 114 1139 1267 200 700 1000 1500 r691 r1291 1198 1297 1382 1649 1774 1852 r1718 r417 2010 r1712 1782 1135 1375 1824 r1599 r1936 1369 1839 r1599 r2018
U75286  1984  bp Rogers  et al.  (1997) 1416/  1480(efp) 1600 r985 r177 800 1200 r1684 494 726 1262 r147 1320 r1065 r1245 479 1302 r1121 r1732
U52432  1604  bp Schoots  et al.  (1997) 1521/  1523(efp) 1350 r1005 300 r1304 1187 1536 r1462 r864 r281 1120 r925 1138  r138 1231 r954 987 1288 r954
U80601  632  bp Silins  et al.  (1997) 317–  400(ef) 600 r33 135 296 309 389 402 414 433 530 542 551 607 r490 r478 r448 r378 r363 r275 620 r333 269 348 r284 393 r195 r518
X94563  2692  bp Swinnen  et al.  (1996) 1163–  1200(ef) 1000 1300 1900 2050 r1693 r1593 r1493 r893 374 2528 r262 r1044 r2469 700 1000 r1992 r2392 1200 1483 r2460 r2109 r1125 r251 1170 r1013 613 2527 r227 r578 r2436 1116 r1026 1188 1880 r1069 r1709
Z49978  1352  bp Szabo  et al.  (1996) 855(e);  1020(e);  1150(e) 1350 r1053 r903 289 889 1010 r190 r235 r955 400 1100 r852 127 1017 r942 r373 r187 r68 1320 r1023 1126 893 1011 r405 r1122 r1160 1152 r1039 r196 116 1212 r196 r1060
U49855  682  bp Toulouse  et al.  (1996) 28/51(e) 400 r33 r382 72 201 r156 r127 310 r33 251 r89 240 r168 r538
X75410  918  bp Wieman  et al.  (1996) 815/835/  836(ef) 146 476 601 656 682 778 871 931 r304 r405 r472 r510 r721 800 156 578 r501 r292 r39 300 r9 461 572 640 685 925 r198 r340 r385 r408 r410 r619 r874 r876
U24240  1728  bp Yoo et al.  (1996) 1480(p) 1400 r117 r767 r821 r887 600 1100 r528 r744 r958 r1364 r91 1350 r1429 r959 1518 1518 r1378
  • For each sequence tested (all mammalian), the DDBJ/EMBL/GenBank accession no. and length is listed, then the citation, transcription start site(s) (TSS), and prediction results from the various algorithms. A semicolon separates (groups of) TSSs that belong to different exons, or to functionally verified distinct promotors. Within a group of TSSs, if more than three were given by the investigators, and they were divided into major and minor sites, only major sites are listed (with the assumption that the minor starts are as likely to be from mRNA degradation products as from genuine alternative CAP sites). If more than three TSSs are given and all have equal status, the first and last, separated by a dash, are listed. If the TSS is prefixed with an r, the sense strand is the reverse complement of that given in the database (and numbering is 5’ to 3’ on that strand). For U75286, an alternate TSS is given by Chang and Yoshida (1997). With each TSS is given the experimental mapping method(s): (e) Primer extension; (f) functional promoter assay; (p) RNase protection. Correct predictions are shown in boldface type and underlined. The programs are described in the text.

This Article

  1. Genome Res. 7: 861-878

Preprint Server