Table 1.

Highest Scoring Nontrivial Patterns with (at Most) One Wild-Card Symbol

No.[ii] PatternScore[iii] N+[iv] N[v]
A. Regions −100..0
2AAG.AAACAAA6.54371
6A.TAAGAACA5.79270
8A.AATAGGA5.61433
9AAGAAA.CAAA5.58260
12GTAACAA.C5.36250
13AAA.AACTTA5.36250
20ACAAC.TAA5.09393
21AG.AAACAAA5.06648
23ACAAACAA.A4.97485
26AATAGTA.A4.927711
32AATAGTATA4.77271
34TCACTAC.T4.72220
35CAAACA.ACA4.72220
37ACA.ATAGA4.72557
42AGAGA.ATA4.63547
47AATAAACAA.A4.59261
50AAAG.ACAAG4.57353
52CTAAGAA.A4.55537
56A.AAGGGAAG4.51210
57CAAA.TAAC4.50486
B. Regions −250..−150
14TTACCCGC6.22290
58GT.ACCCG5.59545
71T.ACCCGC5.48423
126CGGGTA.T5.06648
141G.TACCCG4.97485
165CGGGTAA.A4.87475
178GTTACCCG4.83373
305TACAT.TATA4.436510
353TTTCTC.TTT4.32466
372TTACCCG4.3011923
379TTTCCTGT.T4.29200
405CTCATCTC.T4.24241
425TCACGTGA4.20282
427T.ATATATTC4.20282
454CGGGTAA4.1211423
460TGTGT.GAT4.08190
465ATTACCCG.A4.08190
474G.ACATATAT4.06231
485TA.GTAAAC4.05272
500TTTCTCT.TT4.03477

[i] Matches were only allowed on the W (gene) strand.

[ii] No. of the pattern enumerating them decreasingly by scores (before trivial patterns were removed).

[iii] From equation 2.

[iv] No. of upstream regions matching the pattern.

[v] No. of random sequences matching the pattern.