Table 2.

Correlation of Clusters With Their Cognate CRM Sequences

Motif BCD CAD KR HG ABDB KNI GT PRD EN TLL TTK FTZF1 FTZ DFD
Site PWM  cutoff4.24.846.55.43.677.544.33.64.54.32.6
Cluster  Ecutoff4 * 10−4 1.2 * 10−3 2.3 * 10−3 4.4 * 10−3 2.3 * 10−3 1.1 * 10−2 1.1 * 10−2 4.1 * 10−2 3 * 10−4 1.4 * 10−2 4.4 * 10−3 4.1 * 10−2 6 * 10−4 7 * 10−4
Locus  name
 abda−0.050.275
 btd0.918−0.07−0.09
 dll
 ems0.7710
 en00.623
 eve0.5470.4590.2460.4130.5440.656
 ftz00.4930.4060
 gsb0.665
 gt0.930.7540.438
 hairy0.7790.3760.476−0.1
 hb0.4170.7580.684
 kni0.6200.4040.9330.4
 kr0.702−0.060.6150.325
 otd−0.06
 prd00
 runt0.3510.4240.088−0.06−0.08
 sal0.769−0.050.854−0.03
 tll−0.040.958
 ubx0.2760.3980.04

[i] Correlation coefficients (CC) are shown for some of the tested motif/locus combinations. High CC reflects the presence of a binding motif cluster in its cognate CRM region. A moderate CC value indicates the presence of additional clusters of the motif in the locus or shift of binding site clusters relative to the regulatory regions. Motifs tested and the optimal parameter settings are shown in the top rows; gene names are given in the first column. In all tests, the size of the resolution window was set to 575 bp, according to the results of the global optimization procedure (see Table 1). All combinations tested correspond to known regulatory interactions.