A Biophysical Approach to Transcription Factor Binding Site Discovery

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 3
Figure 3

This figure illustrates the importance of the correlation effects in genomic background statistics. The histograms of binding energies are obtained for a randomly chosen ϵ vector (blue) and its scrambled, that is, position-permuted, version Formula (red). The magenta curve is the theoretical estimate of the binding energy distribution based on a random nucleotide model without correlations, but with the correct one-point statistics of bases. That estimate is the same for ϵ and Formula. The theoretical energy distribution for model background statistics, which includes correct one- and two- (nearest-neighbor) base statistics, is different for ϵ and Formula (green curves) is in a much better agreement with the empirical histograms.

This Article

  1. Genome Res. 13: 2381-2390

Preprint Server