John A. McNeil; Kelly P. Smith; Lisa L. Hall; Jeanne B. Lawrence

Word frequency analysis reveals enrichment of dinucleotide repeats on the human X chromosome and [GATA]_n in the X escape region

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 1.

Distribution of word frequencies in the genome. The x-axis represents the frequency of word pairs in the genome, and the y-axis is the number of word pairs that occur at that frequency. The highest peak is largely populated by complex words that contain no CpGs. Words containing two and one CpGs, respectively, populate the first two peaks. The rarest words in the left tail have three or four CpGs, while the shoulder on the right tail is composed of simple sequence, largely mono- and dinucleotide repeats (see arrow).

This Article

Published in Advance March 13, 2006, doi: 10.1101/gr.4627606 Genome Res. 2006. 16: 477-484

AbstractFree
Full TextFree
Full Text (PDF)
Supplemental Research Data

Word frequency analysis reveals enrichment of dinucleotide repeats on the human X chromosome and [GATA]_n in the X escape region

This Article

Preprint Server

Current Issue

In This Issue

Word frequency analysis reveals enrichment of dinucleotide repeats on the human X chromosome and [GATA]n in the X escape region

This Article

Preprint Server

Current Issue

In This Issue

Word frequency analysis reveals enrichment of dinucleotide repeats on the human X chromosome and [GATA]_n in the X escape region