Elliott H. Margulies; Gregory M. Cooper; George Asimenos; Daryl J. Thomas; Colin N. Dewey; Adam Siepel; Ewan Birney; Damian Keefe; Ariel S. Schwartz; Minmei Hou; James Taylor; Sergey Nikolaev; Juan I. Montoya-Burgos; Ari Löytynoja; Simon Whelan; Fabio Pardi; Tim Massingham; James B. Brown; Peter Bickel; Ian Holmes; James C. Mullikin; Abel Ureta-Vidal; Benedict Paten; Eric A. Stone; Kate R. Rosenbloom; W. James Kent; Gerard G. Bouffard; Xiaobin Guan; Nancy F. Hansen; Jacquelyn R. Idol; Valerie V.B. Maduro; Baishali Maskeri; Jennifer C. McDowell; Morgan Park; Pamela J. Thomas; Alice C. Young; Robert W. Blakesley; Donna M. Muzny; Erica Sodergren; David A. Wheeler; Kim C. Worley; Huaiyang Jiang; George M. Weinstock; Richard A. Gibbs; Tina Graves; Robert Fulton; Elaine R. Mardis; Richard K. Wilson; Michele Clamp; James Cuff; Sante Gnerre; David B. Jaffe; Jean L. Chang; Kerstin Lindblad-Toh; Eric S. Lander; Angie Hinrichs; Heather Trumbower; Hiram Clawson; Ann Zweig; Robert M. Kuhn; Galt Barber; Rachel Harte; Donna Karolchik; Matthew A. Field; Richard A. Moore; Carrie A. Matthewson; Jacqueline E. Schein; Marco A. Marra; Stylianos E. Antonarakis; Serafim Batzoglou; Nick Goldman; Ross Hardison; David Haussler; Webb Miller; Lior Pachter; Eric D. Green; Arend Sidow

Figure 7.

Annotated versus unannotated constrained sequences. For each block of constrained sequence, a score based on the log-likelihood of observing such a sequence under a model of constrained versus neutral evolution was computed using the phastOdds program (Siepel et al. 2005). These values were divided by the length of each block to compute a normalized per-base log-likelihood that reflects constraint intensity (X-axis). These values were plotted as a frequency histogram (Y-axis) for the blocks of constrained sequences that do (yellow) or do not (blue) overlap an experimental annotation. The distributions largely overlap (green), even at the extreme positive end in which highly constrained sequences reside. For comparison, the distribution for ancestral repeat sequences is shown as a representation of largely neutral DNA.

Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome

This Article

Preprint Server

Current Issue

In This Issue