<?xml version="1.0" encoding="ISO-8859-1"?>

<rdf:RDF
 xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
 xmlns="http://purl.org/rss/1.0/"
 xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/"
 xmlns:dc="http://purl.org/dc/elements/1.1/"
 xmlns:syn="http://purl.org/rss/1.0/modules/syndication/"
 xmlns:prism="http://purl.org/rss/1.0/modules/prism/"
 xmlns:admin="http://webns.net/mvcb/"
>

<channel rdf:about="http://genome.cshlp.org">
<title>Genome Research current issue</title>
<link>http://genome.cshlp.org</link>
<description>Genome Research RSS feed -- current issue</description>
<prism:coverDisplayDate>Nov  1 2008 12:00:00:000AM</prism:coverDisplayDate>
<prism:publicationName>Genome Research</prism:publicationName>
<prism:issn>1088-9051</prism:issn>
<items>
 <rdf:Seq>
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1683?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1686?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1698?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1711?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1724?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1733?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1743?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1752?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1763?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1778?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1787?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1798?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1806?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1814?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1829?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1844?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1851?rss=1" />
  <rdf:li rdf:resource="http://genome.cshlp.org/cgi/content/short/18/11/1859?rss=1" />
 </rdf:Seq>
</items>
<image rdf:resource="http://genome.cshlp.org/icons/banner/title.gif" />
</channel>

<image rdf:about="http://genome.cshlp.org/icons/banner/title.gif">
<title>Genome Research</title>
<url>http://genome.cshlp.org/icons/banner/title.gif</url>
<link>http://genome.cshlp.org</link>
</image>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1683?rss=1">
<title><![CDATA[[FORUM] Don't throw the baby out with the bathwater: Enabling a bottom-up approach in genome-wide association studies]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1683?rss=1</link>
<description><![CDATA[]]></description>
<dc:creator><![CDATA[McGuire, S. E., McGuire, A. L.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.083584.108</dc:identifier>
<dc:title><![CDATA[[FORUM] Don't throw the baby out with the bathwater: Enabling a bottom-up approach in genome-wide association studies]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1685</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1683</prism:startingPage>
<prism:section>FORUM</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1686?rss=1">
<title><![CDATA[[REVIEW] Defensins and the dynamic genome: What we can learn from structural variation at human chromosome band 8p23.1]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1686?rss=1</link>
<description><![CDATA[
<p>Over the past four years, genome-wide studies have uncovered numerous examples of structural variation in the human genome. This includes structural variation that changes copy number, such as deletion and duplication, and structural variation that does not change copy number, such as orientation and positional polymorphism. One region that contains all these types of variation spans the chromosome band 8p23.1. This region has been studied in some depth, and the focus of this review is to examine our current understanding of the variation of this region. We also consider whether this region is a good model for other structurally variable regions in the genome and what the implications of this variation are for clinical studies. Finally, we discuss the bioinformatics challenges raised, discuss the evolution of the region, and suggest some future priorities for structural variation research.</p>
]]></description>
<dc:creator><![CDATA[Hollox, E. J., Barber, J. C.K., Brookes, A. J., Armour, J. A.L.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.080945.108</dc:identifier>
<dc:title><![CDATA[[REVIEW] Defensins and the dynamic genome: What we can learn from structural variation at human chromosome band 8p23.1]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1697</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1686</prism:startingPage>
<prism:section>REVIEW</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1698?rss=1">
<title><![CDATA[[ARTICLES] Copy number variation and evolution in humans and chimpanzees]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1698?rss=1</link>
<description><![CDATA[
<p>Copy number variants (CNVs) underlie many aspects of human phenotypic diversity and provide the raw material for gene duplication and gene family expansion. However, our understanding of their evolutionary significance remains limited. We performed comparative genomic hybridization on a single human microarray platform to identify CNVs among the genomes of 30 humans and 30 chimpanzees as well as fixed copy number differences between species. We found that human and chimpanzee CNVs occur in orthologous genomic regions far more often than expected by chance and are strongly associated with the presence of highly homologous intrachromosomal segmental duplications. By adapting population genetic analyses for use with copy number data, we identified functional categories of genes that have likely evolved under purifying or positive selection for copy number changes. In particular, duplications and deletions of genes with inflammatory response and cell proliferation functions may have been fixed by positive selection and involved in the adaptive phenotypic differentiation of humans and chimpanzees.</p>
]]></description>
<dc:creator><![CDATA[Perry, G. H., Yang, F., Marques-Bonet, T., Murphy, C., Fitzgerald, T., Lee, A. S., Hyland, C., Stone, A. C., Hurles, M. E., Tyler-Smith, C., Eichler, E. E., Carter, N. P., Lee, C., Redon, R.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.082016.108</dc:identifier>
<dc:title><![CDATA[[ARTICLES] Copy number variation and evolution in humans and chimpanzees]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1710</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1698</prism:startingPage>
<prism:section>ARTICLES</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1711?rss=1">
<title><![CDATA[[LETTERS] Reduced purifying selection prevails over positive selection in human copy number variant evolution]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1711?rss=1</link>
<description><![CDATA[
<p>Copy number variation is a dominant contributor to genomic variation and may frequently underlie an individual&rsquo;s variable susceptibilities to disease. Here we question our previous proposition that copy number variants (CNVs) are often retained in the human population because of their adaptive benefit. We show that genic biases of CNVs are best explained, not by positive selection, but by reduced efficiency of selection in eliminating deleterious changes from the human population. Of four CNV data sets examined, three exhibit significant increases in protein evolutionary rates. These increases appear to be attributable to the frequent coincidence of CNVs with segmental duplications (SDs) that recombine infrequently. Furthermore, human orthologs of mouse genes, which, when disrupted, result in pre- or postnatal lethality, are unusually depleted in CNVs. Together, these findings support a model of reduced purifying selection (Hill&ndash;Robertson interference) within copy number variable regions that are enriched in nonessential genes, allowing both the fixation of slightly deleterious substitutions and increased drift of CNV alleles. Additionally, all four CNV sets exhibited increased rates of interspecies chromosomal rearrangement and nucleotide substitution and an increased gene density. We observe that sequences with high G+C contents are most prone to copy number variation. In particular, frequently duplicated human SD sequence, or CNVs that are large and/or observed frequently, tend to be elevated in G+C content. In contrast, SD sequences that appear fixed in the human population lie more frequently within low G+C sequence. These findings provide an overarching view of how CNVs arise and segregate in the human population.</p>
]]></description>
<dc:creator><![CDATA[Nguyen, D.-Q., Webber, C., Hehir-Kwa, J., Pfundt, R., Veltman, J., Ponting, C. P.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.077289.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] Reduced purifying selection prevails over positive selection in human copy number variant evolution]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1723</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1711</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1724?rss=1">
<title><![CDATA[[LETTERS] Copy number variation at the breakpoint region of isochromosome 17q]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1724?rss=1</link>
<description><![CDATA[
<p>Isochromosome 17q, or i(17q), is one of the most frequent nonrandom changes occurring in human neoplasia. Most of the i(17q) breakpoints cluster within a ~240-kb interval located in the Smith-Magenis syndrome common deletion region in 17p11.2. The breakpoint cluster region is characterized by a complex architecture with large (~38&ndash;49 kb), inverted and directly oriented, low-copy repeats (LCRs), known as REPA and REPB that apparently lead to genomic instability and facilitate somatic genetic rearrangements. Through the analysis of bacterial artificial chromosome (BAC) clones, pulsed-field gel electrophoresis (PFGE), and public array comparative genomic hybridization (array CGH) data, we show that the REPA/B structure is also susceptible to frequent meiotic rearrangements. It is a highly dynamic genomic region undergoing deletions, inversions, and duplications likely produced by non-allelic homologous recombination (NAHR) mediated by the highly identical <I>SNORD3@</I>, also known as <I>U3</I>, gene cluster present therein. We detected at least seven different REPA/B structures in samples from 29 individuals of which six represented potentially novel structures. Two polymorphic copy-number variation (CNV) variants, detected in 20% of samples, could be structurally described along with the likely underlying molecular mechanism for formation. Our data show the high susceptibility to rearrangements at the i(17q) breakpoint cluster region in the general population and exemplifies how large genomic regions laden with LCRs still represent a technical challenge for both determining specific structure and assaying population variation. The variant REPA/B structures identified may have different susceptibilities for inducing i(17q), thus potentially representing important risk alleles for tumor progression.</p>
]]></description>
<dc:creator><![CDATA[Carvalho, C. M.B., Lupski, J. R.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.080697.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] Copy number variation at the breakpoint region of isochromosome 17q]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1732</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1724</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1733?rss=1">
<title><![CDATA[[LETTERS] Unexpected complexity at breakpoint junctions in phenotypically normal individuals and mechanisms involved in generating balanced translocations t(1;22)(p36;q13)]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1733?rss=1</link>
<description><![CDATA[
<p>Approximately one in 500 individuals carries a reciprocal translocation. Balanced translocations are usually associated with a normal phenotype unless the translocation breakpoints disrupt a gene(s) or cause a position effect. We investigated breakpoint junctions at the sequence level in phenotypically normal balanced translocation carriers. Eight breakpoint junctions derived from four nonrelated subjects with apparently balanced translocation t(1;22)(p36;q13) were examined. Additions of nucleotides, deletions, duplications, and a triplication identified at the breakpoints demonstrate high complexity at the breakpoint junctions and indicate involvement of multiple mechanisms in the DNA breakage and repair process during translocation formation. Possible detailed nonhomologous end-joining scenarios for t(1;22) cases are presented. We propose that cryptic imbalances in phenotypically normal, balanced translocation carriers may be more common than currently appreciated.</p>
]]></description>
<dc:creator><![CDATA[Gajecka, M., Gentles, A. J., Tsai, A., Chitayat, D., Mackay, K. L., Glotzbach, C. D., Lieber, M. R., Shaffer, L. G.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.077453.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] Unexpected complexity at breakpoint junctions in phenotypically normal individuals and mechanisms involved in generating balanced translocations t(1;22)(p36;q13)]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1742</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1733</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1743?rss=1">
<title><![CDATA[[LETTERS] Dispensability of mammalian DNA]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1743?rss=1</link>
<description><![CDATA[
<p>In the lab, the <I>cis-</I>regulatory network seems to exhibit great functional redundancy. Many experiments testing enhancer activity of neighboring <I>cis-</I>regulatory elements show largely overlapping expression domains. Of recent interest, mice in which <I>cis-</I>regulatory ultraconserved elements were knocked out showed no obvious phenotype, further suggesting functional redundancy. Here, we present a global evolutionary analysis of mammalian conserved nonexonic elements (CNEs), and find strong evidence to the contrary. Given a set of CNEs conserved between several mammals, we characterize functional dispensability as the propensity for the ancestral element to be lost in mammalian species internal to the spanned species tree. We show that ultraconserved-like elements are over 300-fold less likely than neutral DNA to have been lost during rodent evolution. In fact, many thousands of noncoding loci under purifying selection display near uniform indispensability during mammalian evolution, largely irrespective of nucleotide conservation level. These findings suggest that many genomic noncoding elements possess functions that contribute noticeably to organism fitness in naturally evolving populations.</p>
]]></description>
<dc:creator><![CDATA[McLean, C., Bejerano, G.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.080184.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] Dispensability of mammalian DNA]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1751</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1743</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1752?rss=1">
<title><![CDATA[[LETTERS] Evolution of the mammalian transcription factor binding repertoire via transposable elements]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1752?rss=1</link>
<description><![CDATA[
<p>Identification of lineage-specific innovations in genomic control elements is critical for understanding transcriptional regulatory networks and phenotypic heterogeneity. We analyzed, from an evolutionary perspective, the binding regions of seven mammalian transcription factors (ESR1, TP53, MYC, RELA, POU5F1, SOX2, and CTCF) identified on a genome-wide scale by different chromatin immunoprecipitation approaches and found that only a minority of sites appear to be conserved at the sequence level. Instead, we uncovered a pervasive association with genomic repeats by showing that a large fraction of the bona fide binding sites for five of the seven transcription factors (ESR1, TP53, POU5F1, SOX2, and CTCF) are embedded in distinctive families of transposable elements. Using the age of the repeats, we established that these repeat-associated binding sites (RABS) have been associated with significant regulatory expansions throughout the mammalian phylogeny. We validated the functional significance of these RABS by showing that they are over-represented in proximity of regulated genes and that the binding motifs within these repeats have undergone evolutionary selection. Our results demonstrate that transcriptional regulatory networks are highly dynamic in eukaryotic genomes and that transposable elements play an important role in expanding the repertoire of binding sites.</p>
]]></description>
<dc:creator><![CDATA[Bourque, G., Leong, B., Vega, V. B., Chen, X., Lee, Y. L., Srinivasan, K. G., Chew, J.-L., Ruan, Y., Wei, C.-L., Ng, H. H., Liu, E. T.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.080663.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] Evolution of the mammalian transcription factor binding repertoire via transposable elements]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1762</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1752</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1763?rss=1">
<title><![CDATA[[LETTERS] E2F in vivo binding specificity: Comparison of consensus versus nonconsensus binding sites]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1763?rss=1</link>
<description><![CDATA[
<p>We have previously shown that most sites bound by E2F family members in vivo do not contain E2F consensus motifs. However, differences between in vivo target sites that contain or lack a consensus E2F motif have not been explored. To understand how E2F binding specificity is achieved in vivo, we have addressed how E2F family members are recruited to core promoter regions that lack a consensus motif and are excluded from other regions that contain a consensus motif. Using chromatin immunoprecipitation coupled with DNA microarray analysis (ChIP-chip) assays, we have shown that the predominant factors specifying whether E2F is recruited to an in vivo binding site are (1) the site must be in a core promoter and (2) the region must be utilized as a promoter in that cell type. We have tested three models for recruitment of E2F to core promoters lacking a consensus site, including (1) indirect recruitment, (2) looping to the core promoter mediated by an E2F bound to a distal motif, and (3) assisted binding of E2F to a site that weakly resembles an E2F motif. To test these models, we developed a new in vivo assay, termed eChIP, which allows analysis of transcription factor binding to isolated fragments. Our findings suggest that in vivo (1) a consensus motif is not sufficient to recruit E2Fs, (2) E2Fs can bind to isolated regions that lack a consensus motif, and (3) binding can require regions other than the best match to the E2F motif.</p>
]]></description>
<dc:creator><![CDATA[Rabinovich, A., Jin, V. X., Rabinovich, R., Xu, X., Farnham, P. J.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.080622.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] E2F in vivo binding specificity: Comparison of consensus versus nonconsensus binding sites]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1777</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1763</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1778?rss=1">
<title><![CDATA[[LETTERS] Reconfiguration of genomic anchors upon transcriptional activation of the human major histocompatibility complex]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1778?rss=1</link>
<description><![CDATA[
<p>The folding of chromatin into topologically constrained loop domains is essential for genomic function. We have identified genomic anchors that define the organization of chromatin loop domains across the human major histocompatibility complex (MHC). This locus contains critical genes for immunity and is associated with more diseases than any other region of the genome. Classical MHC genes are expressed in a cell type-specific pattern and can be induced by cytokines such as interferon-gamma (IFNG). Transcriptional activation of the MHC was associated with a reconfiguration of chromatin architecture resulting from the formation of additional genomic anchors. These findings suggest that the dynamic arrangement of genomic anchors and loops plays a role in transcriptional regulation.</p>
]]></description>
<dc:creator><![CDATA[Ottaviani, D., Lever, E., Mitter, R., Jones, T., Forshew, T., Christova, R., Tomazou, E. M., Rakyan, V. K., Krawetz, S. A., Platts, A. E., Segarane, B., Beck, S., Sheer, D.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.082313.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] Reconfiguration of genomic anchors upon transcriptional activation of the human major histocompatibility complex]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1786</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1778</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1787?rss=1">
<title><![CDATA[[LETTERS] In-depth characterization of the microRNA transcriptome in a leukemia progression model]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1787?rss=1</link>
<description><![CDATA[
<p>MicroRNAs (miRNAs) have been shown to play important roles in physiological as well as multiple malignant processes, including acute myeloid leukemia (AML). In an effort to gain further insight into the role of miRNAs in AML, we have applied the Illumina massively parallel sequencing platform to carry out an in-depth analysis of the miRNA transcriptome in a murine leukemia progression model. This model simulates the stepwise conversion of a myeloid progenitor cell by an engineered overexpression of the nucleoporin 98 (<I>NUP98</I>)&ndash;homeobox <I>HOXD13</I> fusion gene (ND13), to aggressive AML inducing cells upon transduction with the oncogenic collaborator <I>Meis1.</I> From this data set, we identified 307 miRNA/miRNA* species in the ND13 cells and 306 miRNA/miRNA* species in ND13+Meis1 cells, corresponding to 223 and 219 miRNA genes. Sequence counts varied between two and 136,558, indicating a remarkable expression range between the detected miRNA species. The large number of miRNAs expressed and the nature of differential expression suggest that leukemic progression as modeled here is dictated by the repertoire of shared, but differentially expressed miRNAs. Our finding of extensive sequence variations (isomiRs) for almost all miRNA and miRNA* species adds additional complexity to the miRNA transcriptome. A stringent target prediction analysis coupled with in vitro target validation revealed the potential for miRNA-mediated release of oncogenes that facilitates leukemic progression from the preleukemic to leukemia inducing state. Finally, 55 novel miRNAs species were identified in our data set, adding further complexity to the emerging world of small RNAs.</p>
]]></description>
<dc:creator><![CDATA[Kuchenbauer, F., Morin, R. D., Argiropoulos, B., Petriv, O. I., Griffith, M., Heuser, M., Yung, E., Piper, J., Delaney, A., Prabhu, A.-L., Zhao, Y., McDonald, H., Zeng, T., Hirst, M., Hansen, C. L., Marra, M. A., Humphries, R. K.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.077578.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] In-depth characterization of the microRNA transcriptome in a leukemia progression model]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1797</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1787</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1798?rss=1">
<title><![CDATA[[LETTERS] Transcription of foreign DNA in Escherichia coli]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1798?rss=1</link>
<description><![CDATA[
<p>Propagation of heterologous DNA in <I>E. coli</I> host cells is central to molecular biology. DNA constructs are often engineered for expression of recombinant protein in <I>E. coli</I>, but the extent of incidental transcription arising from natural regulatory sequences in cloned DNA remains underexplored. Here, we have used programmable microarrays and RT-PCR to measure, comprehensively, the transcription of <I>H. influenzae</I>, <I>P. aeruginosa</I>, and human DNA propagating in <I>E. coli</I> as bacterial artificial chromosomes. We find evidence that at least half of all <I>H. influenzae</I> genes are transcribed in <I>E. coli</I>. Highly transcribed genes are principally involved in energy metabolism, and their proximal promoter regions are significantly enriched with <I>E. coli</I> <sup>70</sup> (also known as RpoD) binding sites. <I>H. influenzae</I> genes acquired from an ancient bacteriophage Mu insertion are also highly transcribed. Compared with <I>H. influenzae</I>, a smaller proportion of <I>P. aeruginosa</I> genes are transcribed in <I>E. coli</I>, and in <I>E. coli</I> there is punctuated transcription of human DNA. The presence of foreign DNA in <I>E. coli</I> disturbs the host transcriptional profile, with expression of the <I>E. coli</I> phage shock protein operon and the flagellar gene cluster being particularly strongly up-regulated. While cross-species transcriptional activation is expected to be enabling for horizontal gene transfer in bacteria, incidental expression of toxic genes can be problematic for DNA cloning. Ongoing characterization of cross-expression will help inform the design of biosynthetic gene clusters and synthetic microbial genomes.</p>
]]></description>
<dc:creator><![CDATA[Warren, R. L., Freeman, J. D., Levesque, R. C., Smailus, D. E., Flibotte, S., Holt, R. A.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.080358.108</dc:identifier>
<dc:title><![CDATA[[LETTERS] Transcription of foreign DNA in Escherichia coli]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1805</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1798</prism:startingPage>
<prism:section>LETTERS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1806?rss=1">
<title><![CDATA[[METHODS] Overlapping euchromatin/heterochromatin- associated marks are enriched in imprinted gene regions and predict allele-specific modification]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1806?rss=1</link>
<description><![CDATA[
<p>Most genome-level analysis treats the two parental alleles equivalently, yet diploid genomes contain two parental genomes that are often epigenetically distinct. While single nucleotide polymorphisms (SNPs) can be used to distinguish these genomes, it would be useful to develop a generalized strategy for identifying candidate genes or regions showing allele-specific differences, independent of SNPs. We have explored this problem by looking for overlapping marks in the genome related to both euchromatin (histone H3 dimethyl lysine-4 [H3K4Me2]) and heterochromatin (DNA methylation [DNAm]). "Double hits" were defined by the intersection of H3K4Me2 and DNAm. For the top 5% of marks, defined by a sliding window, imprinted gene regions were enriched for double hits 5.4-fold. When the location information of CTCF binding sites were integrated, the "triple hits" were enriched 76-fold for known imprinted genes in the regions studied. The double hits in imprinted genes were found to occur usually at the site of alternative or antisense transcripts. In addition, four of four imprinted genes tested showing double hits also showed allele-specific methylation. We suggest that overlapping euchromatin/heterochromatin marks are common and are enriched for epigenetically distinct parental chromosome regions. Furthermore, we developed a novel approach to identifying allele-specific marks that is SNP independent, by fractionating using H3K4Me2 antibodies followed by DNA methylation analysis.</p>
]]></description>
<dc:creator><![CDATA[Wen, B., Wu, H., Bjornsson, H., Green, R. D., Irizarry, R., Feinberg, A. P.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.067587.108</dc:identifier>
<dc:title><![CDATA[[METHODS] Overlapping euchromatin/heterochromatin- associated marks are enriched in imprinted gene regions and predict allele-specific modification]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1813</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1806</prism:startingPage>
<prism:section>METHODS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1814?rss=1">
<title><![CDATA[[METHODS] Enredo and Pecan: Genome-wide mammalian consistency-based multiple alignment with paralogs]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1814?rss=1</link>
<description><![CDATA[
<p>Pairwise whole-genome alignment involves the creation of a homology map, capable of performing a near complete transformation of one genome into another. For multiple genomes this problem is generalized to finding a set of consistent homology maps for converting each genome in the set of aligned genomes into any of the others. The problem can be divided into two principal stages. First, the partitioning of the input genomes into a set of colinear segments, a process which essentially deals with the complex processes of rearrangement. Second, the generation of a base pair level alignment map for each colinear segment. We have developed a new genome-wide segmentation program, Enredo, which produces colinear segments from extant genomes handling rearrangements, including duplications. We have then applied the new alignment program Pecan, which makes the consistency alignment methodology practical at a large scale, to create a new set of genome-wide mammalian alignments. We test both Enredo and Pecan using novel and existing assessment analyses that incorporate both real biological data and simulations, and show that both independently and in combination they outperform existing programs. Alignments from our pipeline are publicly available within the Ensembl genome browser.</p>
]]></description>
<dc:creator><![CDATA[Paten, B., Herrero, J., Beal, K., Fitzgerald, S., Birney, E.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.076554.108</dc:identifier>
<dc:title><![CDATA[[METHODS] Enredo and Pecan: Genome-wide mammalian consistency-based multiple alignment with paralogs]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1828</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1814</prism:startingPage>
<prism:section>METHODS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1829?rss=1">
<title><![CDATA[[METHODS] Genome-wide nucleotide-level mammalian ancestor reconstruction]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1829?rss=1</link>
<description><![CDATA[
<p>Recently attention has been turned to the problem of reconstructing complete ancestral sequences from large multiple alignments. Successful generation of these genome-wide reconstructions will facilitate a greater knowledge of the events that have driven evolution. We present a new evolutionary alignment modeler, called "Ortheus," for inferring the evolutionary history of a multiple alignment, in terms of both substitutions and, importantly, insertions and deletions. Based on a multiple sequence probabilistic transducer model of the type proposed by Holmes, Ortheus uses efficient stochastic graph-based dynamic programming methods. Unlike other methods, Ortheus does not rely on a single fixed alignment from which to work. Ortheus is also more scaleable than previous methods while being fast, stable, and open source. Large-scale simulations show that Ortheus performs close to optimally on a deep mammalian phylogeny. Simulations also indicate that significant proportions of errors due to insertions and deletions can be avoided by not assuming a fixed alignment. We additionally use a challenging hold-out cross-validation procedure to test the method; using the reconstructions to predict extant sequence bases, we demonstrate significant improvements over using closest extant neighbor sequences. Accompanying this paper, a new, public, and genome-wide set of Ortheus ancestor alignments provide an intriguing new resource for evolutionary studies in mammals. As a first piece of analysis, we attempt to recover "fossilized" ancestral pseudogenes. We confidently find 31 cases in which the ancestral sequence had a more complete sequence than any of the extant sequences.</p>
]]></description>
<dc:creator><![CDATA[Paten, B., Herrero, J., Fitzgerald, S., Beal, K., Flicek, P., Holmes, I., Birney, E.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.076521.108</dc:identifier>
<dc:title><![CDATA[[METHODS] Genome-wide nucleotide-level mammalian ancestor reconstruction]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1843</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1829</prism:startingPage>
<prism:section>METHODS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1844?rss=1">
<title><![CDATA[[METHODS] Nested Patch PCR enables highly multiplexed mutation discovery in candidate genes]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1844?rss=1</link>
<description><![CDATA[
<p>Medical resequencing of candidate genes in individual patient samples is becoming increasingly important in the clinic and in clinical research. Medical resequencing requires the amplification and sequencing of many candidate genes in many patient samples. Here we introduce Nested Patch PCR, a novel method for highly multiplexed PCR that is very specific, can sensitively detect SNPs and mutations, and is easy to implement. This is the first method that couples multiplex PCR with sample-specific DNA barcodes and next-generation sequencing to enable highly multiplex mutation discovery in candidate genes for multiple samples in parallel. In our pilot study, we amplified exons from colon cancer and matched normal human genomic DNA. From each sample, we successfully amplified 96% (90 of 94) targeted exons from across the genome, totaling 21.6 kbp of sequence. Ninety percent of all sequencing reads were from targeted exons, demonstrating that Nested Patch PCR is highly specific. We found that the abundance of reads per exon was reproducible across samples. We reliably detected germline SNPs and discovered a colon tumor specific nonsense mutation in <I>APC</I>, a gene causally implicated in colorectal cancer. With Nested Patch PCR, candidate gene mutation discovery across multiple individual patient samples can now utilize the power of second-generation sequencing.</p>
]]></description>
<dc:creator><![CDATA[Varley, K. E., Mitra, R. D.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.078204.108</dc:identifier>
<dc:title><![CDATA[[METHODS] Nested Patch PCR enables highly multiplexed mutation discovery in candidate genes]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1850</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1844</prism:startingPage>
<prism:section>METHODS</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1851?rss=1">
<title><![CDATA[[RESOURCES] Mapping short DNA sequencing reads and calling variants using mapping quality scores]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1851?rss=1</link>
<description><![CDATA[
<p>New sequencing technologies promise a new era in the use of DNA sequence. However, some of these technologies produce very short reads, typically of a few tens of base pairs, and to use these reads effectively requires new algorithms and software. In particular, there is a major issue in efficiently aligning short reads to a reference genome and handling ambiguity or lack of accuracy in this alignment. Here we introduce the concept of <I>mapping quality</I>, a measure of the confidence that a read actually comes from the position it is aligned to by the mapping algorithm. We describe the software MAQ that can build assemblies by mapping shotgun short reads to a reference genome, using quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample. MAQ makes full use of mate-pair information and estimates the error probability of each read alignment. Error probabilities are also derived for the final genotype calls, using a Bayesian statistical model that incorporates the mapping qualities, error probabilities from the raw sequence quality scores, sampling of the two haplotypes, and an empirical model for correlated errors at a site. Both read mapping and genotype calling are evaluated on simulated data and real data. MAQ is accurate, efficient, versatile, and user-friendly. It is freely available at <inter-ref locator="http://maq.sourceforge.net" locator-type="url">http://maq.sourceforge.net</inter-ref>.</p>
]]></description>
<dc:creator><![CDATA[Li, H., Ruan, J., Durbin, R.]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:identifier>info:doi/10.1101/gr.078212.108</dc:identifier>
<dc:title><![CDATA[[RESOURCES] Mapping short DNA sequencing reads and calling variants using mapping quality scores]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1858</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1851</prism:startingPage>
<prism:section>RESOURCES</prism:section>
</item>

<item rdf:about="http://genome.cshlp.org/cgi/content/short/18/11/1859?rss=1">
<title><![CDATA[[ERRATUM] Erratum]]></title>
<link>http://genome.cshlp.org/cgi/content/short/18/11/1859?rss=1</link>
<description><![CDATA[]]></description>
<dc:creator><![CDATA[]]></dc:creator>
<dc:date>2008-10-30</dc:date>
<dc:title><![CDATA[[ERRATUM] Erratum]]></dc:title>
<dc:publisher>Cold Spring Harbor Laboratory Press</dc:publisher>
<prism:number>11</prism:number>
<prism:volume>18</prism:volume>
<prism:endingPage>1859</prism:endingPage>
<prism:publicationDate>2008-11-01</prism:publicationDate>
<prism:startingPage>1859</prism:startingPage>
<prism:section>ERRATUM</prism:section>
</item>

</rdf:RDF>