Method

Joint annotation of chromatin state and chromatin conformation reveals relationships among domain types and identifies domains of cell-type-specific expression

    • 1Department of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, USA
    • 2Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
    • 3Princess Margaret Cancer Centre, University of Toronto, ON M5G 1L7, Canada
    • 4Department of Medical Biophysics, University of Toronto, ON M5G 1L7, Canada
    • 5Department of Biological Science, The Florida State University, Tallahassee, Florida 32304, USA
    • 6Department of Electrical Engineering, University of Washington, Seattle, Washington 98195, USA
Published February 12, 2015. Vol 25 Issue 4, pp. 544-557. https://doi.org/10.1101/gr.184341.114
Download PDF Please log-in to or register for your personal account in order to access PDF Cite Article Permissions Share
cover of Genome Research Vol 36 Issue 4
Current Issue:

Abstract

The genomic neighborhood of a gene influences its activity, a behavior that is attributable in part to domain-scale regulation. Previous genomic studies have identified many types of regulatory domains. However, due to the difficulty of integrating genomics data sets, the relationships among these domain types are poorly understood. Semi-automated genome annotation (SAGA) algorithms facilitate human interpretation of heterogeneous collections of genomics data by simultaneously partitioning the human genome and assigning labels to the resulting genomic segments. However, existing SAGA methods cannot integrate inherently pairwise chromatin conformation data. We developed a new computational method, called graph-based regularization (GBR), for expressing a pairwise prior that encourages certain pairs of genomic loci to receive the same label in a genome annotation. We used GBR to exploit chromatin conformation information during genome annotation by encouraging positions that are close in 3D to occupy the same type of domain. Using this approach, we produced a model of chromatin domains in eight human cell types, thereby revealing the relationships among known domain types. Through this model, we identified clusters of tightly regulated genes expressed in only a small number of cell types, which we term “specific expression domains.” We found that domain boundaries marked by promoters and CTCF motifs are consistent between cell types even when domain activity changes. Finally, we showed that GBR can be used to transfer information from well-studied cell types to less well-characterized cell types during genome annotation, making it possible to produce high-quality annotations of the hundreds of cell types with limited available data.

Loading
Loading
Back to top