NMRDMR Snakemake Workflow : Cross-Species Regulatory Elements from Histone Mark Peaks

Snakemake Snakemake-Report

The pipeline starts from H3K4me3, H3K4me1 & H3K27ac peaks in different species, and outputs promoters, enhancers and primed enhancers sets for each species, all mapped to a common coordinate system using a reference (mouse). H3k27ac reads density for the sets of orthologous promoters and enhancers are then extracted from .bam files and normalized across species and replicates.

This workflow was developed to define orthologous regulatory elements in the following study:

Parey et al., 2023, Phylogenetic modeling of enhancer shifts in African mole-rats reveals regulatory changes associated with tissue-specific traits.

Reproducing the analysis

Installation (requires conda)

To install snakemake in a conda environnment (for example in an env named snake), run the following commands:

Running

Output files

The pipeline generates a number of intermediary results files. The most important outputs are:

Going further

More details are provided in the documentation/ folder, including a description of the pipeline steps and how to run it on different datasets (not extensively tested).