Functional genomics analysis of developing zebrafish and human endoderm reveals highly conserved cis-regulatory modules acting during vertebrate organogenesis

  1. Andrew C Nelson1,3
  1. University of Warwick;
  2. Southwest University, Ministry of Agriculture and Rural Affairs
  • * Corresponding author; email: a.nelson.1{at}warwick.ac.uk
  • Abstract

    Although vertebrate species are superficially diverse, they share key commonalities in overall morphology and in organ configuration and function. Maintenance of these traits during evolution is partly explained by conservation of critical genes governing embryonic development. However, for conserved genes to deliver consistent developmental outcomes between species, similar gene regulatory programs and gene expression patterns must also be maintained. The endoderm germ layer makes major contributions to the respiratory and gastrointestinal tracts and to associated organs, including the liver and pancreas. We used functional genomics approaches to identify highly conserved endodermal cis-regulatory modules (CRMs) functioning across the 400 million years of evolution separating zebrafish and humans. Our analyses suggest that there are few endoderm-specific CRMs: many CRMs governing pancreas development likely also act within the nervous system. Furthermore, these highly conserved CRMs are strongly enriched for binding sites of “neuro-pancreatic” transcription factors governing both pancreas and nervous system development, suggesting that they may function across these distinct organ systems. Additionally, we identify highly conserved CRMs potentially participating in endodermal patterning of adjacent craniofacial structures and sensory tissues. The highly conserved CRMs we identify are characterized by conserved patterns of transcription factor binding site co-occurrence. However, a rigid arrangement of binding sites is not a common characteristic of the identified CRMs, suggesting that more complex or individual grammatical rules apply. Overall, our analyses provide key insights into critical gene regulatory control during vertebrate endoderm organogenesis, and define a compendium of highly conserved CRMs that should be prioritised for analysis of neuro-pancreatic gene transcriptional control and anterior embryonic patterning.

    • Received April 24, 2025.
    • Accepted February 12, 2026.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    ACCEPTED MANUSCRIPT
