Dynamic transcriptional and chromatin accessibility landscape of medaka embryogenesis

  1. Qiang Tu1,3
  1. 1 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences;
  2. 2 National Institute for Basic Biology
  • * Corresponding author; email: qtu{at}genetics.ac.cn
  • Abstract

    Medaka (Oryzias latipes) has become an important vertebrate model widely used in genetics, developmental biology, environmental sciences, and many other fields. A high-quality genome sequence and a variety of genetic tools are available for this model organism. However, existing genome annotation is still rudimentary, as it was mainly based on computational prediction and short-read RNA-seq data. Here we report a dynamic transcriptome landscape of medaka embryogenesis profiled by long-read RNA-seq, short-read RNA-seq, and ATAC-seq. Integrating these datasets, we constructed a much-improved gene model set including about 17,000 novel isoforms, identified 1600 transcription factors, 1100 long non-coding RNAs, and 150,000 potential cis-regulatory elements as well. Time-series datasets provided another dimension of information. With the expression dynamics of genes and accessibility dynamics of cis-regulatory elements, we investigated isoform switching, regulatory logic between accessible elements and genes during embryogenesis. We built a user-friend medaka omics data portal to present these datasets. This resource provides the first comprehensive omics datasets of medaka embryogenesis. Ultimately, we term these three assays as the minimum ENCODE toolbox and propose the use of it as the initial and essential profiling genomic assays for model organisms that have limited data available. This work will be of great value for the research community using medaka as the model organism and many others as well.

    • Received October 31, 2019.
    • Accepted June 17, 2020.

    This manuscript is Open Access.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International license), as described at http://creativecommons.org/licenses/by/4.0/.

    OPEN ACCESS ARTICLE
    ACCEPTED MANUSCRIPT

    This Article

    1. Genome Res. gr.258871.119 Published by Cold Spring Harbor Laboratory Press

    Article Category

    ORCID

    Share

    Preprint Server