High resolution annotation of zebrafish transcriptome using long-read sequencing

German Nudelman; Antonio Frasca; Brandon Kent; Kirsten C. Sadler; Stuart C. Sealfon; Martin J. Walsh; Elena Zaslavsky

doi:10.1101/gr.223586.117

High resolution annotation of zebrafish transcriptome using long-read sequencing

¹Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA;
²Center for Advanced Research on Diagnostic Assays (CARDA), Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA;
³Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA;
⁴Department of Development and Regenerative Biology, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA;
⁵Program in Biology, New York University Abu Dhabi, Abu Dhabi, UAE;
⁶Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA;
⁷The Mount Sinai Center for RNA Biology and Medicine, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA

↵8 These authors are joint first authors and contributed equally to this work.

Corresponding authors: elena.zaslavsky{at}gmail.com, martin.walsh{at}mssm.edu

Abstract

With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited, however, by gaps in zebrafish annotation, which are especially pronounced concerning transcripts dynamically expressed during zygotic genome activation (ZGA). To date, short-read sequencing has been the principal technology for zebrafish transcriptome annotation. In part because these sequence reads are too short for assembly methods to resolve the full complexity of the transcriptome, the current annotation is rudimentary. By providing direct observation of full-length transcripts, recently refined long-read sequencing platforms can dramatically improve annotation coverage and accuracy. Here, we leveraged the SMRT platform to study the transcriptome of zebrafish embryos before and after ZGA. Our analysis revealed additional novelty and complexity in the zebrafish transcriptome, identifying 2539 high-confidence novel transcripts that originated from previously unannotated loci and 1835 high-confidence new isoforms in previously annotated genes. We validated these findings using a suite of computational approaches including structural prediction, sequence homology, and functional conservation analyses, as well as by confirmatory transcript quantification with short-read sequencing data. Our analyses provided insight into new homologs and paralogs of functionally important proteins and noncoding RNAs, isoform switching occurrences, and different classes of novel splicing events. Several novel isoforms representing distinct splicing events were validated through PCR experiments, including the discovery and validation of a novel 8-kb transcript spanning multiple mir-430 elements, an important driver of early development. Our study provides a significantly improved zebrafish transcriptome annotation resource.

Footnotes

[Supplemental material is available for this article.]
Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.223586.117.

Received August 10, 2017.
Accepted July 5, 2018.

This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

Articles citing this article

CapTrap-seq: Advancing zebrafish transcriptomic research through high-fidelity full-length RNA sequencing bioRxiv May 17, 2025 0: 2025.05.12.653332v1-2025.05.12.653332

Syntenic lncRNAs exhibit DNA regulatory functions with sequence evolution bioRxiv April 29, 2024 0: 2024.04.26.588027v1-2024.04.26.588027

A maternal-to-zygotic-transition gene block on the zebrafish sex chromosome bioRxiv December 11, 2023 0: 2023.12.06.570431v1-2023.12.06.570431

SQANTI-SIM: a simulator of controlled transcript novelty for lrRNA-seq benchmark bioRxiv August 28, 2023 0: 2023.08.23.554392v1-2023.08.23.554392

Genome assembly and isoform analysis of a highly heterozygous New Zealand fisheries species, the tarakihi (Nemadactylus macropterus) bioRxiv February 22, 2022 0: 2022.02.19.481167v1-2022.02.19.481167

Long-read RNA sequencing reveals widespread sex-specific alternative splicing in threespine stickleback fish Genome Res August 1, 2021 31: 1486-1497

Combined Nanopore and Single-Molecule Real-Time Sequencing Survey of Human Betaherpesvirus 5 Transcriptome bioRxiv April 1, 2021 0: 2021.03.30.437686v1-2021.03.30.437686

Multi-sample Full-length Transcriptome Analysis of 22 Breast Cancer Clinical Specimens with Long-Read Sequencing bioRxiv July 18, 2020 0: 2020.07.15.199851v1-2020.07.15.199851

Dynamic transcriptional and chromatin accessibility landscape of medaka embryogenesis Genome Res June 1, 2020 30: 924-937

Time-course Profiling of Bovine Herpesvirus Type 1 and Host Cell Transcriptomes using Multiplatform Sequencing bioRxiv May 30, 2020 0: 2020.05.25.114843v1-2020.05.25.114843

The Spatio-Temporal Control of Zygotic Genome Activation bioRxiv June 5, 2019 0: 488056v2-488056

Full-Length Transcriptome Sequencing and the Discovery of New Transcripts in the Unfertilized Eggs of Zebrafish (Danio rerio) G3 May 29, 2019 9: 1831-1838

Multiple Long-read Sequencing Survey of Herpes Simplex Virus Lytic Transcriptome bioRxiv April 13, 2019 0: 605956v1-605956

High resolution annotation of zebrafish transcriptome using long-read sequencing

Abstract

Footnotes

Articles citing this article

This Article

Article Category

Services

Citing Articles

Google Scholar

PubMed/NCBI

ORCID

Share

Preprint Server

Current Issue

In This Issue