Long terminal repeats power evolution of genes and gene expression programs in mammalian oocytes and zygotes

  1. Petr Svoboda2,5
  1. 1 University of Zagreb;
  2. 2 Institute of Molecular Genetics, Academy of Sciences of the Czech Republic;
  3. 3 University of Pennsylvania;
  4. 4 The University of Tokyo
  1. * Corresponding author; email: svobodap{at}img.cas.cz

Abstract

Retrotransposons are 'copy-and-paste' insertional mutagens that substantially contribute to mammalian genome content. Retrotransposons often carry long terminal repeats (LTRs) for retrovirus-like reverse transcription and integration into the genome. We report an extraordinary impact of a group of LTRs from the mammalian endogenous retrovirus-related ERVL retrotransposon class on gene expression in the germline and beyond. In mouse, we identified >800 LTRs from ORR1, MT, MT2, and MLT families, which resemble mobile gene-remodeling platforms that supply promoters and first exons. The LTR-mediated gene remodeling also extends to hamster, human, and bovine oocytes. The LTRs function in a stage-specific manner during the oocyte-to-embryo transition by activating transcription, altering protein-coding sequences, producing non-coding RNAs, and even supporting evolution of new protein-coding genes. These functions result, for example, in recycling processed pseudogenes into mRNAs or lncRNAs with regulatory roles. The functional potential of the studied LTRs is even higher because we show that dormant LTR promoter activity can rescue loss of an essential upstream promoter. We also report a novel protein-coding gene evolution - D6Ertd527e, where an MT LTR provided a promoter and the 5' exon with a functional start codon while the bulk of the protein-coding sequence evolved through an CAG repeat expansion. Altogether, ERVL LTRs provide molecular mechanisms for stochastically scanning, rewiring, and recycling genetic information on an extraordinary scale. ERVL LTRs thus offer means for a comprehensive survey of genome's expression potential, tightly intertwining with gene expression and evolution in the germline.

  • Received September 19, 2016.
  • Accepted May 15, 2017.

This manuscript is Open Access.

This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International license), as described at http://creativecommons.org/licenses/by-nc/4.0/.

Articles citing this article

OPEN ACCESS ARTICLE
ACCEPTED MANUSCRIPT

Preprint Server