Capture of a functionally active methyl-CpG binding domain by an arthropod retrotransposon family

  1. Ryan Lister1,2
  1. 1Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, Western Australia, 6009, Australia;
  2. 2Harry Perkins Institute of Medical Research, Perth, Western Australia, 6009, Australia
  • Corresponding authors: alex.demendoza{at}uwa.edu.au, ryan.lister{at}uwa.edu.au
  • Abstract

    The repressive capacity of cytosine DNA methylation is mediated by recruitment of silencing complexes by methyl-CpG binding domain (MBD) proteins. Despite MBD proteins being associated with silencing, we discovered that a family of arthropod Copia retrotransposons have incorporated a host-derived MBD. We functionally show how retrotransposon-encoded MBDs preferentially bind to CpG-dense methylated regions, which correspond to transposable element regions of the host genome, in the myriapod Strigamia maritima. Consistently, young MBD-encoding Copia retrotransposons (CopiaMBD) accumulate in regions with higher CpG densities than other LTR-retrotransposons also present in the genome. This would suggest that retrotransposons use MBDs to integrate into heterochromatic regions in Strigamia, avoiding potentially harmful insertions into host genes. In contrast, CopiaMBD insertions in the spider Stegodyphus dumicola genome disproportionately accumulate in methylated gene bodies compared with other spider LTR-retrotransposons. Given that transposons are not actively targeted by DNA methylation in the spider genome, this distribution bias would also support a role for MBDs in the integration process. Together, these data show that retrotransposons can co-opt host-derived epigenome readers, potentially harnessing the host epigenome landscape to advantageously tune the retrotransposition process.

    Footnotes

    • Received September 5, 2018.
    • Accepted June 20, 2019.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents

    Preprint Server