Unveiling the functional fate of duplicated genes through expression profiling and structural analysis

  1. Irene Julca1,2
  1. 1SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland;
  2. 2Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland;
  3. 3BioSoft Research UK, London EC2A 4NE, United Kingdom;
  4. 4Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California 90033, USA
  • Corresponding author: irene.julca{at}unil.ch
  • Abstract

    Gene duplication is a major evolutionary source of functional innovation. Following duplication events, gene copies (paralogs) may undergo various fates, including retention with functional modifications (such as subfunctionalization or neofunctionalization) or loss. When paralogs are retained, this results in complex orthology relationships, including one-to-many or many-to-many. In such cases, determining which one-to-one pair is more likely to have conserved functions can be challenging. It has been proposed that, following gene duplication, the copy that diverges more slowly in sequence is more likely to maintain the ancestral function—referred to here as “the least diverged ortholog (LDO) conjecture.” This study explores this conjecture, using a novel method to identify asymmetric evolution of paralogs and applying it to all gene families across the Tree of Life in the PANTHER database. Structural data for over 1 million proteins and expression data for 16 animals and 20 plants are used to investigate functional divergence following duplication. This analysis, the most comprehensive to date, reveals that, whereas the majority of paralogs display similar rates of sequence evolution, significant differences in branch lengths following gene duplication can be correlated with functional divergence. Overall, the results support the least diverged ortholog conjecture, suggesting that the least diverged ortholog tends to retain the ancestral function, whereas the most diverged ortholog (MDO) may acquire a new, potentially specialized role.

    Footnotes

    • Received October 29, 2024.
    • Accepted August 7, 2025.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents

    Preprint Server