Quadrupia provides a comprehensive catalog of G-quadruplexes across genomes from the tree of life

  1. Ilias Georgakopoulos-Soares1,9
  1. 1 The Pennsylvania State University College of Medicine;
  2. 2 BSRC "Alexander Fleming", University of Crete;
  3. 3 City University of Hong Kong, Shenzhen Research Institute of City University of Hong Kong;
  4. 4 Frederick National Laboratory for Cancer Research;
  5. 5 AGH University of Krakow;
  6. 6 The University of Texas at Austin;
  7. 7 European University Cyprus, Basic and Translational Cancer Research Center;
  8. 8 BSRC "Alexander Fleming"
  • * Corresponding author; email: izg5139{at}psu.edu
  • Abstract

    G-quadruplex DNA structures exhibit a profound influence on essential biological processes, including transcription, replication, telomere maintenance, and genomic stability. These structures have demonstrably shaped organismal evolution. However, a comprehensive, organism-wide G-quadruplex map encompassing the diversity of life has remained elusive. Here, we introduce Quadrupia, the most extensive and well-characterized G-quadruplex database to date, facilitating the exploration of G-quadruplex structures across the evolutionary spectrum. Quadrupia has identified G-quadruplex sequences in 108,449 reference genomes, with a total of 140,181,277 G-quadruplexes. The database also hosts a collection of 319,784 G-quadruplex clusters of 20 or more members, annotated by taxonomic distributions, multiple sequence alignments, profile hidden Markov models and cross-references to G-quadruplex 3D structures. Examination of G-quadruplexes across functional genomic elements in different taxa indicates preferential orientation and positioning, with significant differences between individual taxonomic groups. For example, we find that G-quadruplexes in bacteria with a single replication origin display profound preference for the leading orientation. Finally, we experimentally validate the most frequently observed G-quadruplexes using CD-spectroscopy, UV melting, and fluorescent-based approaches.

    • Received July 15, 2024.
    • Accepted August 21, 2025.

    This manuscript is Open Access.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International license), as described at http://creativecommons.org/licenses/by/4.0/.

    OPEN ACCESS ARTICLE
    ACCEPTED MANUSCRIPT

    This Article

    1. Genome Res. gr.279790.124 Published by Cold Spring Harbor Laboratory Press

    Article Category

    ORCID

    Share

    Preprint Server