Quadrupia provides a comprehensive catalog of G-quadruplexes across genomes from the tree of life
- Nikol Chantzi1,
- Akshatha Nayak1,
- Fotis A. Baltoumas2,
- Eleni Aplakidou2,
- Shiau Wei Liew3,
- Jesslyn Elvaretta Galuh3,
- Michail Patsakis1,
- Austin Montgomery1,
- Camille Moeckel1,
- Ioannis Mouratidis1,
- Saiful Arefeen Sazed1,
- Wilfried Guiblet4,
- Panagiotis Karmiris-Obratański5,
- Guliang Wang6,
- Apostolos Zaravinos7,
- Karen M. Vasquez6,
- Chun Kit Kwok3,
- Georgios A. Pavlopoulos8 and
- Ilias Georgakopoulos-Soares1,9
- 1 The Pennsylvania State University College of Medicine;
- 2 BSRC "Alexander Fleming", University of Crete;
- 3 City University of Hong Kong, Shenzhen Research Institute of City University of Hong Kong;
- 4 Frederick National Laboratory for Cancer Research;
- 5 AGH University of Krakow;
- 6 The University of Texas at Austin;
- 7 European University Cyprus, Basic and Translational Cancer Research Center;
- 8 BSRC "Alexander Fleming"
Abstract
G-quadruplex DNA structures exhibit a profound influence on essential biological processes, including transcription, replication, telomere maintenance, and genomic stability. These structures have demonstrably shaped organismal evolution. However, a comprehensive, organism-wide G-quadruplex map encompassing the diversity of life has remained elusive. Here, we introduce Quadrupia, the most extensive and well-characterized G-quadruplex database to date, facilitating the exploration of G-quadruplex structures across the evolutionary spectrum. Quadrupia has identified G-quadruplex sequences in 108,449 reference genomes, with a total of 140,181,277 G-quadruplexes. The database also hosts a collection of 319,784 G-quadruplex clusters of 20 or more members, annotated by taxonomic distributions, multiple sequence alignments, profile hidden Markov models and cross-references to G-quadruplex 3D structures. Examination of G-quadruplexes across functional genomic elements in different taxa indicates preferential orientation and positioning, with significant differences between individual taxonomic groups. For example, we find that G-quadruplexes in bacteria with a single replication origin display profound preference for the leading orientation. Finally, we experimentally validate the most frequently observed G-quadruplexes using CD-spectroscopy, UV melting, and fluorescent-based approaches.
- Received July 15, 2024.
- Accepted August 21, 2025.
- Published by Cold Spring Harbor Laboratory Press
This manuscript is Open Access.
This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International license), as described at http://creativecommons.org/licenses/by/4.0/.











