Modeling gene interactions in polygenic prediction via geometric deep learning

  1. Sai Zhang5,6
  1. 1School of Mathematical Sciences and LPMC, Nankai University, Tianjin, 300071, China;
  2. 2Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, 100084, China;
  3. 3School of Engineering, Research Center for Industries of the Future, Westlake University, Hangzhou, 310030, Zhejiang, China;
  4. 4Department of Genetics, Center for Genomics and Personalized Medicine, Stanford University School of Medicine, Stanford, California 94304, USA;
  5. 5Department of Epidemiology, University of Florida, Gainesville, Florida 32603, USA;
  6. 6Departments of Biostatistics & Biomedical Engineering, UF Genetics Institute, University of Florida, Gainesville, Florida 32603, USA
  • Corresponding authors: zengjy{at}westlake.edu.cn, mpsnyder{at}stanford.edu, sai.zhang{at}ufl.edu
  • Abstract

    Polygenic risk score (PRS) is a widely used approach for predicting individuals’ genetic risk of complex diseases, playing a pivotal role in advancing precision medicine. Traditional PRS methods, predominantly following a linear structure, often fall short in capturing the intricate relationships between genotype and phenotype. In this study, we present PRS-Net, an interpretable geometric deep learning–based framework that effectively models the nonlinearity of biological systems for enhanced disease prediction and biological discovery. PRS-Net begins by deconvoluting the genome-wide PRS at the single-gene resolution and then explicitly encapsulates gene–gene interactions leveraging a graph neural network (GNN) for genetic risk prediction, enabling a systematic characterization of molecular interplay underpinning diseases. An attentive readout module is introduced to facilitate model interpretation. Extensive tests across multiple complex traits and diseases demonstrate the superior prediction performance of PRS-Net compared with a wide range of conventional PRS methods. The interpretability of PRS-Net further enhances the identification of disease-relevant genes and gene programs. PRS-Net provides a potent tool for concurrent genetic risk prediction and biological discovery for complex diseases.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.279694.124.

    • Freely available online through the Genome Research Open Access option.

    • Received June 14, 2024.
    • Accepted November 14, 2024.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server