Inferring disease progression stages in single-cell transcriptomics using a weakly supervised deep learning approach

  1. Yoshiaki Tanaka1
  1. 1Maisonneuve-Rosemont Hospital Research Center (CRHMR), Department of Medicine, University of Montreal, Quebec H1T 2M4, Canada;
  2. 2RWJMS Institute for Neurological Therapeutics, Rutgers–Robert Wood Johnson Medical School, Piscataway, New Jersey 08854, USA;
  3. 3Department of Biology, Bates College, Lewiston, Maine 04240, USA
  • Corresponding author: yoshiaki.tanaka{at}umontreal.ca
  • Abstract

    Application of single-cell/nucleus genomic sequencing to patient-derived tissues offers potential solutions to delineate disease mechanisms in humans. However, individual cells in patient-derived tissues are in different pathological stages, and hence, such cellular variability impedes subsequent differential gene expression analyses. To overcome such a heterogeneity issue, we present a novel deep learning approach, scIDST, that infers disease progression levels of individual cells with weak supervision framework. The disease progression–inferred cells display significant differential expression of disease-relevant genes, which cannot be detected by comparative analysis between patients and healthy donors. In addition, we demonstrate that pretrained models by scIDST are applicable to multiple independent data resources and are advantageous to infer cells related to certain disease risks and comorbidities. Taken together, scIDST offers a new strategy of single-cell sequencing analysis to identify bona fide disease-associated molecular features.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.278812.123.

    • Freely available online through the Genome Research Open Access option.

    • Received December 4, 2023.
    • Accepted November 26, 2024.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

    OPEN ACCESS ARTICLE

    Preprint Server