Inferring disease progressive stages in single-cell transcriptomics using a weakly-supervised deep learning approach

  1. Yoshiaki Tanaka1,4
  1. 1 Maisonneuve-Rosemont Hospital Research Center, University of Montreal;
  2. 2 Rutgers-Robert Wood Johnson Medical School, Bates College;
  3. 3 Rutgers-Robert Wood Johnson Medical School
  • * Corresponding author; email: yoshiaki.tanaka{at}umontreal.ca
  • Abstract

    Application of single-cell/nucleus genomic sequencing to patient-derived tissues offers potential solutions to delineate disease mechanisms in human. However, individual cells in patient-derived tissues are in different pathological stages, and hence such cellular variability impedes subsequent differential gene expression analyses. To overcome such heterogeneity issue, we present a novel deep learning approach, scIDST, that infers disease progressive levels of individual cells with weak supervision framework. The inferred disease progressive cells displayed significant differential expression of disease-relevant genes, which could not be detected by comparative analysis between patients and healthy donors. In addition, we demonstrated that pretrained models by scIDST are applicable to multiple independent data resources, and advantageous to infer cells related to certain disease risks and comorbidities. Taken together, scIDST offers a new strategy of single-cell sequencing analysis to identify bona fide disease-associated molecular features.

    • Received December 4, 2023.
    • Accepted November 26, 2024.

    This manuscript is Open Access.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International license), as described at http://creativecommons.org/licenses/by/4.0/.

    Articles citing this article

    OPEN ACCESS ARTICLE
    ACCEPTED MANUSCRIPT

    This Article

    1. Genome Res. gr.278812.123 Published by Cold Spring Harbor Laboratory Press

    Article Category

    ORCID

    Share

    Preprint Server