Unsupervised cell functional annotation for single-cell RNA-seq

  1. Ziv Bar-Joseph1,3
  1. 1 Carnegie Mellon University;
  2. 2 McGill University Health Centre
  • * Corresponding author; email: zivbj{at}cs.cmu.edu
  • Abstract

    One of the first steps in the analysis of single-cell RNA sequencing data (scRNA-seq) is the assignment of cell types. While a number of supervised methods have been developed for this, in most cases such assignment is performed by first clustering cells in low-dimensional space and then assigning cell types to different clusters. To overcome noise and to improve cell type assignments we developed UNIFAN, a neural network method that simultaneously clusters and annotates cells using known gene sets. UNIFAN combines both, low-dimensional representation for all genes and cell specific gene set activity scores to determine the clustering. We applied UNIFAN to human and mouse scRNA-seq datasets from several different organs. As we show, by using knowledge on gene sets, UNIFAN greatly outperforms prior methods developed for clustering scRNA-seq data. The gene sets assigned by UNIFAN to different clusters provide strong evidence for the cell type that is represented by this cluster making annotations easier.

    • Received January 17, 2022.
    • Accepted June 10, 2022.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    This article has not yet been cited by other articles.

    Related Articles

    ACCEPTED MANUSCRIPT

    This Article

    1. Genome Res. gr.276609.122 Published by Cold Spring Harbor Laboratory Press

    Article Category

    ORCID

    Related Content

    Share

    Preprint Server