Accurate integration of multiple heterogeneous single-cell RNA-seq data sets by learning contrastive biological variation

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 3.
Figure 3.

Benchmarking scInt against other methods in the reference-based integration tasks. (A) UMAP visualizations of the density of integrated reference cells (top row) and the scatters of mapped query cells (bottom row) of simulation 2, from scInt, Seurat, Symphony, and online iNMF. Cells are colored by ground-truth cell type labels, with gray shadows representing the reference. (B) Heatmap comparing the 5-NN predicted labels (columns) and the original labels (rows) of the query. The color bar indicates the proportion of query cells per original cell type label that was predicted to be for each reference label (rows sum to 1). (C) UMAP visualizations of the density of integrated reference cells (top row) and the scatters of mapped query cells (bottom row) of the human pancreas data, from scInt, Seurat, Symphony, and online iNMF. Cells are colored by ground-truth cell type labels, with gray shadows representing the reference. The mast cells are the unique cell type in the query data and are colored dark purple. (D) Heatmap comparing the 5-NN predicted labels (columns) and the original labels (rows) of the query.

This Article

  1. Genome Res. 33: 750-762

Preprint Server