Efficient integration of spatial omics data for joint domain detection, matching, and alignment with stMSA

  1. Tao Wang1,2
  1. 1School of Computer Science, Northwestern Polytechnical University, 710072 Shaanxi, China;
  2. 2Key Laboratory of Big Data Storage and Management, Ministry of Industry and Information Technology, Northwestern Polytechnical University, 710072 Shaanxi, China;
  3. 3School of Computer Science and Engineering, Xi'an University of Technology, 710048 Shaanxi, China;
  4. 4Department of Pharmacy and Pharmaceutical Sciences, National University of Singapore, 117543 Singapore, Singapore;
  5. 5Center for Bioinformatics, School of Life Science and Technology, Harbin Institute of Technology, 150000 Heilongjiang, China;
  6. 6School of Interdisciplinary Medicine and Engineering, Harbin Medical University, 150081 Heilongjiang, China
  1. 7 These authors contributed equally to this work.

  • Corresponding authors: qhjiang{at}hit.edu.cn, Shang{at}nwpu.edu.cn, twang{at}nwpu.edu.cn
  • Abstract

    Spatial omics (SOs) are powerful methodologies that enable the study of genes, proteins, and other molecular features within the spatial context of tissue architecture. With the growing availability of SO data sets, researchers are eager to extract biological insights from larger data sets for a more comprehensive understanding. However, existing approaches focus on batch effect correction, often neglecting complex biological patterns in tissue slices, complicating feature integration and posing challenges when combining transcriptomics with other omics layers. Here, we introduce spatial multislice/omics analysis (stMSA), a deep graph contrastive learning model that incorporates graph auto-encoder techniques. stMSA is specifically designed to produce batch-corrected representations while retaining the distinct spatial patterns within each slice, considering both intra- and inter-batch relationships during integration. Extensive evaluations show that stMSA outperforms state-of-the-art methods in distinguishing tissue structures across diverse slices, even when faced with varying experimental protocols and sequencing technologies. Furthermore, stMSA effectively deciphers complex developmental trajectories by integrating spatial proteomics and transcriptomics data and excels in cross-slice matching and alignment for 3D tissue reconstruction.

    Footnotes

    • Received February 19, 2025.
    • Accepted August 7, 2025.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    This article has not yet been cited by other articles.

    This Article

    1. Genome Res. © 2025 Shu et al.; Published by Cold Spring Harbor Laboratory Press

    Article Category

    ORCID

    Share

    Preprint Server