Integration of high-throughput proteomic data and complementary omics layers with PriOmics

  1. Michael Altenbuchinger1,13
  1. 1Department of Medical Bioinformatics, University Medical Center Göttingen, 37077 Göttingen, Germany;
  2. 2Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, 30625 Hannover, Germany;
  3. 3Chair and Institute of Functional Genomics, University of Regensburg, 93053 Regensburg, Germany;
  4. 4Department of Clinical Pathology, Robert-Bosch-Krankenhaus, 70376 Stuttgart, Germany;
  5. 5Dr. Margarete Fischer-Bosch Institute of Clinical Pharmacology, 70376 Stuttgart, and University of Tübingen, Germany;
  6. 6Institute of Theoretical Physics, University of Regensburg, 93040 Regensburg, Germany;
  7. 7Department of Internal Medicine 1 (Oncology, Hematology, Clinical Immunology, and Rheumatology), Saarland University Medical School, 66421 Homburg/Saar, Germany;
  8. 8Department of Internal Medicine 1, Westpfalz-Klinikum, 67655 Kaiserslautern, Germany;
  9. 9Institute for Medical Informatics, Statistics and Epidemiology, University Leipzig, 04107 Leipzig, Germany;
  10. 10Department of Medicine A (Hematology, Oncology, Pulmonology), University Hospital Münster, 48149 Münster, Germany;
  11. 11Institute of Human Genetics, Ulm University and Ulm University Medical Center, 89081 Ulm, Germany;
  12. 12Department of Statistical Bioinformatics, University of Regensburg, 93053 Regensburg, Germany
  1. 13 These authors contributed equally to this work.

  • Corresponding author: robin.kosch{at}protonmail.com
  • Abstract

    High-throughput bottom-up proteomic data cover thousands of proteins and related co- and post-translational modifications (CTMs/PTMs). Yet, it remains an open question how to holistically explore such data and their relationship to complementary omics/phenotypic information. Graphical models are particularly suited to study molecular networks and underlying regulatory mechanisms, as they can distinguish direct from indirect relationships, aside from their generalizability to diverse data types. Here, we propose PriOmics to integrate proteomic data with complementary omics and phenotypic data. PriOmics models intensities of individual proteotypic peptides and incorporates their protein affiliation as prior knowledge to resolve statistical relationships between proteins and CTMs/PTMs. This is verified in simulation studies, which also demonstrate that PriOmics can disentangle regulatory effects of protein modifications from those of respective protein abundances. These findings are substantiated in a diffuse large B cell lymphoma (DLBCL) data set in which we integrate SWATH-MS-based proteomics with transcriptomic and phenotypic data.

    Footnotes

    • Received April 19, 2024.
    • Accepted October 9, 2025.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    This Article

    1. Genome Res. © 2025 Kosch et al.; Published by Cold Spring Harbor Laboratory Press

    Article Category

    ORCID

    Share

    Preprint Server