A Rapid Multivariate Analysis Approach to Explore Differential Spatial Protein Profiles in Tissue

06 April 2022, Version 1
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

We have developed a multivariate approach for rapid exploration of differential protein profiles acquired from distinct tissue regions. Spatially targeted proteomics is a technology for analyzing the proteome of specific cell types and functional regions within tissue. While spatial context is often essential to understanding biological processes, interpreting complex protein profiles (e.g., of key tissue subregions) can pose a challenge due to the high-dimensional nature of the data. To address this challenge, we developed a multivariate approach to explore such data and applied it to analyze a published spatially targeted proteomics dataset collected from Staphylococcus aureus-infected murine kidney, 4-days and 10-days post-infection. The multivariate data analysis process we developed rapidly filters complex biological data to determine the most relevant species from hundreds to thousands of measured molecules avoiding the more traditional univariate and targeted viewpoint of tracking individual proteins. We employ principal component analysis (PCA) for dimensionality reduction and grouping of correlated and anticorrelated proteins among regions and timepoints previously measured by mass spectrometry through micro-liquid extraction surface analysis (microLESA). Subsequently, k-means clustering of the PCA-processed data was used to group samples in an unsupervised manner. Interpretation of the resultant cluster centers revealed a subset of proteins among those detected that differentiate among spatial regions of infection over two timepoints. These proteins are involved in the glycolysis and TCA metabolomic pathways, calcium-dependent processes, and cytoskeletal organization. Gene ontology analysis of the protein subsets in each cluster uncovered patterns in the dataset used related to tissue damage and repair as well as calcium-related defense mechanisms during staphylococcal infection. By applying this analysis in an infectious disease case study, we observed differential proteomic changes across abscess regions over time, reflecting the dynamic nature of host-pathogen interactions.

Keywords

Staphylococcus aureus
mass spectrometry
bioinformatics
abscess formation
host−pathogen interface
microLESA
spatially targeted proteomics
proteomics
computational proteomics

Supplementary materials

Title
Description
Actions
Title
Spatial Proteomics Supplementary Information
Description
Supplemental figures describing missing values per dataset, silhouette score determination, and results with dataset using the full zero-filled dataset.
Actions
Title
Spatial Proteomics Code for Zero-Filled Dataset
Description
Jupyter Notebook of Code Analyzing Zero-Filled Dataset
Actions
Title
Spatial Proteomics Code for Dataset with No Imputation
Description
Jupyter Notebook of Code Analyzing Dataset with No Imputation
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.