Integrated metabolomics and proteomics from voxelated cortical hemispheres of adult rhesus monkeys

oaai

OAAI Biology

OAAI Biol

OpenAccess.ai

2026 04 08T08:35:51.355+00:00

The spatial organization of molecular networks across cortex likely contributes to differences in local circuit vulnerability in aging and Alzheimer’s disease; yet many existing molecular datasets sacrifice spatial structure, sampling only a handful of regions per brain. Here, we present a framework for generating spatially registered, paired metabolomic and proteomic maps across an entire cortical hemisphere of an adult rhesus monkey, at millimeter resolution. One hemisphere each from two animals was harvested under controlled conditions, approximately flattened, and hand dissected at different sampling resolutions (roughly 2.5 and 4 mm/side) into tissue voxels. Each voxel was split after homogenization and extraction to provide matched aliquots for targeted metabolomics and deep untargeted proteomics. To handle these high dimensional data, we developed PChclust, a principal component guided feature clustering algorithm. For cross omic integration, we developed a spatially regularized sparse canonical correlation analysis (sr-sCCA), which incorporates spatial neighborhood structure via graph Laplacian smoothing. We recover meaningful biology: Molecular similarity between neighboring voxels decayed with distance in both modalities, confirming that voxelation captures spatially organized biological variance. The sr-sCCA identified joint proteome-metabolome components with coherent cortical gradients that were conserved across animals. Pathway enrichment analysis recovered brain relevant ontologies and reconstructed complete metabolic circuits from single voxels.

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License.

Integrated metabolomics and proteomics from voxelated cortical hemispheres of adult rhesus monkeys

Authors

Qiuyi Wu¹,², Alev M. Brigande¹, Michael W. Lutz³, Pixu Shi², and Anita A. Disney¹

¹ Department of Neurobiology, Duke University, Durham, NC, 27710 ² Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, 27710 ³ Department of Neurology, Duke University, Durham, NC, 27710

Corresponding author email address: anita.disney@duke.edu

Introduction

A central difficulty in understanding the most common, late-onset, form of Alzheimer’s disease (LOAD) is not a lack of molecular hypotheses; it is that many hypotheses are underdetermined by the measurements we typically make. Many of the challenges in understanding and clinically addressing LOAD arise from three facts: pathology begins ~20 years before diagnosis; pathology and function are not tightly coupled; and the paths to pathology are many. These observations imply that comparisons between diagnosed individuals and matched controls can be both reproducible and mechanistically ambiguous: they pool individuals who may share a diagnostic label but not a causal route, and they often sample tissue after much of the relevant biology has already unfolded.

One response to these challenges is to change the unit of inference. A spatially resolved molecular case study within one individual has one key advantage: at a fixed time point, risk factors and insults are held constant, and regional differences can be interpreted against that shared background. Spatially resolved studies at the scale of the whole cortex, in particular, are likely to be important because many LOAD-associated pathologies follow predictable spatiotemporal patterns. If these spatial patterns reveal regional differences in vulnerability or resilience, then measurements that erase spatial structure discard relevant information.

Despite this potential, most molecular datasets in Alzheimer’s disease neuroscience are spatially coarse. We typically sample a handful of regions, homogenize each, and search for associations with clinical or pathological labels. This strategy cannot represent gradients, boundaries, or patchy structure across the cortical sheet, and makes it difficult to distinguish true biological differences from differences in cellular composition, vascular content, or local microenvironment.

Here, we present a framework that enables spatially resolved molecular comparisons across the cortex, using paired metabolomics and proteomics. Our underlying motivation is to work toward understanding the LOAD prodrome, but the framework is general: it produces spatially registered molecular maps that can be aligned to pathology, cytoarchitecture, and connectivity.

Why the cortex of rhesus monkeys? Many LOAD-relevant vulnerability patterns are properties of cortex, which is a structured sheet embedded with primate-specific association networks. Sufficiently detailed spatial mapping is difficult to obtain from humans — particularly in the prodromal phase — because the relevant individuals are hard to identify pre-mortem, documentation of environmental factors is limited, and preservation of biomolecules is challenging under typical harvest conditions. In rhesus monkeys, tissue collection can be controlled in ways that reduce these confounds.

Why paired metabolomics and proteomics? The biological questions motivating LOAD research — energetics, oxidative stress, lipid remodeling, neurotransmitter metabolism, proteostasis — are pathway questions. The proteome reflects catalytic and structural capacity; the metabolome reflects biochemical activity at the time of measurement; either layer alone is easy to over-interpret. Paired measurements constrain interpretation by asking whether a pathway-level narrative has support on both sides of the enzyme–substrate divide.

Methods (Summary)

Animals: Two adult female rhesus monkeys (Macaca mulatta; A27, 22 years; A26, 24 years) from the Oregon National Primate Research Center were used. Both were pre/peri-menopause at tissue harvest.

Tissue sample collection: Animals were sedated, prepared for surgery, and euthanized by propofol overdose prior to rapid transcardial perfusion with chilled PBS. The brain was bisected, and one hemisphere was separated from subcortical structures, cooled, and approximately flattened. The flattened cortex was voxelated by hand on dry ice into small tissue blocks (A26: ~4 mm/side; A27: ~2.5 mm/side). Voxels were immediately frozen and stored at -80°C.

Exsanguination control: A third animal (U5, juvenile male) had brain harvested without exsanguination, flash-frozen, and sampled by 3-mm punch biopsy.

Targeted metabolomics: Samples were profiled using the Quant 500 kit (Biocrates). A26: 406 samples; A27: 549 samples. Of 630 targeted metabolites, 328 (A26) and 343 (A27) were retained after quality filtering.

Untargeted proteomics: Data-independent acquisition (DIA) proteomics was performed. A26: 9,990 protein groups; A27: 10,253 protein groups after quality filtering.

PChclust: A principal component guided feature clustering algorithm that aggregates co-linear variables into representative clusters, with clusters represented by their first principal component.

Spatially regularized sparse CCA (sr-sCCA): Incorporates spatial neighborhood structure via graph Laplacian smoothing. Uses Delaunay triangulation to define spatial neighbors.

Pathway enrichment analysis: Used Metascape for pathway analysis with GO biological processes, Reactome, KEGG, and WikiPathways databases.

Results

The entire cortical sheet of one brain hemisphere was flattened and voxelated for two animals, yielding 237 (A26) and 511 (A27) unique mapped cortex locations. After quality control and matching, 233 (A26) and 508 (A27) samples had both metabolomic and proteomic data.

Targeted Metabolomics: Analytical Performance and Quality Control

Of 630 targeted metabolites, 591 (A26) and 630 (A27) were detectable. After removing analytes with excessive missingness, 328 (A26) and 343 (A27) were retained. Within-plate technical reproducibility was high, with triglycerides and fatty acids showing higher variability (17–21%). PCA confirmed central clustering of study pool quality control (SPQC) replicates relative to study samples.

Untargeted Proteomics: Analytical Performance and Quality Control

Spectral library construction yielded 13,153 (A26) and 13,645 (A27) protein groups. After quality filtering, 9,990 (A26) and 10,253 (A27) protein groups were quantified. Per-sample identification rates averaged ~8,250 (A26) and ~8,600 (A27) protein groups per plate. Technical reproducibility was excellent: median CVs for SPQC were 6.2% (prep) and 6.4% (tech) for A26.

Table

	A26 Metabolomics	A27 Metabolomics	A26 Proteomics	A27 Proteomics
Samples submitted	432	549	432	549
Plates	6	7	6	7
Features detected	591 metabolites	630 metabolites	13,153 protein groups	13,645 protein groups
Features after QC	328 metabolites	343 metabolites	9,990 protein groups	10,253 protein groups
Matched cortex samples (final)	233	508	233	508

Spatially Regularized Sparse Canonical Correlation (sr-sCCA) Analysis

Because neighboring cortical voxels are not statistically independent, we developed sr-sCCA that incorporates spatial neighborhood structure via graph Laplacian smoothing. Figure 3 shows spatial distributions of the first (V1, U1) and third (V3, U3) canonical variate pairs for both animals. In both animals, metabolomic and proteomic canonical scores exhibit broadly similar spatial patterns: a mediolateral gradient for both first V and U components, and an anteromedial-posterolateral gradient for the third metabolomic component. Regions with higher metabolomic scores generally corresponded to regions with higher proteomic scores.

Figure 3

Spatial maps created by projecting metabolomic canonical scores (V) and proteomic canonical scores (U) from sr-sCCA onto cortical spatial coordinates. (a) Spatial maps for the first component’s metabolomic (V1) and proteomic (U1) scores in A26. (b) Third component (V3/U3) in A26. © First component in A27. (d) Third component in A27. (e) A27-derived sr-sCCA canonical vectors for component 1 projected onto A26 data. (f) Component 3 projected onto A26 data.

Spatial Decay of Molecular Similarity: Sampling Density Study

The two animals were voxelated at different resolutions (~4 mm for A26, ~2.5 mm for A27) to assess sampling density requirements. Across both metabolomic and proteomic data, average correlation decreases as neighbor order increases, indicating spatial decay of molecular similarity. A27’s denser sampling resolved regional heterogeneity in correlation structure in the metabolomic data that was not detectable in A26. This difference was less pronounced for proteomics.

Figure 4

Spatial correlation maps for (a) Metabolomics A26, (b) Proteomics A26, © Metabolomics A27, (d) Proteomics A27 at 1st, 2nd, 3rd neighbor orders (A26) or 1st, 3rd, 5th (A27). Both sampling resolutions captured spatial decay of molecular similarity.

Cross-Animal Reproducibility of Spatial Canonical Components

The A27-derived sr-sCCA canonical vectors were projected onto A26 data. For component 1, the projected maps showed a more coherent mediolateral gradient than evident in A27’s own sr-sCCA, particularly for proteomics. For component 3, A26’s own proteomic scores showed little spatial organization; the projection resolved this into a clear anteromedial-to-posterolateral gradient.

Effect of Exsanguination on Metabolomic Profiles

PCA of combined, QC3-normalized data showed clear separation of U5 (unperfused) from A26 and A27 along PC1, which accounted for 40.2% of total variance. A26 and A27 samples were intermixed despite originating from different animals and batches. Metabolites loading positively on PC1 (enriched in exsanguinated tissue) were dominated by brain-parenchymal membrane lipids: phosphatidylcholines, hexosylceramides, ceramides, and sphingomyelins. Metabolites loading negatively (enriched in blood-containing tissue) included triglycerides, diacylglycerols, spermine, and p-cresol sulfate.

Proteomic Clusters based on PChclust are Enriched with Proteins Associated with Signaling and Regulation

PChclust identified 5,041 clusters ranging from 1 to 638 proteins. Fifty-nine top-scoring clusters were used for pathway enrichment analysis. Three major pathways are highlighted:

Synaptic signaling, vesicle-mediated transport, neurotransmitters, cell-cell adhesion: This cluster contains 122 proteins. FDR significant pathways include regulation of synapse organization, modulation of chemical synaptic transmission, cell junction organization, and neuron projection development.

Mitochondria, aerobic respiration: This cluster contains 27 proteins. FDR significant pathways include mitochondrial protein degradation, mitochondrion organization, carboxylic acid metabolic process, and mitochondrial fission.

Splicing: Splicing proteins are highly represented in one cluster of 638 proteins. FDR significant pathways include spliceosome, chromatin remodeling, regulation of RNA splicing, and epigenetic regulation of gene expression.

Figure 6

Metascape network graphs for three protein clusters: (a) Synaptic signaling, vesicle-mediated transport, neurotransmitters, cell-cell adhesion; (b) Mitochondria, aerobic respiration; © Splicing.

Integrated Pathway Analysis Reconstructs Neuronal Metabolic Processes

Synaptic signaling, vesicle-mediated transport, neurotransmitters, cell-cell adhesion: Joint pathway analysis identified 211 pathways (33 FDR significant, p < 0.05). Pathways included alanine, aspartate and glutamate metabolism, cysteine and methionine metabolism, and ABC transporters. The synaptic vesicle cycle pathway identified 5 enriched metabolites and 2 proteins, covering both inhibitory (GABA, glycine) and excitatory (glutamic acid) signaling.

Mitochondria, aerobic respiration: Joint pathway analysis identified 201 pathways (35 FDR significant, p < 0.05). The valine, leucine, and isoleucine degradation pathway showed balance between proteins and metabolites. Glutathione metabolism was identified with 1 key protein (MGST3) and 7 metabolites.

Splicing: Joint pathway analysis identified 261 pathways (19 FDR significant, p < 0.05). The spliceosome was the foremost pathway.

Integrated pathway analysis of full proteomic and metabolomic results: Analysis identified 346 pathways (117 FDR significant, p < 0.05). Major pathways included endocytosis, axon guidance, glutamatergic synapse, sphingolipid signaling, and dopaminergic synapse. Neurodegenerative disease pathways had large numbers of mapped proteins including Alzheimer’s disease (271 proteins) and Parkinson’s disease (200).

Table

Protein cluster	Pathway	Analyte	Name
Cluster 3246: synaptic signaling, cell junction organization	Synaptic vesicle cycle	Protein	solute carrier family 6 member 7 (SLC6A7)
		Protein	calcium voltage-gated channel subunit alpha1 B (CACNA1B)
		Metabolite	gamma-Aminobutyric acid
		Metabolite	Glutamic acid
		Metabolite	Glycine
		Metabolite	Serotonin
		Metabolite	Histamine
Cluster 3986: mitochondria, aerobic respiration	Valine, leucine and isoleucine degradation	Protein	isovaleryl-CoA dehydrogenase (IVD)
		Protein	methylmalonyl-CoA mutase (MMUT)
		Protein	acyl-CoA dehydrogenase family member 8 (ACAD8)
		Metabolite	Leucine
		Metabolite	L-Valine
		Metabolite	Isoleucine
		Protein	microsomal glutathione S-transferase 3 (MGST3)
	Glutathione metabolism	Metabolite	L-Cysteine
		Metabolite	Glutamic acid
		Metabolite	Glycine
		Metabolite	Ornithine
		Metabolite	Putrescine
		Metabolite	Spermidine
		Metabolite	Spermine

Discussion

We present a framework for generating spatially registered, paired metabolomic and proteomic maps across an entire cortical hemisphere in the rhesus monkey, together with analysis tools that explicitly incorporate spatial dependence. The core design decision was to have assays paired at the sample level rather than integrated by post hoc alignment. To handle dimensionality and collinearity, we developed PChclust. For cross-omic integration, we developed sr-sCCA, which incorporates spatial neighborhood structure.

Several lines of evidence indicate that our analysis captures real biology at the spatial scale of our measurements. Both sampling resolutions captured spatial decay of molecular similarity, but denser sampling in A27 resolved regional heterogeneity in the metabolome not detectable in A26. At both resolutions, sr-sCCA identified joint proteome–metabolome components that were spatially coherent, and cross-animal projection revealed conserved spatial structure. At the pathway level, unsupervised proteomic clustering recovered major neuronal processes — synaptic signaling, mitochondrial respiration, RNA splicing — and integrated pathway analysis reconstructed complete metabolic circuits within single voxels. Our recovery of neurotransmitter pathways from unsupervised spatial decomposition of bulk homogenized cortical tissue provides biological face validity.

Comparison with other spatial multi-omics approaches: Current spatial molecular methods offer micrometer-scale resolution but face fundamental spatial coverage problems for large brains. Our approach sacrifices resolution (millimeters rather than micrometers) and gains whole-hemisphere coverage and deep molecular profiling (~10,000 protein groups and ~330 metabolites per voxel). For questions about vulnerability and resilience in aging and Alzheimer’s disease, relevant spatial patterns span centimeters, making this trade-off reasonable.

Why non-human primates, and why exsanguination matters: Spatial patterns of pathology in AD have fine-grained features across association cortex for which rodent models offer limited homology. Dense spatial mapping would be difficult to obtain from humans directly; analyzing a whole human cortical hemisphere represents significant opportunity cost. Furthermore, preservation and isolation of biomolecules in human samples is challenging. The exsanguination-associated variance dominated metabolomic data structure (PC1 ~40%), driven by blood-borne lipids rather than brain-parenchymal species. The PC1 axis did not correlate with sr-sCCA components, indicating spatial patterns in A26 and A27 are not primarily explained by blood-related effects.

Limitations: The most obvious limitation is sample size. While primarily a methods-development report, with metabolome and proteome from the same homogenate and maps defined across each animal’s cortex, the hundreds of voxels per animal are the biological replicates. Cross-animal projection provides evidence of conserved spatial patterns. The targeted metabolomics panel constrains coverage to ~330 metabolites after quality filtering. Voxel sizes (~2.5–4 mm) do not support laminar or columnar resolution. The tissue handling timeline (~2.5 hours from anoxia to last frozen voxel) introduces possibility of metabolite degradation; dissection order was counterbalanced and no opposing spatial gradients were observed.

What our method enables: We do not yet know at what spatial scales molecular vulnerability and resilience patterns are organized in the aging primate cortex. The framework we developed makes it feasible to ask these questions across a series of animals spanning the aging trajectory, with paired molecular and histopathological atlases. The methods are our contributions; insights will come from applying them.