Unified integration of spatial transcriptomics across platforms with LLOKI

Jian Ma & Spencer Krieger · 2025-12-03

Spatial transcriptomics (ST) has transformed our understanding of tissue architecture and cellular interactions, but integrating ST data across platforms remains challenging due to differences in gene panels, data sparsity, and technical variability. Here, we introduce LLOKI, a novel framework for integrating imaging-based ST data from diverse platforms without requiring shared gene panels. LLOKI addresses ST integration through two key alignment tasks: feature alignment across technologies and batch alignment across data sets. Optimal transport-guided feature propagation adjusts data sparsity to match scRNA-seq references through graph-based imputation, enabling single-cell foundation models such as scGPT to generate unified features. Batch alignment then refines scGPT-transformed embeddings, mitigating batch effects while preserving biological variability. Evaluations on mouse brain samples from five different technologies demonstrate that LLOKI outperforms existing methods and is effective for cross-technology spatial gene program identification, and tissue slice alignment. Applying LLOKI to five ovarian cancer data sets, we identify an integrated gene program indicative of tumor-infiltrating T cells across gene panels. Together, LLOKI provides a robust foundation for cross-platform ST studies, with the potential to scale to large atlas data sets, enabling deeper insights into cellular organization and tissue environments.

Funding
Three-dimensional mapping and modeling of combinatorial interactions underlying biomolecular condensates in olfactory neuronsSpatial omics technologies to map the senescent cell microenvironmentWashU-Northwestern Genomic Variation and Function Data and Administrative Coordinating CenterComputational Methods for Next-Generation Comparative GenomicsIntegrative Machine Learning for Common Fund Spatial OmicsMultiscale Analyses of 4D Nucleome Structure and Function by Comprehensive Multimodal Data IntegrationComputational methods for studying single-cell 3D genomeMultiscale Analyses of 4D Nucleome Structure and Function by Comprehensive Multimodal Data IntegrationComputational Methods for Next-Generation Comparative GenomicsComputational methods for studying single-cell 3D genomeThree-dimensional mapping and modeling of combinatorial interactions underlying biomolecular condensates in olfactory neuronsIntegrative Machine Learning for Common Fund Spatial OmicsWashU-Northwestern Genomic Variation and Function Data and Administrative Coordinating CenterSpatial omics technologies to map the senescent cell microenvironment

NIDA NIH HHS

R21 DA061481

NCI NIH HHS

UH3 CA268202

NHGRI NIH HHS

U24 HG012070

NHGRI NIH HHS

R01 HG007352

NIH HHS

R03 OD039980

NHGRI NIH HHS

UM1 HG011593

NHGRI NIH HHS

R01 HG012303

National Institutes of Health

UM1HG011593

National Institutes of Health

R01HG007352

National Institutes of Health

R01HG012303

National Institutes of Health

R21DA061481

National Institutes of Health

R03OD039980

National Institutes of Health

U24HG012070

National Institutes of Health Common Fund Cellular Senescence Network

UH3CA268202