spatiAlytica: Viewer-Grounded Multimodal Agentic System for Interactive Spatial Omics Analysis
spatiAlytica: Viewer-Grounded Multimodal Agentic System for Interactive Spatial Omics Analysis
Das, A.; Zhang, K.; Song, J.; Han, M.; Chen, A.; Meng, W.; Galloway, H.; Chen, P.-Y.; Jo, S.; Liu, Z.; Hasib, M. M.; Officer, A.; Sinha, H.; Chiu, Y.-C.; Gao, S.-J.; Li, L.; Huang, Y.
AbstractSpatial transcriptomics and proteomics map tissue architecture and cellular interactions, but analysis remains limited by programming demands and text-centered AI agents that lack viewer grounding and cross-turn context. We present spatiAlytica, a viewer-centric multimodal interactive agentic system embedded in the Napari viewer that enables non-programmer biologists to perform iterative, hypothesis-driven spatial omics analysis via natural language. spatiAlytica couples viewer-state serialization, agentic memory, biological concept-to-data-field mapping, code generation and debugging, Spatial VQA, and grounded interpretation to support an exploratory analysis and interpretive reasoning workflow. We introduce spatiAlyticaBench, a comprehensive benchmark spanning 222 single-turn spatial analytical coding questions, 178 multi-turn sequential workflow questions, and 7,350 image-grounded reasoning questions. spatiAlytica outperformed strong agentic baselines, while using less time and tokens. Case studies across Kaposi's sarcoma, colorectal cancer, and ovarian cancer recapitulated known spatial patterns and uncovered progressive CD8 T-cell dysfunction during KS progression.