A perspective on developing foundation models for analyzing spatial transcriptomic data
en-GBde-DEes-ESfr-FR

A perspective on developing foundation models for analyzing spatial transcriptomic data

01/04/2026 Frontiers Journals

Foundation models (FMs), which are deep learning models pretrained on large-scale data and applied to diverse downstream tasks, have transformed natural language processing and multimodal AI. However, in spatial transcriptomics (ST), no FM has yet demonstrated the capacity to generate novel, validated biological discoveries. The authors argue that this gap exists because ST data lack an explicit sequence-like structure, are noisy, and are more costly to collect than single-cell RNA sequencing data, making them unsuitable for simply reusing existing single-cell FMs. Therefore, how to leverage ST data to construct better foundation models is a highly promising research direction that warrants further exploration.
The paper distinguishes two types of FMs for ST analysis. Seq-based FMs are pretrained directly on large-scale ST sequencing data using self-supervised learning, with examples including NicheCompass, Nicheformer, STFormer, and CellPLM. Knowledge-based FMs instead leverage existing LLMs or large multimodal models pretrained on biological text or pathology images, such as QuST-LLM and Geneverse, to transfer general knowledge into spatial analysis. The authors also highlight an emerging hybrid approach combining both paradigms, as seen in spEMO and scGPT-spatial. Details are summarized in Figure 1.
The authors argue that FMs should tackle substantive, high-impact problems rather than simple tasks such as basic clustering. Specifically, FMs should help automate and standardize preprocessing pipelines — including quality control, normalization, and annotation — to reduce subjectivity and improve reproducibility across studies. They should also enhance performance on key downstream tasks such as cell-type annotation, spatial niche clustering, gene expression imputation, and spatial deconvolution.
A major opportunity for spatial FMs lies in accelerating biological discovery, reducing the need for costly wet-lab experiments. By analogy with tools like ChemCrow in chemistry, an FM-powered AI agent for ST data could identify novel cell types, predict perturbation effects, and explore spatially induced biological patterns. These areas remain largely unexplored. The paper stresses that the value of an FM must be weighed against the cost of human resources and model training.
In the future, several challenges must be addressed for ST FMs to reach their potential. These include collecting high-quality and diverse training data, designing pretraining objectives appropriate for non-sequential transcriptomic data, and building rigorous benchmarking frameworks that go beyond low-level tasks. Computational costs must also be carefully managed, and the authors advocate for open-sourcing models at multiple scales and providing online demos to make these tools broadly accessible to the research community.

DOI
10.1002/qub2.70010
Attached files
  • 59789875.png
01/04/2026 Frontiers Journals
Regions: Asia, China
Keywords: Science, Life Sciences

Disclaimer: AlphaGalileo is not responsible for the accuracy of content posted to AlphaGalileo by contributing institutions or for the use of any information through the AlphaGalileo system.

Testimonials

For well over a decade, in my capacity as a researcher, broadcaster, and producer, I have relied heavily on Alphagalileo.
All of my work trips have been planned around stories that I've found on this site.
The under embargo section allows us to plan ahead and the news releases enable us to find key experts.
Going through the tailored daily updates is the best way to start the day. It's such a critical service for me and many of my colleagues.
Koula Bouloukos, Senior manager, Editorial & Production Underknown
We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.
Peter Dunn, Director of Press and Media Relations at the University of Warwick
AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.
Ben Deighton, SciDevNet

We Work Closely With...


  • e
  • The Research Council of Norway
  • SciDevNet
  • Swiss National Science Foundation
  • iesResearch
Copyright 2026 by AlphaGalileo Terms Of Use Privacy Statement