Generative Artificial Intelligence in Medical Imaging: Foundations, Progress, and Clinical Translation

Background
Medical imaging is a cornerstone of modern clinical medicine, supporting diagnostic assessment, therapeutic planning, and prognostic evaluation. However, real-world imaging is often constrained by practical limitations, including scarce and heterogeneous data, acquisition trade-offs that can introduce noise, artifacts, and loss of resolution, and incomplete longitudinal follow-up. These challenges motivate methods that can synthesize missing data, harmonize heterogeneous inputs, and augment incomplete datasets, which has become a key clinical driver for bringing generative AI into medical imaging workflows.

Research Progress
In their Research review article, “Generative Artificial Intelligence in Medical Imaging: Foundations, Progress, and Clinical Translation,” Hairong Zheng and Shanshan Wang (Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institutes of Advanced Technology,
Chinese Academy of Sciences, Shenzhen, China.) et al. provide a comprehensive overview of recent advances in generative modeling for medical imaging.

The review summarizes how generative AI is enabling capabilities such as data synthesis, image enhancement, modality translation, and spatiotemporal modeling. It covers a broad range of generative approaches, including GANs, VAEs, diffusion models, and emerging multimodal foundation architectures, and also discusses sequence modeling methods such as Transformers, Mamba, and autoregressive models.

A defining strength of the review is its clinic-facing, workflow-first organization. Rather than presenting generative methods only by architecture, the authors map how these approaches are being studied along the full imaging pathway, from acquisition and reconstruction to cross-modality synthesis, and onward to applications relevant to diagnosis, treatment, and prognosis. Framed this way, generative AI is discussed as a means to help clinicians see clearer under constrained scans, see more completely when modalities are missing, update information faster in time-sensitive settings, and model changes over time for longitudinal assessment.
To support more consistent benchmarking and clinical translation, the review introduces a three-level evaluation framework that considers pixel-level fidelity, feature-level realism, and task-level clinical relevance. In addition, the paper summarizes public datasets and benchmarking resources used in the field, and provides an expanded dataset list in the supplementary materials to facilitate reproducible development and evaluation.

Challenges and Boundaries
The authors identify several obstacles to real-world deployment. These include limited generalization under domain shift, risks of hallucinated or unreliable features, ongoing data scarcity and privacy concerns, and stringent regulatory and ethical constraints. They also note clinical considerations around output reliability, transparency for decision-making, and integration into established workflows.

Future Prospects
Looking ahead, the review discusses the convergence of generative AI with large-scale foundation models, highlighting how this synergy may support the next generation of scalable, reliable, and clinically integrated imaging systems. Beyond current foundation-model paradigms, the authors point to longer-term directions toward world models and digital twins that can simulate physiological processes and individualized disease trajectories in a dynamic and interpretable manner. They emphasize the role of multimodal information, including combinations of imaging, text, and genomics, to enable more holistic and personalized understanding of health and disease.

To realize this vision, the review highlights the need for improved robustness to data heterogeneity, incorporation of anatomical and physiological priors, uncertainty quantification, explainable design to foster clinical trust, and more efficient inference to support time-sensitive applications. It also stresses the importance of trustworthy and well-governed synthetic imaging, including hallucination detection, ethical oversight, and standardized evaluation, alongside multi-institutional foundation models and practical integration into clinical systems.

The complete study is accessible via DOI:10.34133/research.1029

https://spj.science.org/doi/10.34133/research.1029

Title: Generative Artificial Intelligence in Medical Imaging: Foundations, Progress, and Clinical Translation
Authors: SHANSHAN WANG HTTPS://ORCID.ORG/0000-0002-0575-6523, XUANRU ZHOU, CHENG LI, SHUQIANG WANG, YE LI, TAO TAN, AND HAIRONG ZHENG
Journal: RESEARCH 15 Dec 2025 Vol 8 Article ID: 1029
DOI:10.34133/research.1029

Attached files

Fig. 1 The challenges of medical imaging in clinical workflow.
Fig. 2 Architectures of generative AI models in medical imaging. (A) Generative adversarial network (GAN) comprising a generator and a discriminator. (B) Variational autoencoder (VAE) with an encoder–decoder structure and latent space mapping. (C) Diffusion probabilistic model (DPM) featuring forward diffusion and a denoising network (e.g., U-Net and DiT). (D) Sequence modeling architectures utilizing Transformer, Mamba, or autoregressive networks on image patches. (E) Foundation model pretraining architectures aligning vision and language encoders
Fig. 3 Three-Level Evaluation Pyramid for Generative Models in Medical Imaging. This figure illustrates a hierarchical evaluation framework comprising low-level, mid-level, and high-level metrics. The structure emphasizes a progression from basic image quality toward clinical applicability.

04/03/2026 Science and Technology Review Publishing House

Regions: Asia, China

Keywords: Applied science, Artificial Intelligence, Health, Medical, People in health research

Disclaimer: AlphaGalileo is not responsible for the accuracy of content posted to AlphaGalileo by contributing institutions or for the use of any information through the AlphaGalileo system.

Latest Publications

Testimonials

For well over a decade, in my capacity as a researcher, broadcaster, and producer, I have relied heavily on Alphagalileo.
All of my work trips have been planned around stories that I've found on this site.
The under embargo section allows us to plan ahead and the news releases enable us to find key experts.
Going through the tailored daily updates is the best way to start the day. It's such a critical service for me and many of my colleagues.

Koula Bouloukos, Senior manager, Editorial & Production Underknown

We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.

Peter Dunn, Director of Press and Media Relations at the University of Warwick

AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.

Generative Artificial Intelligence in Medical Imaging: Foundations, Progress, and Clinical Translation

This item is under embargo and is only visible to journalists

Latest Publications

Testimonials

Koula Bouloukos, Senior manager, Editorial & Production Underknown

Peter Dunn, Director of Press and Media Relations at the University of Warwick

Ben Deighton, SciDevNet