JOURNAL ARTICLE

Image synthesis with class‐aware semantic diffusion models for surgical scene segmentation

Yihang ZhouRebecca TowningZaid AwadStamatia Giannarou

Year: 2025 Journal:   Healthcare Technology Letters Vol: 12 (1)Pages: e70003-e70003   Publisher: Institution of Engineering and Technology

Abstract

Abstract Surgical scene segmentation is essential for enhancing surgical precision, yet it is frequently compromised by the scarcity and imbalance of available data. To address these challenges, semantic image synthesis methods based on generative adversarial networks and diffusion models have been developed. However, these models often yield non‐diverse images and fail to capture small, critical tissue classes, limiting their effectiveness. In response, a class‐aware semantic diffusion model (CASDM), a novel approach which utilizes segmentation maps as conditions for image synthesis to tackle data scarcity and imbalance is proposed. Novel class‐aware mean squared error and class‐aware self‐perceptual loss functions have been defined to prioritize critical, less visible classes, thereby enhancing image quality and relevance. Furthermore, to the authors' knowledge, they are the first to generate multi‐class segmentation maps using text prompts in a novel fashion to specify their contents. These maps are then used by CASDM to generate surgical scene images, enhancing datasets for training and validating segmentation models. This evaluation assesses both image quality and downstream segmentation performance, demonstrates the strong effectiveness and generalisability of CASDM in producing realistic image‐map pairs, significantly advancing surgical scene segmentation across diverse and challenging datasets.

Keywords:
Segmentation Computer science Artificial intelligence Class (philosophy) Image segmentation Image (mathematics) Scale-space segmentation Pattern recognition (psychology) Computer vision Machine learning

Metrics

2
Cited By
9.55
FWCI (Field Weighted Citation Impact)
20
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Radiomics and Machine Learning in Medical Imaging
Health Sciences →  Medicine →  Radiology, Nuclear Medicine and Imaging
© 2026 ScienceGate Book Chapters — All rights reserved.