DiffBoost: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model

Zheyuan Zhang; Lanhong Yao; Bin Wang; Debesh Jha; Görkem Durak; Elif Keleş; Alpay Medetalibeyoğlu; Ulaş Bağcı

doi:10.1109/tmi.2024.3519307

ScienceGate Book Chapters

JOURNAL ARTICLE

DiffBoost: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model

Zheyuan Zhang Lanhong Yao Bin Wang Debesh Jha Görkem Durak Elif Keleş Alpay Medetalibeyoğlu Ulaş Bağcı

Year: 2024 Journal: IEEE Transactions on Medical Imaging Vol: 44 (9)Pages: 3670-3682 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tmi.2024.3519307

Get Full-Text PDF Get Analytical Report

Abstract

Large-scale, big-variant, high-quality data are crucial for developing robust and successful deep-learning models for medical applications since they potentially enable better generalization performance and avoid overfitting. However, the scarcity of high-quality labeled data always presents significant challenges. This paper proposes a novel approach to address this challenge by developing controllable diffusion models for medical image synthesis, called DiffBoost. We leverage recent diffusion probabilistic models to generate realistic and diverse synthetic medical image data that preserve the essential characteristics of the original medical images by incorporating edge information of objects to guide the synthesis process. In our approach, we ensure that the synthesized samples adhere to medically relevant constraints and preserve the underlying structure of imaging data. Due to the random sampling process by the diffusion model, we can generate an arbitrary number of synthetic images with diverse appearances. To validate the effectiveness of our proposed method, we conduct an extensive set of medical image segmentation experiments on multiple datasets, including Ultrasound breast (+13.87%), CT spleen (+0.38%), and MRI prostate (+7.78%), achieving significant improvements over the baseline segmentation methods. The promising results demonstrate the effectiveness of our DiffBoost for medical image segmentation tasks and show the feasibility of introducing a first-ever text-guided diffusion model for general medical image segmentation tasks. With carefully designed ablation experiments, we investigate the influence of various data augmentations, hyper-parameter settings, patch size for generating random merging mask settings, and combined influence with different network architectures. Source code with checkpoints are available at https://github.com/NUBagciLab/DiffBoost.

Keywords:

Image segmentation Computer science Medical imaging Computer vision Artificial intelligence Image (mathematics) Segmentation Scale-space segmentation Diffusion Pattern recognition (psychology) Physics

Metrics

Cited By

13.11

FWCI (Field Weighted Citation Impact)

Refs

0.98

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Radiomics and Machine Learning in Medical Imaging

Health Sciences → Medicine → Radiology, Nuclear Medicine and Imaging

AI in cancer detection

Physical Sciences → Computer Science → Artificial Intelligence

Brain Tumor Detection and Classification

Life Sciences → Neuroscience → Neurology

DiffBoost: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model

Abstract

Metrics

Citation History

Topics

Related Documents

Enhancing Label-Efficient Medical Image Segmentation with Text-Guided Diffusion Models

Diffusion model-based text-guided enhancement network for medical image segmentation

Text-guided and Edge-guided Fusion Network for Enhancing Medical Image Segmentation

Text Knowledge-Guided Segment Anything Model for Medical Image Segmentation

TextDiff3D: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation