Prototype-Guided Zero-Shot Medical Image Segmentation with Large Vision-Language Models

Huong Pham; Samuel Cheng

doi:10.3390/app152111441

JOURNAL ARTICLE

Year: 2025; Journal: Applied Sciences; Vol: 15 (21); Pages: 11441-11441; Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Building on advances in promptable segmentation models, this work introduces a framework that integrates Large Vision-Language Model (LVLM) bounding box priors with prototype-based region of interest (ROI) selection to improve zero-shot medical image segmentation. Unlike prior methods such as SaLIP, which often misidentify regions due to reliance on text–image CLIP similarity, the proposed approach leverages visual prototypes to mitigate language bias and enhance ROI ranking, resulting in more accurate segmentation. Bounding box estimation is further strengthened through systematic prompt engineering to optimize LVLM performance across diverse datasets and imaging modalities. Evaluation was conducted on three publicly available benchmark datasets—CC359 (brain MRI), HC18 (fetal head ultrasound), and CXRMAL (chest X-ray)—without any task-specific fine-tuning. The proposed method achieved substantial improvements over prior approaches. On CC359, it reached a Dice score of 0.95 ± 0.06 and a mean Intersection-over-Union (mIoU) of 0.91 ± 0.10. On HC18, it attained a Dice score of 0.82 ± 0.20 and mIoU of 0.74 ± 0.22. On CXRMAL, the model achieved a Dice score of 0.90 ± 0.08 and mIoU of 0.83 ± 0.12. These standard deviations reflect variability across test images within each dataset, indicating the robustness of the proposed zero-shot framework. These results demonstrate that integrating LVLM-derived bounding box priors with prototype-based selection substantially advances zero-shot medical image segmentation.
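The Dice and mIoU figures quoted above are per-image overlap scores summarized as mean ± standard deviation over a test set. The following is a minimal sketch of that computation in plain Python, not the authors' evaluation code. The function names `dice_and_iou` and `summarize`, the representation of masks as nested lists of 0/1, the convention for two empty masks, and the use of the population standard deviation are all assumptions made for illustration; the abstract does not specify them.

```python
from statistics import mean, pstdev


def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks given as equally sized 2D lists of 0/1."""
    inter = pred_sum = target_sum = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += 1 if (p and t) else 0
            pred_sum += 1 if p else 0
            target_sum += 1 if t else 0
    union = pred_sum + target_sum - inter
    if union == 0:
        # Both masks are empty: treated as perfect agreement (assumed convention).
        return 1.0, 1.0
    dice = 2 * inter / (pred_sum + target_sum)
    iou = inter / union
    return dice, iou


def summarize(pairs):
    """Aggregate per-image scores into (mean, std) for Dice and IoU.

    `pairs` is a list of (predicted_mask, ground_truth_mask) tuples.
    The population standard deviation is used here; this is an assumption.
    """
    scores = [dice_and_iou(pred, target) for pred, target in pairs]
    dice_scores = [d for d, _ in scores]
    iou_scores = [i for _, i in scores]
    return {
        "dice": (mean(dice_scores), pstdev(dice_scores)),
        "miou": (mean(iou_scores), pstdev(iou_scores)),
    }


if __name__ == "__main__":
    # Two toy 2x2 image pairs, hypothetical data for illustration only.
    toy_pairs = [
        ([[1, 1], [0, 0]], [[1, 0], [0, 0]]),
        ([[0, 1], [0, 1]], [[0, 1], [0, 1]]),
    ]
    for name, (avg, std) in summarize(toy_pairs).items():
        print(f"{name}: {avg:.2f} ± {std:.2f}")
```

Because the standard deviation is taken over per-image scores, a wide spread such as the 0.82 ± 0.20 Dice reported on HC18 points to larger image-to-image variation than the 0.95 ± 0.06 reported on CC359.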

Topics

AI in cancer detection

Physical Sciences → Computer Science → Artificial Intelligence

COVID-19 diagnosis using AI

Health Sciences → Medicine → Radiology, Nuclear Medicine and Imaging

Radiomics and Machine Learning in Medical Imaging

Health Sciences → Medicine → Radiology, Nuclear Medicine and Imaging
