JOURNAL ARTICLE

CLIP-Driven Prototype Network for Few-Shot Semantic Segmentation

Shi-Cheng GuoShangkun LiuJingyu WangWeimin ZhengChengyu Jiang

Year: 2023 Journal:   Entropy Vol: 25 (9)Pages: 1353-1353   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Recent research has shown that visual–text pretrained models perform well in traditional vision tasks. CLIP, as the most influential work, has garnered significant attention from researchers. Thanks to its excellent visual representation capabilities, many recent studies have used CLIP for pixel-level tasks. We explore the potential abilities of CLIP in the field of few-shot segmentation. The current mainstream approach is to utilize support and query features to generate class prototypes and then use the prototype features to match image features. We propose a new method that utilizes CLIP to extract text features for a specific class. These text features are then used as training samples to participate in the model’s training process. The addition of text features enables model to extract features that contain richer semantic information, thus making it easier to capture potential class information. To better match the query image features, we also propose a new prototype generation method that incorporates multi-modal fusion features of text and images in the prototype generation process. Adaptive query prototypes were generated by combining foreground and background information from the images with the multi-modal support prototype, thereby allowing for a better matching of image features and improved segmentation accuracy. We provide a new perspective to the task of few-shot segmentation in multi-modal scenarios. Experiments demonstrate that our proposed method achieves excellent results on two common datasets, PASCAL-5i and COCO-20i.

Keywords:
Shot (pellet) Computer science Segmentation Artificial intelligence Computer vision Materials science

Metrics

13
Cited By
2.37
FWCI (Field Weighted Citation Impact)
59
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Adversarial Robustness in Machine Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Generalized Prototype-Based Few-Shot Semantic Segmentation Network

Hang ZhouGuanglu Sun

Communications in computer and information science Year: 2025 Pages: 307-324
JOURNAL ARTICLE

CLIP Driven Few-Shot Panoptic Segmentation

Pengfei XianLai-Man PoYuzhi ZhaoWing-Yin YuKwok-Wai Cheung

Journal:   IEEE Access Year: 2023 Vol: 11 Pages: 72295-72305
BOOK-CHAPTER

Part-Aware Prototype Network for Few-Shot Semantic Segmentation

Yongfei LiuXiangyi ZhangSongyang ZhangXuming He

Lecture notes in computer science Year: 2020 Pages: 142-158
JOURNAL ARTICLE

Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation

Jie LiuYanqi BaoGuo-Sen XieHuan XiongJan‐Jakob SonkeEfstratios Gavves

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022 Pages: 11543-11552
JOURNAL ARTICLE

Cycle association prototype network for few-shot semantic segmentation

Zhuangzhuang HaoJi ShaoBo GongJingwen YangLing JingYingyi Chen

Journal:   Engineering Applications of Artificial Intelligence Year: 2024 Vol: 138 Pages: 109309-109309
© 2026 ScienceGate Book Chapters — All rights reserved.