JOURNAL ARTICLE

Prototype as query for few shot semantic segmentation

Leilei CaoYibo GuoYe YuanQiangguo Jin

Year: 2024 Journal:   Complex & Intelligent Systems Vol: 10 (5)Pages: 7265-7278   Publisher: Springer Science+Business Media

Abstract

Abstract Few-shot Semantic Segmentation (FSS) was proposed to segment unseen classes in a query image, referring to only a few annotated examples named support images. One of the characteristics of FSS is spatial inconsistency between query and support targets, e.g., texture or appearance. This greatly challenges the generalization ability of methods for FSS, which requires to effectively exploit the dependency of the query image and the support examples. Most existing methods abstracted support features into prototype vectors and implemented the interaction with query features using cosine similarity or feature concatenation. However, this simple interaction may not capture spatial details in query features. To address this limitation, some methods utilized pixel-level support information by computing pixel-level correlations between paired query and support features implemented with the attention mechanism of Transformer. Nevertheless, these approaches suffer from heavy computation due to dot-product attention between all pixels of support and query features. In this paper, we propose a novel framework, termed ProtoFormer, built upon the Transformer architecture, to fully capture spatial details in query features. ProtoFormer treats the abstracted prototype of the target class in support features as the Query and the query features as Key and Value embeddings, which are input to the Transformer decoder. This approach enables better capture of spatial details and focuses on the semantic features of the target class in the query image. The output of the Transformer-based module can be interpreted as semantic-aware dynamic kernels that filter the segmentation mask from the enriched query features. Extensive experiments conducted on PASCAL- $$5^{i}$$ 5 i and COCO- $$20^{i}$$ 20 i datasets demonstrate that ProtoFormer significantly outperforms the state-of-the-art methods in FSS.

Keywords:
Computer science Query optimization Query expansion Query by Example Query language Artificial intelligence Data mining Exploit Pattern recognition (psychology) Web query classification Sargable Information retrieval Web search query Search engine

Metrics

8
Cited By
5.11
FWCI (Field Weighted Citation Impact)
56
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Query-Guided Prototype Evolution Network for Few-Shot Segmentation

Runmin CongHang XiongJ.R. ChenWei ZhangQingming HuangYao Zhao

Journal:   IEEE Transactions on Multimedia Year: 2024 Vol: 26 Pages: 6501-6512
JOURNAL ARTICLE

Few-Shot Semantic Segmentation with Frequency Prototype Learning

Chunlin WenHuang HuiYan MaFeiniu YuanHongqing ZhuPeng Zhu

Journal:   Computing and Informatics Year: 2025 Vol: 44 (1)Pages: 92-123
JOURNAL ARTICLE

Dual Prototype Learning for Few Shot Semantic Segmentation

Wenxuan LiShaobo ChenChengyi Xiong

Journal:   IEEE Access Year: 2024 Vol: 12 Pages: 6356-6364
BOOK-CHAPTER

Generalized Prototype-Based Few-Shot Semantic Segmentation Network

Hang ZhouGuanglu Sun

Communications in computer and information science Year: 2025 Pages: 307-324
BOOK-CHAPTER

Prototype Mixture Models for Few-Shot Semantic Segmentation

Boyu YangChang LiuBohao LiJianbin JiaoQixiang Ye

Lecture notes in computer science Year: 2020 Pages: 763-778
© 2026 ScienceGate Book Chapters — All rights reserved.