JOURNAL ARTICLE

Self-Calibrated Cross Attention Network for Few-Shot Segmentation

Abstract

The key to the success of few-shot segmentation (FSS) lies in how to effectively utilize support samples. Most solutions compress support foreground (FG) features into prototypes, but lose some spatial details. Instead, others use cross attention to fuse query features with uncompressed support FG. Query FG could be fused with support FG, however, query background (BG) cannot find matched BG features in support FG, yet inevitably integrates dissimilar features. Besides, as both query FG and BG are combined with support FG, they get entangled, thereby leading to ineffective segmentation. To cope with these issues, we design a self-calibrated cross attention (SCCA) block. For efficient patch-based attention, query and support features are firstly split into patches. Then, we design a patch alignment module to align each query patch with its most similar support patch for better cross attention. Specifically, SCCA takes a query patch as Q, and groups the patches from the same query image and the aligned patches from the support image as K&V. In this way, the query BG features are fused with matched BG features (from query patches), and thus the aforementioned issues will be mitigated. Moreover, when calculating SCCA, we design a scaled-cosine mechanism to better utilize the support features for similarity calculation. Extensive experiments conducted on PASCAL-5 i and COCO-20 i demonstrate the superiority of our model, e.g., the mIoU score under 5-shot setting on COCO-20 i is 5.6%+ better than previous state-of-the-arts. The code is available at https://github.com/Sam1224/SCCAN.

Keywords:
Computer science Segmentation Pascal (unit) Image (mathematics) Information retrieval Pattern recognition (psychology) Data mining Artificial intelligence Programming language

Metrics

51
Cited By
9.28
FWCI (Field Weighted Citation Impact)
55
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Dual-Attention Network for Few-Shot Segmentation

Zhikui ChenHan WangSuhua ZhangFangming Zhong

Journal:   ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Year: 2022 Pages: 2210-2214
JOURNAL ARTICLE

Dual cross-attention Transformer network for few-shot image semantic segmentation

Yu LiuYana GuoYe ZhuMing Yu

Journal:   Chinese Journal of Liquid Crystals and Displays Year: 2024 Vol: 39 (11)Pages: 1494-1505
JOURNAL ARTICLE

Cross Attention Network for Few-shot Classification

Ruibing HouHong ChangBingpeng MaShiguang ShanXilin Chen

Journal:   arXiv (Cornell University) Year: 2019 Vol: 32 Pages: 4003-4014
© 2026 ScienceGate Book Chapters — All rights reserved.