Keyang Cheng, Jingfeng Tang, Hongjian Gu, Hao Wan, Maozhen Li
Most existing Vision Transformer-based frameworks for weakly supervised semantic segmentation rely on class activation maps to generate pseudo masks. Although this approach mitigates the class-agnostic issue, it still suffers from misclassification and noisy segmentation results. To overcome these limitations, we propose an attention-based framework named Cross-block Sparse Class Token Contrast (CB-SCTC), which incorporates a Dynamic Sparse Attention (DSA) module and a Cross-block Class Token Contrast (CB-CTC) scheme. Specifically, the Cross-block Class Token Contrast scheme enforces diversity among the final class tokens by learning from the lower similarity of the class tokens in relatively shallower blocks. Moreover, the Dynamic Sparse Attention module post-processes the output of the softmax function in the attention mechanism to reduce noise. Extensive experiments show that the proposed framework is a valid alternative to class activation maps: it achieves competitive mIoU scores on PASCAL VOC 2012 (val: 75.5%, test: 75.2%) and MS COCO 2014 (val: 46.9%). Our code is available at https://github.com/Jingfeng-Tang/CB-SCTC.
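The abstract describes the DSA module only at a high level: sparsifying the softmax output of the attention mechanism to suppress noisy low-weight entries. A minimal sketch of one common way to do this is shown below, assuming a top-k retention rule with renormalization; the function name `sparse_attention` and the `keep_ratio` parameter are illustrative assumptions, not the paper's actual dynamic criterion (see the released code for the authors' implementation).

```python
import numpy as np

def sparse_attention(scores, keep_ratio=0.5):
    """Illustrative sketch (not the paper's exact method): apply softmax
    to raw attention scores, zero out the smallest weights per query,
    and renormalize so noisy low-weight keys stop spreading activation.
    `keep_ratio` is an assumed hyperparameter: the fraction of keys
    retained for each query."""
    # standard numerically stable softmax over the key axis
    e = np.exp(scores - scores.max(axis=-1, keepdims=True))
    attn = e / e.sum(axis=-1, keepdims=True)
    # keep only the top-k attention weights per query row
    k = max(1, int(round(keep_ratio * attn.shape[-1])))
    thresh = np.sort(attn, axis=-1)[..., -k][..., None]
    sparse = np.where(attn >= thresh, attn, 0.0)
    # renormalize so each row of weights still sums to 1
    return sparse / sparse.sum(axis=-1, keepdims=True)
```

With `keep_ratio=0.5` on a row of four scores, the two smallest attention weights are zeroed and the remaining two are rescaled to sum to one, so only the strongest query-key interactions contribute to the output.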