JOURNAL ARTICLE

End-to-end Boundary Exploration for Weakly-supervised Semantic Segmentation

Abstract

It is full of challenges for weakly supervised semantic segmentation (WSSS) acquiring the pixel-level object location with only image-level annotations. Especially, the single-stage methods learn image- and pixel-level labels simultaneously to avoid complicated multi-stage computations and sophisticated training procedures. In this paper, we argue that using a single model to accomplish image- and pixel-level classification will fall into the balance of multi-target and consequently weakens the recognition capability. Because the image-level task tends to learn position-independent features, but the pixel-level task tends to be position-sensitive. Hence, we propose an effective encoder-decoder framework to explore object boundaries and solve the above dilemma. The encoder and decoder learn position-independent and position-sensitive features independently during the end-to-end training. In addition, a global soft pooling is suggested to suppress background pixels' activation for the encoder training and further improve the class activation map (CAM) performance. The edge annotations for the decoder training are synthesized by the high confidence CAMs, which do not requires extra supervision. The extensive experiments on the Pascal VOC12 dataset demonstrate that our method achieves state-of-the-art compared to the end-to-end approaches. It gets 63.6% and 65.7% mIoU scores on val and test sets respectively.

Keywords:
Computer science Artificial intelligence Pixel Encoder Segmentation End-to-end principle Computer vision Pooling Pattern recognition (psychology) Image segmentation

Metrics

12
Cited By
1.12
FWCI (Field Weighted Citation Impact)
60
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

CPEWS: Contextual Prototype-Based End-to-End Weakly Supervised Semantic Segmentation

Xiaoyan ShaoJiaqi HanLingling LiXuezhuan ZhaoJingjing Yan

Journal:   Computers, materials & continua/Computers, materials & continua (Print) Year: 2025 Vol: 83 (1)Pages: 595-617
JOURNAL ARTICLE

Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

Lei ZhuXinliang ZhangHangzhou HeQian ChenSha LiShuang ZengYibao ZhangQiushi RenYanye Lu

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2024 Vol: 36 (6)Pages: 11493-11506
JOURNAL ARTICLE

Self Correspondence Distillation for End-to-End Weakly-Supervised Semantic Segmentation

Rongtao XuChangwei WangJiaxi SunShibiao XuWeiliang MengXiaopeng Zhang

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2023 Vol: 37 (3)Pages: 3045-3053
© 2026 ScienceGate Book Chapters — All rights reserved.