Learning class-agnostic masks with cross-task refinement for weakly supervised semantic segmentation

Lian Xu; Mohammed Bennamoun; Farid Boussaïd; Wanli Ouyang; Dan Xu

doi:10.1007/s00521-023-08826-0

ScienceGate Book Chapters

JOURNAL ARTICLE

Learning class-agnostic masks with cross-task refinement for weakly supervised semantic segmentation

Lian Xu Mohammed Bennamoun Farid Boussaïd Wanli Ouyang Dan Xu

Year: 2023 Journal: Neural Computing and Applications Vol: 35 (27)Pages: 20189-20205 Publisher: Springer Science+Business Media

DOI: 10.1007/s00521-023-08826-0

Get Full-Text PDF Get Analytical Report

Abstract

Abstract Weakly supervised semantic segmentation (WSSS) commonly relies on Class Activation Mapping (CAM) to produce pseudo semantic labels using image-level annotations. However, because CAM maps often form sparse object regions with poor boundaries, they cannot provide sufficient segmentation supervision. Because off-the-shelf saliency maps can provide rich object boundaries that can be leveraged to improve semantic segmentation, we propose to jointly learn semantic segmentation and class-agnostic masks by using image-level annotations and off-the-shelf saliency maps as supervision. We also propose a cross-task label refinement mechanism, which takes advantage of the learned class-agnostic masks and semantic segmentation masks, to refine the pseudo labels and provide more accurate supervision to both tasks. Moreover, we introduce a new normalization method for CAM to generate more complete class-specific localization maps. The improved CAM maps complement our learned class-agnostic masks, leading to high-quality pseudo semantic segmentation labels. Extensive experiments demonstrate the effectiveness of the proposed approach, with state-of-the-art WSSS results established on PASCAL VOC 2012 and MS COCO.

Keywords:

Segmentation Computer science Pascal (unit) Artificial intelligence Class (philosophy) Task (project management) Pattern recognition (psychology) Normalization (sociology) Semantics (computer science)

Metrics

Cited By

0.18

FWCI (Field Weighted Citation Impact)

Refs

0.40

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Learning class-agnostic masks with cross-task refinement for weakly supervised semantic segmentation

Abstract

Metrics

Citation History

Topics

Related Documents

Class agnostic and specific consistency learning for weakly-supervised point cloud semantic segmentation

Weakly-supervised Instance Segmentation via Class-agnostic Learning with Salient Images

Weakly-supervised Incremental learning for Semantic segmentation with Class Hierarchy

Completely weakly supervised class-incremental learning for semantic segmentation

Weakly Supervised Semantic Segmentation with Deep Learning