Abstract

It is difficult and expensive to obtain labels for image semantic segmentation tasks. This has led to more and more researches focusing on weakly supervised semantic segmentation (WSSS) with image-level labels and the class activation maps (CAM) is often used to locate objects. The performance of WSSS methods largely depends on the accuracy of the generated CAMs, but current methods can only perform rough locali/ation, especially cannot fit CAM to the object's edge well. Therefore, we propose the Edge Enhancement Network (EEN), which uses the shallow features of the network to enhance edge information. This allows us to obtain more accurate CAMs that fit the edges only through image-level labels. It greatly improves the accuracy of pseudo ground truth and easily reaches the level of current mainstream methods. After many experiments, it has reached 67.0% mIoU on the Pascal VOC2012 validation set, which exceeds the latest method under the same training settings.

Keywords:
Computer science Segmentation Artificial intelligence Ground truth Pascal (unit) Enhanced Data Rates for GSM Evolution Image segmentation Pattern recognition (psychology) Computer vision Image (mathematics) Set (abstract data type)

Metrics

2
Cited By
0.00
FWCI (Field Weighted Citation Impact)
24
Refs
0.06
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.