JOURNAL ARTICLE

Weakly Supervised Video Salient Object Detection via Point Supervision

Shuyong GaoHaozhe XingWei ZhangYan WangQianyu GuoWenqiang Zhang

Year: 2022 Journal:   Proceedings of the 30th ACM International Conference on Multimedia Pages: 3656-3665

Abstract

Fully supervised video salient object detection models have achieved excellent performance, yet obtaining pixel-by-pixel annotated datasets is laborious. Several works attempt to use scribble annotations to mitigate this problem, but point supervision as a more labor-saving annotation method (even the most labor-saving method among manual annotation methods for dense prediction), has not been explored. In this paper, we propose a strong baseline model based on point supervision. To infer saliency maps with temporal information, we mine inter-frame complementary information from short-term and long-term perspectives, respectively. Specifically, we propose a hybrid token attention module, which mixes optical flow and image information from orthogonal directions, adaptively highlighting critical optical flow information (channel dimension) and critical token information (spatial dimension). To exploit long-term cues, we develop the Long-term Cross-Frame Attention module (LCFA), which assists the current frame in inferring salient objects based on multi-frame tokens. Furthermore, we label two point-supervised datasets, P-DAVIS and P-DAVSOD, by relabeling the DAVIS and the DAVSOD dataset. Experiments on the six benchmark datasets illustrate our method outperforms the previous state-of-the-art weakly supervised methods and even is comparable with some fully supervised approaches. Our source code and datasets are available at: https://github.com/shuyonggao/PVSOD.

Keywords:
Computer science Artificial intelligence Benchmark (surveying) Frame (networking) Security token Optical flow Salient Annotation Dimension (graph theory) Object (grammar) Point (geometry) Pixel Key frame Source code Pattern recognition (psychology) Code (set theory) Term (time) Computer vision Data mining Image (mathematics)

Metrics

30
Cited By
2.07
FWCI (Field Weighted Citation Impact)
57
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Weakly-Supervised Salient Object Detection Using Point Supervision

Shuyong GaoWei ZhangYan WangQianyu GuoChenglong ZhangYangji HeWenqiang Zhang

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2022 Vol: 36 (1)Pages: 670-678
JOURNAL ARTICLE

Weakly-Supervised Salient Object Detection on Light Fields

Zijian LiangPengjie WangKe XuPingping ZhangRynson W. H. Lau

Journal:   IEEE Transactions on Image Processing Year: 2022 Vol: 31 Pages: 6295-6305
JOURNAL ARTICLE

Weakly-Supervised Saliency Detection via Salient Object Subitizing

Xiaoyang ZhengXin TanJie ZhouLizhuang MaRynson W. H. Lau

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2021 Vol: 31 (11)Pages: 4370-4380
© 2026 ScienceGate Book Chapters — All rights reserved.