JOURNAL ARTICLE

Video Salient Object Detection Using Multi-Scale Self-Attention

Abstract

Effectively modeling both spatial information and temporal dynamics is crucial to Video Salient Object Detection (VSOD). Several recent works use self-attention to capture spatiotemporal information, owing to its ability to model long-range dependencies among patch tokens. However, these models assign similar receptive fields across the spatiotemporal feature maps, which limits their ability to handle frames containing multiple salient objects of different scales. To address this issue, we propose a Multi-Scale Self-Attention (MSSA) operation that better models the spatiotemporal features of salient objects at different scales. Experimental results demonstrate that using the MSSA operation improves performance on challenging datasets.
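The abstract does not specify how MSSA is built, so the following is only a minimal illustrative sketch of one common way to give self-attention several receptive-field sizes: every spatiotemporal token attends to keys and values that have been average-pooled at a few spatial scales, and the per-scale outputs are averaged. The function names, the `scales` parameter, the pooling choice, and the absence of learned projections are all assumptions made for illustration; this is not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def avg_pool_tokens(feat, scale):
    """Average-pool a (T, H, W, C) feature map spatially by `scale`
    and flatten it into (num_tokens, C). Hypothetical helper."""
    t, h, w, c = feat.shape
    assert h % scale == 0 and w % scale == 0, "scale must divide H and W"
    pooled = feat.reshape(t, h // scale, scale, w // scale, scale, c).mean(axis=(2, 4))
    return pooled.reshape(-1, c)

def multi_scale_self_attention(feat, scales=(1, 2, 4)):
    """Illustrative multi-scale attention over a clip of shape (T, H, W, C).

    Assumption: queries are the full-resolution tokens; keys/values are the
    same features pooled at each scale. No learned Q/K/V projections are used.
    """
    t, h, w, c = feat.shape
    queries = feat.reshape(-1, c)                      # (T*H*W, C)
    outputs = []
    for s in scales:
        kv = avg_pool_tokens(feat, s)                  # coarser tokens for larger s
        attn = softmax(queries @ kv.T / np.sqrt(c))    # rows sum to 1
        outputs.append(attn @ kv)                      # (T*H*W, C)
    # Average the per-scale results and restore the clip layout.
    return np.mean(outputs, axis=0).reshape(t, h, w, c)

# Example: a 2-frame clip with 4x4 spatial tokens and 8 channels.
clip = np.random.default_rng(0).normal(size=(2, 4, 4, 8))
fused = multi_scale_self_attention(clip)
```

In this sketch a larger scale gives each query a coarser summary of the scene, which is one plausible way to cover salient objects of different sizes; a trained model would add learned projections and a learned fusion of the scales.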

Keywords:
Salient; Computer science; Artificial intelligence; Feature (linguistics); Scale (ratio); Object detection; Object (grammar); Pattern recognition (psychology); Computer vision; Geography

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.18
Refs: 7
Citation Normalized Percentile: 0.45

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Salient object detection via multi-scale attention CNN

Yuzhu Ji, Haijun Zhang, Q. M. Jonathan Wu

Journal: Neurocomputing, Year: 2018, Vol: 322, Pages: 130-140

JOURNAL ARTICLE

Salient Object Detection Using Multi-Scale Features with Attention Recurrent Mechanism

Shanmei Lu, Qiang Guo, Ren Wang, Caiming Zhang

Journal: Journal of Computer-Aided Design & Computer Graphics, Year: 2020, Vol: 32 (12), Pages: 1926-1937

JOURNAL ARTICLE

Video salient object detection using dual-stream spatiotemporal attention

Chenchu Xu, Zhifan Gao, Heye Zhang, Shuo Li, Victor Hugo C. de Albuquerque

Journal: Applied Soft Computing, Year: 2021, Vol: 108, Pages: 107433-107433

JOURNAL ARTICLE

Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection

Yuchao Gu, Lijuan Wang, Ziqin Wang, Yun Liu, Ming‐Ming Cheng, Shao-Ping Lu

Journal: Proceedings of the AAAI Conference on Artificial Intelligence, Year: 2020, Vol: 34 (07), Pages: 10869-10876