Video Salient Object Detection Using Multi-Scale Self-Attention

Jiahao Liu; Haoyuan Liu; Hiroshi Watanabe

doi:10.1109/gcce59613.2023.10315584

ScienceGate Book Chapters

JOURNAL ARTICLE

Video Salient Object Detection Using Multi-Scale Self-Attention

Jiahao Liu Haoyuan Liu Hiroshi Watanabe

Year: 2023 Pages: 368-371

DOI: 10.1109/gcce59613.2023.10315584

Get Full-Text PDF Get Analytical Report

Abstract

How to effectively model both spatial information and temporal dynamics is crucial to Video Salient Object Detection (VSOD). Recently, there are some works using self-attention mechanism to capture the spatiotemporal information due to its ability of modeling long-range dependencies of patch tokens. However, these models designate similar receptive fields of the spatiotemporal feature maps, which limits the ability of the models in handling the frames with multiple salient objects of different scales. To address this issue, we propose a Multi-Scale Self-Attention (MSSA) operation to better model the spatiotemporal features of salient objects with different scales. The experimental results demonstrate that our method achieves better performance on challenge datasets by using MSSA operation.

Keywords:

Salient Computer science Artificial intelligence Feature (linguistics) Scale (ratio) Object detection Object (grammar) Pattern recognition (psychology) Computer vision Geography

Metrics

Cited By

0.18

FWCI (Field Weighted Citation Impact)

Refs

0.45

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Salient Object Detection Using Multi-Scale Self-Attention

Abstract

Metrics

Citation History

Topics

Related Documents

Salient object detection via multi-scale attention CNN

Salient Object Detection Using Multi-Scale Features with Attention Recurrent Mechanism

Attention to the Scale: Deep Multi-Scale Salient Object Detection

Video salient object detection using dual-stream spatiotemporal attention

Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection