JOURNAL ARTICLE

Multi-Scale Spatiotemporal Conv-LSTM Network for Video Saliency Detection

Abstract

Recently, deep neural networks have been crucial techniques for image salient detection. However, two difficulties prevent the development of deep learning in video saliency detection. The first one is that the traditional static network cannot conduct a robust motion estimation in videos. The other is that the data-driven deep learning is in lack of sufficient manually annotated pixel-wise ground truths for video saliency network training. In this paper, we propose a multi-scale spatiotemporal convolutional LSTM network (MSST-ConvLSTM) to incorporate spatial and temporal cues for video salient objects detection. Furthermore, as manually pixel-wised labeling is very time-consuming, we sign lots of coarse labels, which are mixed with fine labels to train a robust saliency prediction model. Experiments on the widely used challenging benchmark datasets (e.g., FBMS and DAVIS) demonstrate that the proposed approach has competitive performance of video saliency detection compared with the state-of-the-art saliency models.

Keywords:
Computer science Artificial intelligence Benchmark (surveying) Convolutional neural network Pixel Deep learning Pattern recognition (psychology) Salient Kadir–Brady saliency detector Scale (ratio) Computer vision Image (mathematics) Object detection

Metrics

13
Cited By
1.44
FWCI (Field Weighted Citation Impact)
43
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Video Quality Assessment
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Scale Spatiotemporal Feature Fusion Network for Video Saliency Prediction

Yunzuo ZhangTian ZhangCunyu WuRan Tao

Journal:   IEEE Transactions on Multimedia Year: 2023 Vol: 26 Pages: 4183-4193
JOURNAL ARTICLE

Video Saliency Detection Using Spatiotemporal Cues

Yu ChenJing XiaoLiuyi HuDan ChenZhongyuan WangDengshi Li

Journal:   IEICE Transactions on Information and Systems Year: 2018 Vol: E101.D (9)Pages: 2201-2208
© 2026 ScienceGate Book Chapters — All rights reserved.