Recently, deep neural networks have become crucial techniques for image saliency detection. However, two difficulties hinder the development of deep learning in video saliency detection. The first is that traditional static networks cannot perform robust motion estimation in videos. The other is that data-driven deep learning lacks sufficient manually annotated pixel-wise ground truths for training video saliency networks. In this paper, we propose a multi-scale spatiotemporal convolutional LSTM network (MSST-ConvLSTM) that incorporates spatial and temporal cues for video salient object detection. Furthermore, as manual pixel-wise labeling is very time-consuming, we annotate a large number of coarse labels, which are mixed with fine labels to train a robust saliency prediction model. Experiments on widely used, challenging benchmark datasets (e.g., FBMS and DAVIS) demonstrate that the proposed approach achieves competitive video saliency detection performance compared with state-of-the-art saliency models.
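The core building block named in the abstract, a convolutional LSTM, replaces the matrix products in an ordinary LSTM with convolutions, so the recurrent state retains the spatial layout of the frame and can carry temporal cues across a video. The following is a minimal single-channel sketch of such a cell in plain NumPy; it is an illustrative toy (naive convolution loop, random weights, no multi-scale branches or saliency head), not the authors' MSST-ConvLSTM, and all class and variable names are hypothetical.

```python
import numpy as np

def conv2d(x, k):
    """'Same'-padded 2D cross-correlation of a single-channel map (naive loops)."""
    ph, pw = k.shape[0] // 2, k.shape[1] // 2
    xp = np.pad(x, ((ph, ph), (pw, pw)))
    out = np.zeros_like(x)
    for i in range(x.shape[0]):
        for j in range(x.shape[1]):
            out[i, j] = np.sum(xp[i:i + k.shape[0], j:j + k.shape[1]] * k)
    return out

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class ConvLSTMCell:
    """Toy single-channel ConvLSTM cell: the four gates (input, forget,
    output, candidate) are computed by convolving the current frame and
    the previous hidden state, so h and c stay H-by-W maps."""
    def __init__(self, ksize=3, seed=0):
        rng = np.random.default_rng(seed)
        # one input kernel and one hidden-state kernel per gate
        self.Wx = rng.standard_normal((4, ksize, ksize)) * 0.1
        self.Wh = rng.standard_normal((4, ksize, ksize)) * 0.1
        self.b = np.zeros(4)

    def step(self, x, h, c):
        pre = [conv2d(x, self.Wx[k]) + conv2d(h, self.Wh[k]) + self.b[k]
               for k in range(4)]
        i, f, o = sigmoid(pre[0]), sigmoid(pre[1]), sigmoid(pre[2])
        g = np.tanh(pre[3])        # candidate cell update
        c_new = f * c + i * g      # gated memory of past frames
        h_new = o * np.tanh(c_new) # spatial hidden map passed to next frame
        return h_new, c_new

# Run a short sequence of random "frames" through the cell.
H = W = 8
cell = ConvLSTMCell()
h = c = np.zeros((H, W))
for t in range(3):
    x = np.random.default_rng(t).standard_normal((H, W))
    h, c = cell.step(x, h, c)
print(h.shape)
```

Because the hidden state is itself a feature map, stacking such cells at several resolutions (the "multi-scale" part of the abstract) lets a saliency decoder combine fine spatial detail with motion context accumulated over time.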