JOURNAL ARTICLE

Multi-Scale Spatiotemporal Feature Fusion Network for Video Saliency Prediction

Yunzuo ZhangTian ZhangCunyu WuRan Tao

Year: 2023 Journal:   IEEE Transactions on Multimedia Vol: 26 Pages: 4183-4193   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Recently, video saliency prediction has attracted increasing attention, yet the improvement of its accuracy is still subject to the insufficient use of multi-scale spatiotemporal features. To address this issue, we propose a 3D convolutional Multi-scale Spatiotemporal Feature Fusion Network (MSFFNet) to achieve the full utilization of spatiotemporal features. Specifically, we propose a Bi-directional Temporal-Spatial Feature Pyramid (BiTSFP), the first application of bi-directional fusion architectures in this field, which adds the flow of shallow location information on the basis of the previous flow of deep semantic information. Then, different from simple addition and concatenation, we design an Attention-Guided Fusion (AGF) mechanism that can adaptively learn the fusion weights of adjacent features to integrate them appropriately. Moreover, a Framewise Attention (FA) module is introduced to selectively emphasize the useful frames, augmenting the multi-scale temporal features to be fused. Our model is simple but effective, and it can run in real-time. Experimental results on the DHF1K, Hollywood-2, and UCF-sports datasets demonstrate that the proposed MSFF-Net outperforms existing state-of-the-art methods in accuracy.

Keywords:
Computer science Concatenation (mathematics) Artificial intelligence Feature (linguistics) Pattern recognition (psychology) Fusion mechanism Convolutional neural network Pyramid (geometry) Scale (ratio) Feature extraction Fusion Computer vision

Metrics

96
Cited By
17.47
FWCI (Field Weighted Citation Impact)
64
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Video Quality Assessment
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.