JOURNAL ARTICLE

Multi-Stream Temporally Enhanced Network for Video Salient Object Detection

Dan Xu, Jiale Ru, Jinlong Shi

Year: 2023 Journal: Computers, Materials & Continua Vol: 78 (1) Pages: 85-104

Abstract

Video salient object detection (VSOD) aims to locate the most attractive objects in a video by exploring spatial and temporal features. VSOD is a challenging task in computer vision, as it involves processing complex spatial data that is also influenced by temporal dynamics. Despite the progress made by existing VSOD models, they still struggle in scenes with great background diversity within and between frames. They also suffer from accumulated noise and high time consumption when extracting temporal features over long durations. We propose a multi-stream temporally enhanced network (MSTENet) to address these problems. It explores the collaboration of saliency cues in the spatial domain with a multi-stream structure to handle great background diversity. A straightforward yet efficient approach to temporal feature extraction is developed to avoid accumulated noise and reduce time consumption. MSTENet differs from other VSOD methods in its incorporation of both foreground supervision and background supervision, which facilitates the extraction of collaborative saliency cues. Another notable difference is the integration of spatial and temporal features: the temporal module is integrated into the multi-stream structure, enabling comprehensive spatial-temporal interactions within an end-to-end framework. Extensive experimental results demonstrate that the proposed method achieves state-of-the-art performance on five benchmark datasets while maintaining a real-time speed of 27 fps (Titan XP). Our code and models are available at https://github.com/RuJiaLe/MSTENet.

Keywords:
Computer science; Salient; Benchmark (surveying); Artificial intelligence; Feature extraction; Object detection; Computer vision; Pattern recognition (psychology); Geography

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 59
Citation Normalized Percentile: 0.18

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Olfactory and Sensory Function Studies
Life Sciences →  Neuroscience →  Sensory Systems
