Video Object Extraction Based on Spatiotemporal Consistency Saliency Detection

Yingchun Guo; Zhuo Li; Yi Liu; Gang Yan; Ming Yu

doi:10.1109/access.2018.2841062

ScienceGate Book Chapters

JOURNAL ARTICLE

Video Object Extraction Based on Spatiotemporal Consistency Saliency Detection

Yingchun Guo Zhuo Li Yi Liu Gang Yan Ming Yu

Year: 2018 Journal: IEEE Access Vol: 6 Pages: 35171-35181 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/access.2018.2841062

Get Full-Text PDF Get Analytical Report

Abstract

Video object extraction (VOE) refers to the challenging task of separating foreground objects automatically from a background. Aiming to resolve the problems of incomplete extraction of foreground objects and the background interference of irrelevant small moving objects, this paper proposes a new method of VOE based on spatiotemporal consistency saliency detection. The main innovation in this proposed method is composed of three parts: first, the spatiotemporal gradient field (SGF) is constructed by mutual consistency between the static gradient feature of intra-frame and the dynamic gradient feature inter-frame, and a coarse motion saliency map is obtained by minimizing relative gradients on the SGF; second, temporal consistency is proposed based on the adjacent frame similarity to fuse the adjacent dynamic saliency maps and get the fine motion saliency map; third, based on spatiotemporal consistency, salient objects are extracted by the fusion of the static saliency map and the motion saliency map adaptively. Experiments on the ViSal and SegtrackV2 public video saliency data sets show that, compared with the state-of-the-art image saliency methods and video sequence saliency object detection methods, the proposed algorithm can extract the salient object in the video sequence quickly, clearly, and accurately. It can be seen that the average F-score is close to 0.8, and the average mean absolute error (MAE) is about 0.06 on the ViSal data set, and on SegtrackV2, the average F-score is close to 0.7, and the MAE value is below 0.05, which indicates that the result of this algorithm is closer to the ground truth.

Keywords:

Artificial intelligence Computer science Computer vision Consistency (knowledge bases) Pattern recognition (psychology) Feature extraction Kadir–Brady saliency detector Object detection Feature (linguistics) Frame (networking) Similarity (geometry) Object (grammar) Image (mathematics)

Metrics

Cited By

0.87

FWCI (Field Weighted Citation Impact)

Refs

0.74

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Object Extraction Based on Spatiotemporal Consistency Saliency Detection

Abstract

Metrics

Citation History

Topics

Related Documents

Moving object detection in aerial video based on spatiotemporal saliency

A video saliency detection framework using spatiotemporal consistency optimization

Spatiotemporal Saliency Detection Based on Maximum Consistency Superpixels Merging for Video Analysis

Spatiotemporal Saliency Detection based Video Quality Assessment

Video saliency detection based on spatiotemporal feature learning