JOURNAL ARTICLE

Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network

Yuqing ZhangYong ZhangShaofan WangYun LiangBaocai Yin

Year: 2023 Journal:   ACM Transactions on Multimedia Computing Communications and Applications Vol: 20 (1)Pages: 1-23   Publisher: Association for Computing Machinery

Abstract

Video object segmentation (VOS) exhibits heavy occlusions, large deformation, and severe motion blur. While many remarkable convolutional neural networks are devoted to the VOS task, they often mis-identify background noise as the target or output coarse object boundaries, due to the failure of mining detail information and high-order correlations of pixels within the whole video. In this work, we propose an edge attention gated graph convolutional network (GCN) for VOS. The seed point initialization and graph construction stages construct a spatio-temporal graph of the video by exploring the spatial intra-frame correlation and the temporal inter-frame correlation of superpixels. The node classification stage identifies foreground superpixels by using an edge attention gated GCN which mines higher-order correlations between superpixels and propagates features among different nodes. The segmentation optimization stage optimizes the classification of foreground superpixels and reduces segmentation errors by using a global appearance model which captures the long-term stable feature of objects. In summary, the key contribution of our framework is twofold: (a) the spatio-temporal graph representation can propagate the seed points of the first frame to subsequent frames and facilitate our framework for the semi-supervised VOS task; and (b) the edge attention gated GCN can learn the importance of each node with respect to both the neighboring nodes and the whole task with a small number of layers. Experiments on Davis 2016 and Davis 2017 datasets show that our framework achieves the excellent performance with only small training samples (45 video sequences).

Keywords:
Computer science Artificial intelligence Segmentation Initialization Pattern recognition (psychology) Convolutional neural network Graph Frame (networking) Computer vision Theoretical computer science

Metrics

2
Cited By
0.36
FWCI (Field Weighted Citation Impact)
45
Refs
0.53
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Spatial-temporal video object segmentation with graph convolutional network and attention mechanism

Rui YaoShixiong XiaZhou YongJiaqi ZhaoFuyuan Hu

Journal:   Journal of Image and Graphics Year: 2021 Vol: 26 (10)Pages: 2376-2387
JOURNAL ARTICLE

Semi-supervised Video Object Segmentation based on External Memory Attention

Jiyun KimSungeun Hong

Journal:   Journal of Broadcast Engineering Year: 2023 Vol: 28 (5)Pages: 613-622
BOOK-CHAPTER

Mask-Ranking Network for Semi-supervised Video Object Segmentation

Wen J. LiXiang ZhangYujie HuYingqi Tang

Lecture notes in computer science Year: 2021 Pages: 620-636
© 2026 ScienceGate Book Chapters — All rights reserved.