Spatio-Temporal Attention Network for Video Instance Segmentation

Xiaoyu Liu; Haibing Ren; Tingmeng Ye

doi:10.1109/iccvw.2019.00092

JOURNAL ARTICLE

Year: 2019

Pages: 725-727

Abstract

In this paper, we propose a spatio-temporal attention network for video instance segmentation. The network estimates a global correlation map between successive frames and converts it into an attention map. With this attention information added, the new features may enhance the response of instances of the pre-defined categories, which in turn improves detection, segmentation and tracking accuracy. Experimental results show that, combined with MaskTrack R-CNN, the method may improve video instance segmentation accuracy from 0.293 to 0.400 on the YouTube-VIS test dataset with a single model. Our method took 6th place in the video instance segmentation track of the 2nd Large-scale Video Object Segmentation Challenge.
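The abstract names three steps: correlate successive frames globally, turn the correlation into an attention map, and add the attention to the features. A minimal NumPy sketch of that idea follows. The abstract does not say how the correlation map is reduced to an attention map or how the attention is combined with the features, so the cosine normalisation, the max-over-positions reduction, the sigmoid, the residual reweighting, and the function name `correlation_attention` are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def correlation_attention(feat_prev, feat_cur):
    """Illustrative sketch only (not the paper's exact formulation).

    feat_prev, feat_cur: feature maps of shape (C, H, W) from two
    successive frames. Returns (attended_features, attention_map).
    """
    c, h, w = feat_cur.shape
    prev = feat_prev.reshape(c, h * w)
    cur = feat_cur.reshape(c, h * w)

    # Assumption: L2-normalise each position's channel vector so the
    # correlation between two positions is a cosine similarity.
    prev_n = prev / (np.linalg.norm(prev, axis=0, keepdims=True) + 1e-8)
    cur_n = cur / (np.linalg.norm(cur, axis=0, keepdims=True) + 1e-8)

    # Global correlation map: every position in the current frame is
    # compared with every position in the previous frame -> (HW, HW).
    corr = cur_n.T @ prev_n

    # Assumption: reduce to one score per current-frame position by
    # taking its strongest match in the previous frame.
    score = corr.max(axis=1)

    # Assumption: squash the scores into (0, 1) with a sigmoid.
    attn = (1.0 / (1.0 + np.exp(-score))).reshape(h, w)

    # Assumption: add the attention residually, so positions with strong
    # temporal support are amplified and none are suppressed to zero.
    attended = feat_cur * (1.0 + attn[None, :, :])
    return attended, attn
```

In a pipeline like the one the abstract describes, the attended features would stand in for the backbone features passed to the detection, segmentation and tracking heads; where exactly the authors apply the attention inside MaskTrack R-CNN is not stated here.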

Keywords:

Segmentation; Computer science; Artificial intelligence; Computer vision; Video tracking; Object (grammar); Image segmentation; Scale-space segmentation; Segmentation-based object categorization; Pattern recognition (psychology)

Metrics

FWCI (Field Weighted Citation Impact): 0.86

Citation Normalized Percentile: 0.78

Topics

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Deformable VisTR: Spatio Temporal Deformable Attention for Video Instance Segmentation

Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer

Spatio-Temporal Convolution-Attention Video Network

STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation

IAST: Instance Association Relying on Spatio-Temporal Features for Video Instance Segmentation