Semi-supervised video object segmentation (VOS) methods aim to segment target objects given pixel-level annotations in the first frame. Many methods employ Transformer-based attention modules to propagate the first-frame annotations to the most similar patches or pixels in subsequent frames. Although they have shown impressive results, they remain prone to errors in challenging scenes with multiple overlapping objects. To tackle this problem, we propose an object-centric VOS (OCVOS) method that exploits query-based Transformer decoder blocks. After aggregating target object information with typical matching-based approaches, the Transformer decoder blocks extract object-wise information by interacting with object queries. In this way, the proposed method considers not only global and contextual information but also object-centric representations. We validate its effectiveness in inducing object-wise information, in comparison with existing methods, on the DAVIS and YouTube-VOS benchmarks.
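The core idea of a query-based decoder block, as described above, is that a small set of learnable object queries cross-attends to the pixel-level features produced by the matching stage, so each query accumulates an object-centric representation that can then be compared against the feature map to produce per-object masks. The following is a minimal NumPy sketch of that mechanism only; the function names, dimensions, and the single-head attention are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def query_decoder_block(queries, features):
    """One simplified decoder step: object queries (N_obj, d)
    cross-attend to flattened pixel features (HW, d) and are
    updated with a residual connection."""
    d = queries.shape[-1]
    attn = softmax(queries @ features.T / np.sqrt(d), axis=-1)  # (N_obj, HW)
    return queries + attn @ features                            # (N_obj, d)

# toy example: 2 object queries over a 4x4 feature map, feature dim 8
rng = np.random.default_rng(0)
queries = rng.standard_normal((2, 8))
features = rng.standard_normal((16, 8))
updated = query_decoder_block(queries, features)

# object-wise masks: per-pixel softmax over query-feature similarities,
# so each pixel's scores across the object queries sum to 1
masks = softmax(updated @ features.T / np.sqrt(8), axis=0)  # (2, 16)
```

In a full model this block would be stacked several times with multi-head attention, feed-forward layers, and normalization; the sketch keeps only the query-to-feature interaction that gives the method its object-centric character.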
Yi Zhou, Hui Zhang, Hana Lee, Shuyang Sun, Pingjun Li, Yangguang Zhu, ByungIn Yoo, Xiaojuan Qi, Jae-Joon Han