JOURNAL ARTICLE

Multi-class Semantic Video Segmentation with Exemplar-Based Object Reasoning

Abstract

We tackle the problem of semantic segmentation of dynamic scenes in video sequences. We propose to incorporate foreground object information into pixel labeling by jointly reasoning about the semantic labels of super-voxels, object instance tracks, and geometric relations between objects. We take an exemplar approach to object modeling, using a small set of object annotations and exploring the temporal consistency of object motion. After generating a set of moving object hypotheses, we design a CRF framework that jointly models the super-voxels and object instances. The optimal semantic labeling is inferred by MAP estimation of the model, which is solved by a single move-making-based optimization procedure. We demonstrate the effectiveness of our method on three public datasets and show that our model achieves results superior or comparable to the state of the art with less object-level supervision.
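
As a rough illustration of the kind of MAP inference the abstract refers to, the sketch below minimizes a toy Potts-style CRF energy over a few super-voxel nodes. It uses iterated conditional modes (ICM), a very simple scheme in which each move changes one node's label. It is not the paper's model or its optimization procedure: the object-instance and geometric-relation terms are omitted, and the names `energy`, `icm`, `unary`, `edges`, and `weight`, along with all numeric values, are hypothetical.

```python
def energy(labels, unary, edges, weight):
    """Toy CRF energy: unary costs plus a Potts penalty per disagreeing edge."""
    total = sum(unary[i][labels[i]] for i in range(len(labels)))
    total += weight * sum(1 for i, j in edges if labels[i] != labels[j])
    return total


def icm(unary, edges, weight, max_sweeps=20):
    """Approximate MAP labeling via iterated conditional modes (an assumed
    stand-in for the move-making optimizer mentioned in the abstract)."""
    n = len(unary)
    num_labels = len(unary[0])
    neighbors = [[] for _ in range(n)]
    for i, j in edges:
        neighbors[i].append(j)
        neighbors[j].append(i)

    # Start from the labeling that minimizes each unary cost independently.
    labels = [min(range(num_labels), key=lambda l, i=i: unary[i][l])
              for i in range(n)]

    for _ in range(max_sweeps):
        changed = False
        for i in range(n):
            def local_cost(l):
                # Cost of giving node i label l with its neighbors held fixed.
                disagreements = sum(1 for j in neighbors[i] if labels[j] != l)
                return unary[i][l] + weight * disagreements

            best = min(range(num_labels), key=local_cost)
            if local_cost(best) < local_cost(labels[i]):
                labels[i] = best
                changed = True
        if not changed:
            break  # No single-node move lowers the energy any further.
    return labels


# Hypothetical example: a chain of four super-voxels and two semantic labels.
unary = [[0.0, 2.0], [0.0, 2.0], [1.2, 1.0], [0.0, 2.0]]
edges = [(0, 1), (1, 2), (2, 3)]
weight = 0.5

labels = icm(unary, edges, weight)
print(labels, energy(labels, unary, edges, weight))
```

In this toy input, the third node weakly prefers label 1 on its own, but the smoothness term pulls it to agree with its neighbors, so the result is a uniform labeling. ICM only reaches a local minimum in general; stronger move-making methods such as alpha-expansion consider much larger moves per step.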

Keywords:
Computer science, Object (grammar), Artificial intelligence, Segmentation, Class (philosophy), Consistency (knowledge bases), Voxel, Set (abstract data type), Computer vision, Object model, Method, Pattern recognition (psychology), Natural language processing, Object-oriented programming

Metrics

Cited By: 15
FWCI (Field Weighted Citation Impact): 2.50
Refs: 40
Citation Normalized Percentile: 0.92

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
