JOURNAL ARTICLE

A Hierarchical Visual Model for Video Object Summarization

David LiuGang HuaTsuhan Chen

Year: 2010 Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Vol: 32 (12)Pages: 2178-2190   Publisher: IEEE Computer Society

Abstract

We propose a novel method for removing irrelevant frames from a video given user-provided frame-level labeling for a very small number of frames. We first hypothesize a number of windows which possibly contain the object of interest, and then determine which window(s) truly contain the object of interest. Our method enjoys several favorable properties. First, compared to approaches where a single descriptor is used to describe a whole frame, each window's feature descriptor has the chance of genuinely describing the object of interest; hence it is less affected by background clutter. Second, by considering the temporal continuity of a video instead of treating frames as independent, we can hypothesize the location of the windows more accurately. Third, by infusing prior knowledge into the patch-level model, we can precisely follow the trajectory of the object of interest. This allows us to largely reduce the number of windows and hence reduce the chance of overfitting the data during learning. We demonstrate the effectiveness of the method by comparing it to several other semi-supervised learning approaches on challenging video clips.

Keywords:
Computer science Artificial intelligence Overfitting Automatic summarization Object (grammar) Computer vision Window (computing) Frame (networking) Video tracking Feature (linguistics) Clutter Object detection Trajectory Pattern recognition (psychology) Artificial neural network Radar

Metrics

128
Cited By
5.44
FWCI (Field Weighted Citation Impact)
64
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.