DISSERTATION

Tracking non-rigid objects in video

Aeron Morgan BuchananBuchanan, A M

Year: 2008 University:   Oxford University Research Archive (ORA) (University of Oxford)   Publisher: University of Oxford

Abstract

Video is a sequence of 2D images of the 3D world generated by a camera. As the camera moves relative to the real scene and elements of that scene themselves move, correlated frame-to-frame changes in the video images are induced. Humans easily identify such changes as scene motion and can readily assess attempts to quantify it. For a machine, the identification of the 2D frame-to-frame motion is difficult. This problem is addressed by the computer vision process of tracking. Tracking underpins the solution to the problem of augmenting general video sequences with artificial imagery, a staple task in the visual effects industry. The problem is difficult because tracking in general video sequences is complicated by the presence of non-rigid motion, repeated texture and arbitrary occlusions. Existing methods provide solutions that rely on imposing limitations on the scenes that can be processed or that rely on human artistry and hard work. I introduce new paradigms, frameworks and algorithms for overcoming the challenges of processing general video and thus provide solutions that fill the gap between the `automated' and `manual' approaches. The work is easily sectioned into three parts, which can be considered separately or taken together for dealing with video without limitations. The initial focus is on directly addressing practical issues of human interaction in the tracking process: a new solution is developed by explicitly incorporating the user into an interactive algorithm. It is a novel tracking system based on fast full-frame patch searching and high-speed optimal track determination. This approach makes only minimal assumptions about motion and appearance, making it suitable for the widest variety of input video. I detail an implementation of the new system using k-d trees and dynamic programming. The second distinct contribution is an important extension to tracking algorithms in general. It can be noted that existing tracking algorithms occupy a spectrum in their use of global motion information. Local methods are easily confused by occlusions, repeated texture and image noise. Global motion models offer strong predictions to see through these difficulties and have been used in restricted circumstances, but are defeated by scenes containing independently moving objects or modest levels of non-rigid motion. I present a well principled way of combining local and global models to improve tracking, especially in these highly problematic cases. By viewing rank-constrained tracking as a probabilistic model of 2D tracks instead of 3D motion, I show how one can obtain a robust motion prior that can be easily incorporated in any existing tracking algorithm. The development of the global motion prior is based on rank-constrained factorization of measurement matrices. A common difficulty comes from the frequent occurrence of occlusions in video, which means that the relevant matrices are often not complete due to missing data. This defeats standard factorization algorithms. To fully explain and understand the algorithmic complexities of factorization in this practical context, I present a common notation for the direct comparison of existing algorithms and propose a new family of hybrid approaches that combine the superb initial performance of alternation methods with the convergence power of the Newton algorithm. Together, these investigations provide a wide-ranging, yet coherent exploration of tracking non-rigid objects in video.

Keywords:
Computer vision Computer science Artificial intelligence Video tracking Tracking (education) Process (computing) Frame (networking) Motion (physics) Task (project management) Focus (optics) Reference frame Video processing Engineering

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Computer Graphics and Visualization Techniques
Physical Sciences →  Computer Science →  Computer Graphics and Computer-Aided Design

Related Documents

JOURNAL ARTICLE

TRACKING NON-RIGID OBJECTS IN VIDEO SEQUENCES

Huiyu ZhouTangwei LiuJeffery ZhengFaquan LinYusheng PangJi Wu

Journal:   International Journal of Information Acquisition Year: 2006 Vol: 03 (02)Pages: 131-137
JOURNAL ARTICLE

Region tracking for non-rigid video objects in a non-parametric MAP framework

Chiou-Ting HsuMing-Shen Hsieh

Journal:   Signal Processing Image Communication Year: 2005 Vol: 21 (3)Pages: 235-251
JOURNAL ARTICLE

Hough-based tracking of non-rigid objects

Martin GodecPeter M. RothHorst Bischof

Journal:   Computer Vision and Image Understanding Year: 2012 Vol: 117 (10)Pages: 1245-1256
JOURNAL ARTICLE

Fast 3D tracking of non-rigid objects

Norimichi OkadaMartial Hebert

Year: 2004 Vol: 3 Pages: 3497-3503
© 2026 ScienceGate Book Chapters — All rights reserved.