DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for Visual Tracking

Hanxi Li; Yi Li; Fatih Porikli

doi:10.5244/c.28.56

ScienceGate Book Chapters

JOURNAL ARTICLE

DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for Visual Tracking

Hanxi Li Yi Li Fatih Porikli

Year: 2014 Pages: 56.1-56.12

DOI: 10.5244/c.28.56

Get Full-Text PDF Get Analytical Report

Abstract

Defining hand-crafted feature representations needs expert knowledge, requires timeconsuming manual adjustments, and besides, it is arguably one of the limiting factors of object tracking. In this paper, we propose a novel solution to automatically relearn the most useful feature representations during the tracking process in order to accurately adapt appearance changes, pose and scale variations while preventing from drift and tracking failures. We employ a candidate pool of multiple Convolutional Neural Networks (CNNs) as a data-driven model of different instances of the target object. Individually, each CNN maintains a specific set of kernels that favourably discriminate object patches from their surrounding background using all available low-level cues. These kernels are updated in an online manner at each frame after being trained with just one instance at the initialization of the corresponding CNN. Given a frame, the most promising CNNs in the pool are selected to evaluate the hypothesises for the target object. The hypothesis with the highest score is assigned as the current detection window and the selected models are retrained using a warm-start back-propagation which optimizes a structural loss function. In addition to the model-free tracker, we introduce a class-specific version of the proposed method that is tailored for tracking of a particular object class such as human faces. Our experiments on a large selection of videos from the recent benchmarks demonstrate that our method outperforms the existing state-of-the-art algorithms and rarely loses the track of the target object.

Keywords:

Computer science Artificial intelligence Discriminative model Convolutional neural network Initialization Feature (linguistics) Object (grammar) Pattern recognition (psychology) Video tracking Active appearance model Frame (networking) Object detection Set (abstract data type) Computer vision Class (philosophy) Tracking (education) Image (mathematics)

Metrics

143

Cited By

17.84

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Face recognition and analysis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Enhancement Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for Visual Tracking

Abstract

Metrics

Citation History

Topics

Related Documents

DeepTrack: Learning Discriminative Feature Representations Online for Robust Visual Tracking

Extended Siamese Convolutional Neural Networks for Discriminative Feature Learning

Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks

Feature selection accelerated convolutional neural networks for visual tracking

Joint Supervision for Discriminative Feature Learning in Convolutional Neural Networks