JOURNAL ARTICLE

DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for Visual Tracking

Abstract

Defining hand-crafted feature representations needs expert knowledge, requires timeconsuming manual adjustments, and besides, it is arguably one of the limiting factors of object tracking. In this paper, we propose a novel solution to automatically relearn the most useful feature representations during the tracking process in order to accurately adapt appearance changes, pose and scale variations while preventing from drift and tracking failures. We employ a candidate pool of multiple Convolutional Neural Networks (CNNs) as a data-driven model of different instances of the target object. Individually, each CNN maintains a specific set of kernels that favourably discriminate object patches from their surrounding background using all available low-level cues. These kernels are updated in an online manner at each frame after being trained with just one instance at the initialization of the corresponding CNN. Given a frame, the most promising CNNs in the pool are selected to evaluate the hypothesises for the target object. The hypothesis with the highest score is assigned as the current detection window and the selected models are retrained using a warm-start back-propagation which optimizes a structural loss function. In addition to the model-free tracker, we introduce a class-specific version of the proposed method that is tailored for tracking of a particular object class such as human faces. Our experiments on a large selection of videos from the recent benchmarks demonstrate that our method outperforms the existing state-of-the-art algorithms and rarely loses the track of the target object.

Keywords:
Computer science Artificial intelligence Discriminative model Convolutional neural network Initialization Feature (linguistics) Object (grammar) Pattern recognition (psychology) Video tracking Active appearance model Frame (networking) Object detection Set (abstract data type) Computer vision Class (philosophy) Tracking (education) Image (mathematics)

Metrics

143
Cited By
17.84
FWCI (Field Weighted Citation Impact)
21
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face recognition and analysis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

DeepTrack: Learning Discriminative Feature Representations Online for Robust Visual Tracking

Hanxi LiYi LiFatih Porikli

Journal:   IEEE Transactions on Image Processing Year: 2015 Vol: 25 (4)Pages: 1834-1848
JOURNAL ARTICLE

Extended Siamese Convolutional Neural Networks for Discriminative Feature Learning

Sangyun LeeSungjun Hong

Journal:   International Journal of Fuzzy Logic and Intelligent Systems Year: 2022 Vol: 22 (4)Pages: 339-349
JOURNAL ARTICLE

Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks

Alexey DosovitskiyPhilipp FischerJost Tobias SpringenbergMartin RiedmillerThomas Brox

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2015 Vol: 38 (9)Pages: 1734-1747
JOURNAL ARTICLE

Feature selection accelerated convolutional neural networks for visual tracking

Zhiyan CuiNa Lü

Journal:   Applied Intelligence Year: 2021 Vol: 51 (11)Pages: 8230-8244
BOOK-CHAPTER

Joint Supervision for Discriminative Feature Learning in Convolutional Neural Networks

Jianyuan GuoYuhui YuanChao Zhang

Communications in computer and information science Year: 2017 Pages: 509-520
© 2026 ScienceGate Book Chapters — All rights reserved.