Ran DuanChanghong FuKostas AlexisErdal Kayacan
In this paper, we develop an online learning-based visual tracking framework that can optimize the target model and estimate the scale variation for object tracking. We propose a recommender-based tracker, which is capable of selecting the representative convolutional neural network (CNN) layers and feature maps autonomously. In addition, the proposed recommender computes the weights of these layers and feature maps. A discriminative target percept of each recommended layer is reconstructed by the weighted sum of the recommended feature maps. Then the target model of the correlation filter is updated by the weighted sum of the target percepts. Thus, a sub-network is extracted from the pre-trained CNN backbone for the tracking process of a specific target. To deal with scale changes, we propose a spatiotemporal-based min-channel method to estimate the target size variation directly from CNN features. Experimental results on 50 benchmark datasets and video data from a rescue drone demonstrate that the proposed tracker is quite competitive with the state-of-the-art CNN-based trackers in terms of accuracy, scale adaptation, and robustness for UAV-related applications.
Xin WangZhiqiang HouWangsheng YuZefenfen JinYufei ZhaXianxiang Qin
Qiang GuoXuefei CaoQinglong Zou
Martin DanelljanG HagerFahad Shahbaz KhanMichael Felsberg
蔡玉柱 CAI Yu-zhu杨德东 Yang Dedong毛宁 Mao Ning杨福才 YANG Fu-cai
Wenjing KangXinyou LiGongliang LiuShaobo Wang