JOURNAL ARTICLE

Deeper Siamese network with multi‐level feature fusion for real‐time visual tracking

Kang YangHuihui SongKaihua ZhangJiaqing Fan

Year: 2019 Journal:   Electronics Letters Vol: 55 (13)Pages: 742-745   Publisher: Institution of Engineering and Technology

Abstract

In recent years, using Siamese network (SiamN) for visual tracking has witnessed a great success in terms of accuracy and efficiency. Nevertheless, most SiamN‐based trackers employ shallow network such as AlexNet to extract the top‐layer features as target representation that are less discriminative, usually leading to tracking performance degeneration when suffering from large deformation and similar distractors. A straightforward idea to address this issue is to replace the backbone network of SiamN with deeper ResNet. However, this cannot boost performance much due to the low resolution of high‐level feature maps with useful spatial details losing. To address this issue, the authors propose a lightweight yet effective feature agglomeration module (FAM) to adaptively fuse low‐level and high‐level features for robust tracking. Specifically, they first develop a generalised non‐local attention module to enhance the discriminative capability of high‐level semantic features. Then, they design an inception‐like module to enhance the representative power of low‐level features with more spatial details. Both types of features are then adaptively fused in the FAM to complement their characteristics. Extensive evaluations on OTB‐2015 and VOT2017 challenge demonstrate that the proposed tracker consistently achieves favourable performance against several state‐of‐the‐art trackers and runs at 50 fps.

Keywords:
Discriminative model BitTorrent tracker Computer science Fuse (electrical) Artificial intelligence Feature (linguistics) Tracking (education) Computer vision Pattern recognition (psychology) Eye tracking Representation (politics) Engineering

Metrics

8
Cited By
0.86
FWCI (Field Weighted Citation Impact)
21
Refs
0.77
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Impact of Light on Environment and Health
Physical Sciences →  Environmental Science →  Global and Planetary Change
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-feature fusion Siamese Network for Real-Time Object Tracking

Lijun ZhouHongyun LiJianlin Zhang

Journal:   Proceedings of the 2018 2nd International Conference on Computer Science and Artificial Intelligence Year: 2018 Pages: 478-481
JOURNAL ARTICLE

Siamese Network with Feature Fusion for Visual Tracking

Da LiYabing KangXing XiangWensheng TaoJiwei Hu

Journal:   2022 IEEE 6th Information Technology and Mechatronics Engineering Conference (ITOEC) Year: 2022 Pages: 1048-1052
JOURNAL ARTICLE

Deeper Siamese Network With Stronger Feature Representation for Visual Tracking

Chaoyi ZhangHoward WangJiwei WenLi Peng

Journal:   IEEE Access Year: 2020 Vol: 8 Pages: 119094-119104
JOURNAL ARTICLE

Multi-level prediction Siamese network for real-time UAV visual tracking

Mu ZhuHui ZhangJing ZhangLi Zhuo

Journal:   Image and Vision Computing Year: 2020 Vol: 103 Pages: 104002-104002
JOURNAL ARTICLE

Visual Tracking Method Based on Siamese Network with Multi-Feature Fusion

Qingdang LiRui XuMingyue ZhangZhen Sun

Journal:   Automatic Control and Computer Sciences Year: 2022 Vol: 56 (2)Pages: 150-159
© 2026 ScienceGate Book Chapters — All rights reserved.