JOURNAL ARTICLE

Siamese Network Tracker Based on Multi-Scale Feature Fusion

Jiaxu Zhao, Dapeng Niu

Year: 2023 Journal: Systems Vol: 11 (8) Pages: 434-434 Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The main task in visual object tracking is to follow a moving object through an image sequence. The object's trajectory and behavior can be described by calculating its position, velocity, acceleration, and other parameters, or by recording its position in each frame of the video. Visual object tracking therefore supports many higher-level tasks, performs well in real-world scenes, and is widely used in automated driving, traffic monitoring, human–computer interaction, and similar applications. Siamese-network-based trackers have received a great deal of attention from the tracking community, but they have many drawbacks. This paper analyzes the shortcomings of the Siamese network tracker in detail, improves the tracker through multi-scale feature fusion, and proposes a new target-tracking framework to address those shortcomings. A low-resolution feature map with strong semantic information is integrated with a high-resolution feature map with rich spatial information to improve the model's ability to represent an object, and the problem of scale change is addressed by fusing features at different scales. Furthermore, a 3D Max Filtering module is used to suppress repeated predictions of features at different scales. Finally, experiments on four tracking benchmarks (OTB2015, VOT2016, VOT2018, and GOT10K) show that the proposed algorithm effectively improves the tracking accuracy and robustness of the system.
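The two ideas named in the abstract, fusing a low-resolution semantic map with a high-resolution spatial map and max filtering across scales to suppress repeated predictions, can be illustrated with a small sketch. This is not the authors' implementation: the additive fusion rule, the nearest-neighbour upsampling, the function names, and the "keep only the local 3D maximum" reading of 3D max filtering are all assumptions chosen for brevity, and plain Python lists stand in for feature tensors.

```python
# Illustrative sketch only. Function names, the additive fusion rule, and the
# peak-keeping use of the 3D max filter are assumptions, not the paper's method.

def upsample_nearest(fmap, factor):
    """Nearest-neighbour upsampling of a 2D map given as a list of rows."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out

def fuse(high_res, low_res):
    """Add an upsampled low-resolution (semantic) map to a high-resolution (spatial) map."""
    factor = len(high_res) // len(low_res)  # assumes an integer size ratio
    up = upsample_nearest(low_res, factor)
    return [[h + u for h, u in zip(h_row, u_row)]
            for h_row, u_row in zip(high_res, up)]

def max_filter_3d(pyramid, radius=1):
    """Max over a spatial window in the same scale and in the adjacent scales.

    Assumes every map in `pyramid` has already been resized to the same H x W.
    """
    n_scales, height, width = len(pyramid), len(pyramid[0]), len(pyramid[0][0])
    out = [[[0.0] * width for _ in range(height)] for _ in range(n_scales)]
    for s in range(n_scales):
        for y in range(height):
            for x in range(width):
                out[s][y][x] = max(
                    pyramid[ss][yy][xx]
                    for ss in range(max(0, s - 1), min(n_scales, s + 2))
                    for yy in range(max(0, y - radius), min(height, y + radius + 1))
                    for xx in range(max(0, x - radius), min(width, x + radius + 1))
                )
    return out

def suppress_duplicates(pyramid, radius=1):
    """Keep a response only where it equals its local 3D maximum; zero it elsewhere."""
    filtered = max_filter_3d(pyramid, radius)
    return [[[v if v == m else 0.0 for v, m in zip(row, m_row)]
             for row, m_row in zip(fmap, m_map)]
            for fmap, m_map in zip(pyramid, filtered)]

# Fusion: a 2x2 semantic map is upsampled and added to a 4x4 spatial map.
high = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
low = [[10, 20], [30, 40]]
fused = fuse(high, low)

# Suppression: the same object responds at two scales; only the stronger peak survives.
scale0 = [[0.1, 0.2, 0.1], [0.2, 0.9, 0.2], [0.1, 0.2, 0.1]]
scale1 = [[0.1, 0.1, 0.1], [0.1, 0.7, 0.1], [0.1, 0.1, 0.1]]
kept = suppress_duplicates([scale0, scale1])
```

In this toy input the 0.7 response at the second scale overlaps the 0.9 response at the first, so it is zeroed as a repeated prediction. In a real tracker the same operations would typically run on multi-channel tensors, with a learned fusion layer in place of the plain addition.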

Keywords:
BitTorrent tracker; Artificial intelligence; Computer vision; Computer science; Video tracking; Robustness (evolution); Feature (linguistics); Frame (networking); Tracking (education); Process (computing); Object (grammar); Eye tracking

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 48
Citation Normalized Percentile: 0.10

Topics

Video Surveillance and Tracking Methods
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
AI and Multimedia in Education
Physical Sciences → Computer Science → Artificial Intelligence
Human Pose and Action Recognition
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

SiamMBFAN: Siamese tracker with multi-branch feature aggregation network

Hao Zhang, Yan Piao, Bailiang Huang, Baolin Tan

Journal: Journal of Visual Communication and Image Representation Year: 2022 Vol: 89 Pages: 103671-103671

JOURNAL ARTICLE

Visual Tracking Method Based on Siamese Network with Multi-Feature Fusion

Qingdang Li, Rui Xu, Mingyue Zhang, Zhen Sun

Journal: Automatic Control and Computer Sciences Year: 2022 Vol: 56 (2) Pages: 150-159

JOURNAL ARTICLE

A strong feature representation for siamese network tracker

Zhipeng Zhou, Rui Zhang, Dong Yin

Journal: Multimedia Tools and Applications Year: 2020 Vol: 79 (35-36) Pages: 25873-25887