JOURNAL ARTICLE

SMSTracker: A Self-Calibration Multi-Head Self-Attention Transformer for Visual Object Tracking

Zhongyang WangHu ZhuFeng Liu

Year: 2024 Journal:   Computers, materials & continua/Computers, materials & continua (Print) Vol: 80 (1)Pages: 605-623

Abstract

Visual object tracking plays a crucial role in computer vision. In recent years, researchers have proposed various methods to achieve high-performance object tracking. Among these, methods based on Transformers have become a research hotspot due to their ability to globally model and contextualize information. However, current Transformer-based object tracking methods still face challenges such as low tracking accuracy and the presence of redundant feature information. In this paper, we introduce self-calibration multi-head self-attention Transformer (SMSTracker) as a solution to these challenges. It employs a hybrid tensor decomposition self-organizing multi-head self-attention transformer mechanism, which not only compresses and accelerates Transformer operations but also significantly reduces redundant data, thereby enhancing the accuracy and efficiency of tracking. Additionally, we introduce a self-calibration attention fusion block to resolve common issues of attention ambiguities and inconsistencies found in traditional tracking methods, ensuring the stability and reliability of tracking performance across various scenarios. By integrating a hybrid tensor decomposition approach with a self-organizing multi-head self-attentive transformer mechanism, SMSTracker enhances the efficiency and accuracy of the tracking process. Experimental results show that SMSTracker achieves competitive performance in visual object tracking, promising more robust and efficient tracking systems, demonstrating its potential to provide more robust and efficient tracking solutions in real-world applications.

Keywords:
Computer science Transformer Computer vision Eye tracking Artificial intelligence Engineering Electrical engineering Voltage

Metrics

2
Cited By
1.04
FWCI (Field Weighted Citation Impact)
83
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Brain Tumor Detection and Classification
Life Sciences →  Neuroscience →  Neurology
CCD and CMOS Imaging Sensors
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Infrared Target Detection Methodologies
Physical Sciences →  Engineering →  Aerospace Engineering

Related Documents

JOURNAL ARTICLE

Robust Visual Tracking Using Hierarchical Vision Transformer with Shifted Windows Multi-Head Self-Attention

Peng GaoXin-Yue ZHANGXiaoli YangJiancheng NiFei Wang

Journal:   IEICE Transactions on Information and Systems Year: 2023 Vol: E107.D (1)Pages: 161-164
JOURNAL ARTICLE

Multi-Head-Self-Attention based YOLOv5X-transformer for multi-scale object detection

Ponduri VasanthiLaavanya Mohan

Journal:   Multimedia Tools and Applications Year: 2023 Vol: 83 (12)Pages: 36491-36517
BOOK-CHAPTER

LMSA: Low-Relation Multi-head Self-attention Mechanism in Visual Transformer

Jingjie WangXiang WeiXiaoyu LiuSiyang Lu

Lecture notes in electrical engineering Year: 2023 Pages: 609-619
JOURNAL ARTICLE

Local-Global Self-Attention for Transformer-Based Object Tracking

Langkun ChenLong GaoaYan JiangYunsong LiGang HeaJifeng Ningc

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2024 Vol: 34 (12)Pages: 12316-12329
© 2026 ScienceGate Book Chapters — All rights reserved.