Object Tracking Algorithm Based on Multi-Layer Feature Fusion and Semantic Enhancement

Jing Wang; Yanru Wang; Dan Yuan; Ying Que; Weichao Huang; Wei Yuan

doi:10.3390/app15137228

ScienceGate Book Chapters

JOURNAL ARTICLE

Object Tracking Algorithm Based on Multi-Layer Feature Fusion and Semantic Enhancement

Jing Wang Yanru Wang Dan Yuan Ying Que Weichao Huang Wei Yuan

Year: 2025 Journal: Applied Sciences Vol: 15 (13)Pages: 7228-7228 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/app15137228

Get Full-Text PDF Get Analytical Report

Abstract

The TransT object tracking algorithm, built on Transformer architecture, effectively integrates deep feature extraction with attention mechanisms, thereby enhancing the stability and accuracy of the algorithm. However, this algorithm exhibits insufficient tracking accuracy and boundary box drift when dealing with similar background clutter, which directly affects the subsequent tracking process. To overcome this problem, this paper constructs a semantic enhancement model, which utilizes multi-layer feature representations extracted from deep networks, and correlates and fuses shallow features with deep features by using cross-attention. At the same time, in order to adapt to the changes in the surrounding environment of the object and establish good discrimination with similar objects, this paper proposes a dynamic mask strategy to optimize the attention allocation mechanism and finally employs an object template update mechanism to improve the adaptability of the model by comparing the spatio-temporal information of successive frames to update the object template in time, further enhancing its tracking performance in complex scenes. Experimental comparison results demonstrate that the algorithm proposed in this paper can effectively handle similar background clutter, leading to a significant improvement in the overall performance of the tracking model.

Keywords:

Computer science Artificial intelligence Feature (linguistics) Object (grammar) Computer vision Layer (electronics) Fusion Pattern recognition (psychology) Materials science Linguistics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.20

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Fire Detection and Safety Systems

Physical Sciences → Engineering → Safety, Risk, Reliability and Quality

Remote Sensing and Land Use

Physical Sciences → Earth and Planetary Sciences → Atmospheric Science

Object Tracking Algorithm Based on Multi-Layer Feature Fusion and Semantic Enhancement

Abstract

Metrics

Topics

Related Documents

Multi-Object Tracking Algorithm Based on Multi-layer Feature Adaptive Fusion

Pedestrian Multi-object Tracking Algorithm Based on Attention Feature Fusion

DETrack: Multi-Object Tracking Algorithm Based on Feature Decomposition and Feature Enhancement

A Feature Fusion Based Object Tracking Algorithm

JDE multi-object tracking algorithm integrating multi-level semantic enhancement