JOURNAL ARTICLE

Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos

Qingyu XuLongguang WangWeidong ShengYingqian WangChao XiaoChao MaWei An

Year: 2024 Journal:   IEEE Transactions on Multimedia Vol: 26 Pages: 9383-9397   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Tracking multiple tiny objects is highly challenging due to their weak appearance and limited features. Existing multi-object tracking algorithms generally focus on singlemodality scenes, and overlook the complementary characteristics of tiny objects captured by multiple remote sensors. To enhance tracking performance by integrating complementary information from multiple sources, we propose a novel framework called HGT-Track (Heterogeneous Graph Transformer based Multi-Tiny-Object Tracking). Specifically, we first employ a Transformer-based encoder to embed images from different modalities. Subsequently, we utilize Heterogeneous Graph Transformer to aggregate spatial and temporal information from multiple modalities to generate detection and tracking features. Additionally, we introduce a target re-detection module (ReDet) to ensure tracklet continuity by maintaining consistency across different modalities. Furthermore, this paper introduces the first benchmark VT-Tiny-MOT (Visible-Thermal Tiny MultiObject Tracking) for RGB-T fused multiple tiny object tracking. Extensive experiments are conducted on VT-Tiny-MOT, and the results have demonstrated the effectiveness of our method. Compared to other state-of-the-art methods, our method achieves better performance in terms of MOTA (Multiple-Object Tracking Accuracy) and ID-F1 score. The code and dataset will be made available at https://github.com/xuqingyu26/HGTMT

Keywords:
Computer science Computer vision Artificial intelligence Video tracking Graph RGB color model Transformer Object (grammar) Theoretical computer science

Metrics

21
Cited By
75.10
FWCI (Field Weighted Citation Impact)
73
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Computing and Algorithms
Social Sciences →  Social Sciences →  Urban Studies
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
CCD and CMOS Imaging Sensors
Physical Sciences →  Engineering →  Electrical and Electronic Engineering

Related Documents

BOOK-CHAPTER

Multiple-criteria-Based Object Pose Tracking in RGB Videos

Mateusz MajcherBogdan Kwolek

Lecture notes in computer science Year: 2022 Pages: 477-490
JOURNAL ARTICLE

MOLTR: Multiple Object Localization, Tracking and Reconstruction From Monocular RGB Videos

Kejie LiHamid RezatofighiIan Reid

Journal:   IEEE Robotics and Automation Letters Year: 2021 Vol: 6 (2)Pages: 3341-3348
JOURNAL ARTICLE

Heterogeneous graph network for online multiple object tracking

Junwen ZhangXiaolong ZhangZiqi ZhuChunhua Deng

Journal:   Knowledge-Based Systems Year: 2025 Vol: 326 Pages: 114072-114072
JOURNAL ARTICLE

Semantic‐guided fusion for multiple object tracking and RGB‐T tracking

Xiaohu LiuYichuang LuoYan ZhangZhiyong Lei

Journal:   IET Image Processing Year: 2023 Vol: 17 (11)Pages: 3281-3291
© 2026 ScienceGate Book Chapters — All rights reserved.