JOURNAL ARTICLE

Cross-modal vehicle re-identification based on multi-scale features and attention mechanism

Abstract

Our work focuses on exploring the emerging field of cross-modal vehicle re-identification. Achieving accurate cross-modal vehicle re-identification requires a network that can capture local details from two different modality images while effectively fusing their valid information. However, existing methods only consider extracting high-level semantics, leading to a loss of fine-grained details and imprecise identification. Additionally, insufficient attention has been paid to effective information in different modalities, as cross-modality interaction has not been thoroughly explored. To address these issues, we propose a new cross-modal vehicle re-identification network consisting of a multi-scale feature fusion module and a cross-modal attention module. Specifically, the multiscale feature fusion module captures both global high-level semantics and local details by integrating multi-scale information in the feature extraction process, reducing the loss of local details. The cross-modal attention module explores valid information from different modalities and achieves feature-level fusion. We conducted experiments on the RGBNT100 cross-modal vehicle re-identification dataset to verify the proposed method's effectiveness.

Keywords:
Modal Computer science Identification (biology) Modality (human–computer interaction) Semantics (computer science) Feature (linguistics) Feature extraction Process (computing) Modalities Artificial intelligence Scale (ratio) Data mining Pattern recognition (psychology)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
36
Refs
0.02
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-scale attention vehicle re-identification

Aihua ZhengXianmin LinJiacheng DongWenzhong WangJin TangBin Luo

Journal:   Neural Computing and Applications Year: 2020 Vol: 32 (23)Pages: 17489-17503
JOURNAL ARTICLE

Cross-modal pedestrian re-identification technique based on multi-scale feature attention and strategy balancing

Yiqiang Lai

Journal:   Engineering Research Express Year: 2025 Vol: 7 (1)Pages: 015273-015273
JOURNAL ARTICLE

Cross-modal person re-identification using fused local effective features and multi-scale features

Lihui LüRifan WangZhencong ChenJiaqi Chen

Journal:   Transactions of the Institute of Measurement and Control Year: 2024 Vol: 47 (14)Pages: 2827-2835
© 2026 ScienceGate Book Chapters — All rights reserved.