Our work explores the emerging field of cross-modal vehicle re-identification. Accurate cross-modal vehicle re-identification requires a network that captures local details from images of two different modalities while effectively fusing their valid information. However, existing methods consider only high-level semantics, leading to a loss of fine-grained detail and imprecise identification. Moreover, the valid information in each modality has received insufficient attention, as cross-modality interaction has not been thoroughly explored. To address these issues, we propose a new cross-modal vehicle re-identification network consisting of a multi-scale feature fusion module and a cross-modal attention module. Specifically, the multi-scale feature fusion module captures both global high-level semantics and local details by integrating multi-scale information during feature extraction, reducing the loss of local detail. The cross-modal attention module exploits valid information from the different modalities and achieves feature-level fusion. We conducted experiments on the RGBNT100 cross-modal vehicle re-identification dataset to verify the proposed method's effectiveness.
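To make the two modules concrete, the following is a minimal NumPy sketch, not the paper's implementation: multi-scale fusion is illustrated as pyramid average-pooling over a feature map, and cross-modal attention as scaled dot-product attention where tokens of one modality (e.g. RGB) attend over another (e.g. NIR), followed by a residual fusion. All shapes, scale choices, and the residual form are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multi_scale_descriptor(fmap, scales=(1, 2, 4)):
    """Pyramid pooling sketch of multi-scale fusion (assumed scales).

    fmap: (H, W, C) feature map. Each scale s partitions the map into an
    s x s grid; each cell is average-pooled, so coarse scales keep global
    semantics and fine scales keep local details.
    Returns (1 + 4 + 16, C) = (21, C) tokens for the default scales.
    """
    H, W, C = fmap.shape
    parts = []
    for s in scales:
        for i in range(s):
            for j in range(s):
                cell = fmap[i * H // s:(i + 1) * H // s,
                            j * W // s:(j + 1) * W // s]
                parts.append(cell.mean(axis=(0, 1)))
    return np.stack(parts)

def cross_modal_attention(tok_a, tok_b):
    """Sketch of cross-modal attention: modality A queries modality B.

    tok_a: (Na, C) query tokens, tok_b: (Nb, C) key/value tokens.
    Residual fusion of A with B's attended features (an assumption).
    """
    d = tok_a.shape[-1]
    attn = softmax(tok_a @ tok_b.T / np.sqrt(d))  # (Na, Nb) weights
    return tok_a + attn @ tok_b                   # (Na, C) fused tokens

# Toy feature maps standing in for RGB and NIR backbone outputs.
rng = np.random.default_rng(0)
rgb_map = rng.standard_normal((8, 8, 32))
nir_map = rng.standard_normal((8, 8, 32))
rgb_tok = multi_scale_descriptor(rgb_map)   # (21, 32)
nir_tok = multi_scale_descriptor(nir_map)   # (21, 32)
fused = cross_modal_attention(rgb_tok, nir_tok)
print(fused.shape)  # (21, 32)
```

In a real network these operations would act on learned backbone features with projection weights; the sketch only shows the data flow the abstract describes.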
Aihua Zheng, Xianmin Lin, Jiacheng Dong, Wenzhong Wang, Jin Tang, Bin Luo
Geyan Su, Zhonghua Sun, Kebin Jia, Jinchao Feng
Lihui Lü, Rifan Wang, Zhencong Chen, Jiaqi Chen