Visible-infrared person re-identification (VI-ReID) is a challenging cross-modality retrieval task. Existing methods usually focus on extracting discriminative visual features while ignoring the reliability and commonality of visual features across modalities. In this paper, we propose a new deep learning framework, called Multi-scale Local progressive Transformer (MLT), for effective VI-ReID. To reduce the negative impact of the modality gap, we first introduce grayscale images as an auxiliary modality, adopt a Transformer model as the baseline, and propose a progressive learning strategy. We further fuse the Sea attention mechanism with DilateFormer to improve the discriminability of reliable features, and verify its effectiveness through ablation experiments.
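As a minimal sketch of the auxiliary-modality idea described above (not the paper's exact pipeline): a visible RGB image can be converted to a grayscale image and replicated to three channels so it passes through the same backbone as visible and infrared inputs, acting as an intermediate modality during training. The function name, luma coefficients, and batch shapes below are assumptions for illustration.

```python
import torch

def to_grayscale_modality(rgb_batch: torch.Tensor) -> torch.Tensor:
    """Turn a batch of visible (RGB) images into a grayscale auxiliary
    modality: single-channel luminance replicated to 3 channels, so it
    is shape-compatible with the visible/infrared branches.

    rgb_batch: (N, 3, H, W) tensor, channels in RGB order.
    Returns: (N, 3, H, W) grayscale tensor.
    """
    # ITU-R BT.601 luma coefficients (an assumed choice; any standard
    # RGB-to-gray conversion would serve the same purpose here).
    weights = torch.tensor([0.299, 0.587, 0.114],
                           device=rgb_batch.device,
                           dtype=rgb_batch.dtype).view(1, 3, 1, 1)
    gray = (rgb_batch * weights).sum(dim=1, keepdim=True)  # (N, 1, H, W)
    return gray.expand(-1, 3, -1, -1)                      # (N, 3, H, W)

# Usage: pair each visible batch with its grayscale counterpart so the
# model sees visible, grayscale, and infrared samples during training.
visible = torch.rand(8, 3, 256, 128)  # hypothetical batch and image size
gray = to_grayscale_modality(visible)
```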
Zifei Qin, Peishun Liu, Yibei Liu, Haiping Duan, Li Fei-Fei, Han Wang