JOURNAL ARTICLE

Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification

Abstract

Visible-infrared person re-identification (VI-ReID) aims to search identities of pedestrians across different spectra. In this task, one of the major challenges is the modality discrepancy between the visible (VIS) and infrared (IR) images. Some state-of-the-art methods try to design complex networks or generative methods to mitigate the modality discrepancy while ignoring the highly non-linear relationship between the two modalities of VIS and IR. In this paper, we propose a non-linear middle modality generator (MMG), which helps to reduce the modality discrepancy. Our MMG can effectively project VIS and IR images into a unified middle modality image (UMMI) space to generate middle-modality (M-modality) images. The generated M-modality images and the original images are fed into the backbone network to reduce the modality discrepancy.Furthermore, in order to pull together the two types of M-modality images generated from the VIS and IR images in the UMMI space, we propose a distribution consistency loss (DCL) to make the modality distribution of the generated M-modalities images as consistent as possible. Finally, we propose a middle modality network (MMN) to further enhance the discrimination and richness of features in an explicit manner. Extensive experiments have been conducted to validate the superiority of MMN for VI-ReID over some state-of-the-art methods on two challenging datasets. The gain of MMN is more than 11.1% and 8.4% in terms of Rank-1 and mAP, respectively, even compared with the latest state-of-the-art methods on the SYSU-MM01 dataset.

Keywords:
Modality (human–computer interaction) Artificial intelligence Computer science Modalities Pattern recognition (psychology) Computer vision

Metrics

181
Cited By
11.04
FWCI (Field Weighted Citation Impact)
60
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Adaptive Middle Modality Alignment Learning for Visible-Infrared Person Re-identification

Yukang ZhangYan YanYang LuHanzi Wang

Journal:   International Journal of Computer Vision Year: 2024 Vol: 133 (4)Pages: 2176-2196
JOURNAL ARTICLE

Hybrid Modality Metric Learning for Visible-Infrared Person Re-Identification

Zhang LaHaiyun GuoKuan ZhuHonglin QiaoGaopan HuangSen ZhangHuichen ZhangJian SunJinqiao Wang

Journal:   ACM Transactions on Multimedia Computing Communications and Applications Year: 2022 Vol: 18 (1s)Pages: 1-15
JOURNAL ARTICLE

Syncretic Modality Collaborative Learning for Visible Infrared Person Re-Identification

Ziyu WeiXi YangNannan WangXinbo Gao

Journal:   2021 IEEE/CVF International Conference on Computer Vision (ICCV) Year: 2021 Pages: 225-234
© 2026 ScienceGate Book Chapters — All rights reserved.