Jianbing Wu, Hong Liu, Wei Shi, Mengyuan Liu, Wenhao Li
One main challenge of visible-infrared person re-identification (VI Re-ID) lies in the large style discrepancy between the heterogeneous data. We present a STyle-Agnostic Representation learning (STAR) framework that bridges the modality gap at both the data and feature levels in a progressive manner. At the data level, we present Cross Modality Blending (CMB), a powerful and parameter-free data augmentation scheme that synthesizes intermediate modalities through identity-preserving patch exchange followed by smooth cross-modality blending. At the feature level, we explore the inter-modality feature alignment problem from a new perspective: style-related feature statistics. Specifically, we design a plug-and-play Adaptive Style Normalization (ASN) module that discards intrinsic style distractors without losing discriminative content, via dual-level adaptive distribution normalization and discriminability compensation. Moreover, considering that an appropriate modality intermediary can convey relevant information about the inter-modality distribution shift, we propose Reciprocal Modality Bridging Learning (RMBL) to better steer the modality bridging process. Two lightweight modality transformation modules in RMBL model an appropriate intermediate space by manipulating high-order statistics under our shortest-distance constraint. Meanwhile, intermediary-guided distribution alignment is reciprocally conducted to align heterogeneous features to the modality intermediary. Experiments on VI Re-ID benchmarks demonstrate the superiority and flexibility of STAR over state-of-the-art methods.
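The data-level augmentation described above can be illustrated with a minimal sketch. The function below is a hypothetical illustration (not the paper's implementation): it swaps a few aligned patches between a visible and an infrared image of the same identity, then linearly blends the pair with a random weight to synthesize an intermediate-modality sample. The function name, patch count, and blending scheme are assumptions for illustration only.

```python
import numpy as np

def cross_modality_blend(vis, ir, num_patches=4, rng=None):
    """Hypothetical sketch of an intermediate-modality augmentation:
    exchange aligned patches between a visible and an infrared image
    of the same identity, then blend the pair with a random weight."""
    rng = rng or np.random.default_rng()
    h, w = vis.shape[:2]
    mixed_vis = vis.astype(np.float32).copy()
    mixed_ir = ir.astype(np.float32).copy()
    ph, pw = h // 4, w // 4  # assumed patch size: a quarter of each side
    # Identity-preserving patch exchange: swap spatially aligned regions,
    # so both images still depict the same person.
    for _ in range(num_patches):
        y = int(rng.integers(0, h - ph + 1))
        x = int(rng.integers(0, w - pw + 1))
        mixed_vis[y:y+ph, x:x+pw], mixed_ir[y:y+ph, x:x+pw] = (
            ir[y:y+ph, x:x+pw].astype(np.float32),
            vis[y:y+ph, x:x+pw].astype(np.float32),
        )
    # Smooth cross-modality blending toward an intermediate modality.
    lam = float(rng.uniform(0.0, 1.0))
    return lam * mixed_vis + (1.0 - lam) * mixed_ir
```

Because the output is a convex combination of the two patch-exchanged images, its pixel values always stay within the range spanned by the inputs, yielding a smooth spectrum of intermediate modalities as the blending weight varies.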