JOURNAL ARTICLE

Transformer Based Multi-Grained Features for Unsupervised Person Re-Identification

Abstract

Multi-grained features extracted from convolutional neural networks (CNNs) have demonstrated their strong dis-crimination ability in supervised person re-identification (Re-ID) tasks. Inspired by them, this work investigates the way of extracting multi-grained features from a pure transformer network to address the unsupervised Re-ID problem that is label-free but much more challenging. To this end, we build a dual-branch network architecture based upon a modified Vision Transformer (ViT). The local tokens output in each branch are reshaped and then uniformly partitioned into multiple stripes to generate part-level features, while the global tokens of two branches are averaged to produce a global feature. Further, based upon offline-online associated camera-aware proxies (02CAP) that is a top-performing unsupervised Re-ID method, we define offline and online contrastive learning losses with respect to both global and part-level features to conduct unsupervised learning. Extensive experiments on three person Re-ID datasets show that the proposed method outperforms state-of-the-art unsupervised methods by a considerable margin, greatly mitigating the gap to supervised counterparts. Code will be available soon at https://github.com/RikoLi/WACV23-workshop-TMGF.

Keywords:
Computer science Unsupervised learning Artificial intelligence Convolutional neural network Transformer Margin (machine learning) Feature learning Machine learning Pattern recognition (psychology) Feature extraction Artificial neural network Deep learning

Metrics

35
Cited By
6.37
FWCI (Field Weighted Citation Impact)
75
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Grained feature aggregation based on Transformer for unsupervised person re-identification

Zhongmin LiuChangkai Zhang

Journal:   Journal of Control Engineering and Applied Informatics Year: 2024 Vol: 26 (1)
JOURNAL ARTICLE

Transformer-based Contrastive Learning for Unsupervised Person Re-Identification

Yusheng TaoJian ZhangTianquan ChenYuqing WangYuesheng Zhu

Journal:   2022 International Joint Conference on Neural Networks (IJCNN) Year: 2022 Pages: 1-9
© 2026 ScienceGate Book Chapters — All rights reserved.