Chih-Hao Lin, Yu-Hsuan Tseng, Pei-Chen Wu, Cheng-Yu Huang, Meng-Ying Lai
Fine-grained image recognition aims to accurately distinguish subclasses within the same major category. Because inter-class differences are subtle and annotation costs are high, it has long been a significant challenge in computer vision. This study proposes a self-supervised image recognition framework that integrates multi-scale attention mechanisms with contrastive learning, enabling efficient, high-quality feature extraction without manual annotation. The method uses a multi-level attention module to mine both local and global image information, while a momentum encoding strategy and data augmentation generate positive and negative sample pairs for contrastive training. Experimental results on standard datasets such as CUB-200-2011 and FGVC-Aircraft show that the proposed method achieves Top-1 recognition accuracies of 89.2% and 87.5%, respectively, a significant improvement over current mainstream methods.
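The two contrastive ingredients named in the abstract, a momentum-updated key encoder and positive/negative pair scoring, can be illustrated with a minimal sketch. This is not the paper's implementation; the function names, the momentum coefficient 0.999, and the temperature 0.07 are assumptions borrowed from common MoCo-style setups, and real encoders would be deep networks rather than flat parameter lists.

```python
import math

def momentum_update(q_params, k_params, m=0.999):
    # EMA update: the key encoder slowly tracks the query encoder (MoCo-style).
    # q_params / k_params are flat lists standing in for network weights.
    return [m * k + (1.0 - m) * q for q, k in zip(q_params, k_params)]

def cosine(u, v):
    # Cosine similarity between two feature vectors.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def info_nce(query, pos_key, neg_keys, temperature=0.07):
    # InfoNCE contrastive loss: the augmented positive pair should score
    # higher than every negative; the positive logit sits at index 0.
    logits = [cosine(query, pos_key) / temperature]
    logits += [cosine(query, k) / temperature for k in neg_keys]
    mx = max(logits)  # subtract max for numerical stability
    exps = [math.exp(l - mx) for l in logits]
    return -math.log(exps[0] / sum(exps))
```

A query aligned with its positive key yields a near-zero loss, while a misaligned positive raises it, which is the gradient signal that pulls augmented views of the same image together and pushes other images apart.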