Revisiting Fine-grained Image Analysis by Semantic-Part Alignment

Qi Bi; Jingjun Yi; Haolan Zhan; Wei Ji; Bo Du

doi:10.1109/tip.2025.3649364

ScienceGate Book Chapters

JOURNAL ARTICLE

Revisiting Fine-grained Image Analysis by Semantic-Part Alignment

Qi Bi Jingjun Yi Haolan Zhan Wei Ji Bo Du

Year: 2026 Journal: IEEE Transactions on Image Processing Vol: PP Pages: 1-1 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tip.2025.3649364

Get Full-Text PDF Get Analytical Report

Abstract

Fine-grained image analysis is widely recognized as highly challenging, since distinguishing individual differences within a certain category, species, or type often depends on tiny, subtle patterns. However, learning fine-grained semantic categories from these subtle part patterns is inherently fragile, as they can easily be overwhelmed by the dominant patterns resting in the coarse-category information. Therefore, how to enhance the relation between the fine-grained semantics and these subtle patterns is the key. To push this frontier, a novel semantic-part alignment (SPA) learning scheme is proposed in this paper. Its general idea is to firstly measure the relevance of each part to the fine-grained semantics, and then regularize the fine-grained visual representation learning. Specifically, it consists of three key components, namely, joint semantic-part modeling, semantic-part set modeling, and optimal semantic-part transport. The joint semantic-part modeling associates each part in an image with the fine-grained semantics in a latent space. Then, the optimal semantic-part transport component is devised to enhance the relation between fine-grained semantic embeddings and the discriminative part embeddings. Notably, the proposed SPA is plug-in-and-play, easy-to-implement, and insensitive to the latent embedding dimension and loss weight. Experiments show the proposed method can substantially boost performance on multiple fine-grained image analysis tasks across various baselines.

Keywords:

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.77

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Generative Adversarial Networks and Image Synthesis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Medical Image Segmentation Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Face recognition and analysis

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Revisiting Fine-grained Image Analysis by Semantic-Part Alignment

Abstract

Metrics

Topics

Related Documents

Semantic-Guided Information Alignment Network for Fine-Grained Image Recognition

Ultra Fine-Grained Image Semantic Embedding

Domain Adaptative Semantic Segmentation by Fine-Grained Alignment

Fine-grained shoe image retrieval by part detection and semantic network

Tacoma: Enhanced Browser Fuzzing with Fine-Grained Semantic Alignment