JOURNAL ARTICLE

Semantic-Guided Information Alignment Network for Fine-Grained Image Recognition

Shijie WangZhihui WangHaojie LiJianlong ChangWanli OuyangQi Tian

Year: 2023 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 33 (11)Pages: 6558-6570   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Existing fine-grained image recognition works have attempted to dig into low-level details for emphasizing subtle discrepancies among sub-categories. However, a potential limitation of these methods is that they integrate the low-level details and high-level semantics directly, and neglect their content complementarity and spatial corresponding correlation. To handle this limitation, we propose an end-to-end Semantic-guided Information Alignment Network (SIA-Net) to dynamically pick out the low-level details under the guidance of accurate semantics to make selected details spatially corresponding to high-level semantics and complementary in content. Technically, SIA-Net consists of an Accurate Semantic Calibration (ASC) module for providing accurate semantics and a Discriminative Feature Alignment (DFA) module for aggregating low-level details and high-level semantics using accurate semantics generated by ASC. ASC learns the pixel-level feature shifting caused by convolutional operations, which is utilized for replacing the incorrectly highlighted semantics by shifting discriminative semantics or background features. After obtaining the accurate semantic features, DFA digs into the complementary details and simultaneously makes the selected details spatially corresponding via applying the guidance of accurate semantics to obtain the reassembly features. Finally, the reassembly features, which serve as discriminative cues, are used for more accurate discriminative region localization. Extensive experiments verify that our proposed method yields the best performance under the same settings with the most competitive approaches on CUB-birds, Stanford-Cars, and FGVC Aircraft datasets.

Keywords:
Discriminative model Computer science Semantics (computer science) Artificial intelligence Feature (linguistics) Convolutional neural network Pattern recognition (psychology) Feature extraction Image (mathematics) Natural language processing Information retrieval Programming language

Metrics

22
Cited By
4.00
FWCI (Field Weighted Citation Impact)
71
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Alignment and localization in fine-grained image recognition

Hanselmann, Harald

Journal:   RWTH Publications (RWTH Aachen) Year: 2020
JOURNAL ARTICLE

Revisiting Fine-grained Image Analysis by Semantic-Part Alignment

Qi BiJingjun YiHaolan ZhanWei JiBo Du

Journal:   IEEE Transactions on Image Processing Year: 2026 Vol: PP Pages: 1-1
BOOK-CHAPTER

Structural Information-Guided Fine-Grained Texture Image Inpainting

Zhiyi FangYi QianXianjin Dai

Lecture notes in computer science Year: 2025 Pages: 100-113
JOURNAL ARTICLE

A semantic-driven image scene fine-grained enhancement recognition

Dongyang QuYaling LiXiaoyan LuoXiaofeng Shi

Journal:   Seventh Asia Pacific Conference on Optics Manufacture and 2021 International Forum of Young Scientists on Advanced Optical Manufacturing (APCOM and YSAOM 2021) Year: 2022 Vol: 86 Pages: 318-318
© 2026 ScienceGate Book Chapters — All rights reserved.