JOURNAL ARTICLE

Bidirectional Attention-Recognition Model for Fine-Grained Object Classification

Chuanbin LiuHongtao XieZheng-Jun ZhaLingyun YuZhineng ChenYongdong Zhang

Year: 2019 Journal:   IEEE Transactions on Multimedia Vol: 22 (7)Pages: 1785-1795   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Fine-grained object classification (FGOC) is a challenging research topic in multimedia computing with machine learning, which faces two pivotal conundrums: focusing attention on the discriminate part regions, and then processing recognition with the part-based features. Existing approaches generally adopt a unidirectional two-step structure, that first locate the discriminate parts and then recognize the part-based features. However, they neglect the truth that part localization and feature recognition can be reinforced in a bidirectional process. In this paper, we propose a novel bidirectional attention-recognition model (BARM) to actualize the bidirectional reinforcement for FGOC. The proposed BARM consists of one attention agent for discriminate part regions proposing and one recognition agent for feature extraction and recognition. Meanwhile, a feedback flow is creatively established to optimize the attention agent directly by recognition agent. Therefore, in BARM the attention agent and the recognition agent can reinforce each other in a bidirectional way and the overall framework can be trained end-to-end without neither object nor parts annotations. Moreover, a novel Multiple Random Erasing data augmentation is proposed, and it exhibits impressive pertinency and superiority for FGOC. Conducted on several extensive FGOC benchmarks, BARM outperforms the present state-of-the-art methods in classification accuracy. Furthermore, BARM exhibits a clear interpretability and keeps consistent with the human perception in visualization experiments.

Keywords:
Computer science Interpretability Artificial intelligence Feature extraction Object (grammar) Cognitive neuroscience of visual object recognition Visualization Pattern recognition (psychology) Feature (linguistics) Machine learning Perception Process (computing)

Metrics

64
Cited By
4.06
FWCI (Field Weighted Citation Impact)
93
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Object-Part Attention Model for Fine-Grained Image Classification

Yuxin PengXiangteng HeJunjie Zhao

Journal:   IEEE Transactions on Image Processing Year: 2017 Vol: 27 (3)Pages: 1487-1500
JOURNAL ARTICLE

Diversified Visual Attention Networks for Fine-Grained Object Classification

Bo ZhaoXiao WuJiashi FengQiang PengShuicheng Yan

Journal:   IEEE Transactions on Multimedia Year: 2017 Vol: 19 (6)Pages: 1245-1256
© 2026 ScienceGate Book Chapters — All rights reserved.