JOURNAL ARTICLE

Diversified Visual Attention Networks for Fine-Grained Object Classification

Bo ZhaoXiao WuJiashi FengQiang PengShuicheng Yan

Year: 2017 Journal:   IEEE Transactions on Multimedia Vol: 19 (6)Pages: 1245-1256   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Fine-grained object classification is a challenging task due to the subtle\ninter-class difference and large intra-class variation. Recently, visual\nattention models have been applied to automatically localize the discriminative\nregions of an image for better capturing critical difference and demonstrated\npromising performance. However, without consideration of the diversity in\nattention process, most of existing attention models perform poorly in\nclassifying fine-grained objects. In this paper, we propose a diversified\nvisual attention network (DVAN) to address the problems of fine-grained object\nclassification, which substan- tially relieves the dependency on\nstrongly-supervised information for learning to localize discriminative regions\ncompared with attentionless models. More importantly, DVAN explicitly pursues\nthe diversity of attention and is able to gather discriminative information to\nthe maximal extent. Multiple attention canvases are generated to extract\nconvolutional features for attention. An LSTM recurrent unit is employed to\nlearn the attentiveness and discrimination of attention canvases. The proposed\nDVAN has the ability to attend the object from coarse to fine granularity, and\na dynamic internal representation for classification is built up by\nincrementally combining the information from different locations and scales of\nthe image. Extensive experiments con- ducted on CUB-2011, Stanford Dogs and\nStanford Cars datasets have demonstrated that the proposed diversified visual\nattention networks achieve competitive performance compared to the state-\nof-the-art approaches, without using any prior knowledge, user interaction or\nexternal resource in training or testing.\n

Keywords:

Metrics

361
Cited By
19.57
FWCI (Field Weighted Citation Impact)
54
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Channel Interaction Attention Networks for Fine-Grained Visual Classification

Qiongjia ChenWenguo Wei

Journal:   2022 3rd International Conference on Computer Vision, Image and Deep Learning & International Conference on Computer Engineering and Applications (CVIDL & ICCEA) Year: 2022 Pages: 606-611
JOURNAL ARTICLE

Object-Part Attention Model for Fine-Grained Image Classification

Yuxin PengXiangteng HeJunjie Zhao

Journal:   IEEE Transactions on Image Processing Year: 2017 Vol: 27 (3)Pages: 1487-1500
© 2026 ScienceGate Book Chapters — All rights reserved.