Bo Zhao, Xiao Wu, Jiashi Feng, Qiang Peng, Shuicheng Yan
Fine-grained object classification is a challenging task due to the subtle inter-class differences and large intra-class variation. Recently, visual attention models have been applied to automatically localize the discriminative regions of an image to better capture critical differences, and have demonstrated promising performance. However, without considering diversity in the attention process, most existing attention models perform poorly in classifying fine-grained objects. In this paper, we propose a diversified visual attention network (DVAN) to address the problem of fine-grained object classification, which substantially relieves the dependency on strongly-supervised information for learning to localize discriminative regions compared with attention-less models. More importantly, DVAN explicitly pursues diversity of attention and is able to gather discriminative information to the maximal extent. Multiple attention canvases are generated to extract convolutional features for attention. An LSTM recurrent unit is employed to learn the attentiveness and discrimination of the attention canvases. The proposed DVAN has the ability to attend to the object from coarse to fine granularity, and a dynamic internal representation for classification is built up by incrementally combining information from different locations and scales of the image. Extensive experiments conducted on the CUB-2011, Stanford Dogs and Stanford Cars datasets demonstrate that the proposed diversified visual attention network achieves competitive performance compared to state-of-the-art approaches, without using any prior knowledge, user interaction or external resources in training or testing.
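The abstract describes the core mechanism: several attention canvases (crops or scales of the image) each yield convolutional features, soft attention pools each canvas, and the pooled features are combined incrementally into one representation. The following NumPy sketch illustrates only that high-level idea; all function names, shapes, and the running-mean combination step are assumptions for illustration (the paper combines canvases with an LSTM), not the authors' implementation.

```python
import numpy as np

def soft_attention_pool(feature_map, query):
    """Pool an (H*W, D) feature map via softmax attention scored against `query`."""
    scores = feature_map @ query                     # one score per spatial location
    scores = scores - scores.max()                   # shift for numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()  # softmax over locations
    return weights @ feature_map                     # (D,) attended feature vector

def combine_canvases(canvas_features, query):
    """Incrementally combine attended features across attention canvases.

    A running mean stands in for the paper's LSTM update, purely to show
    how a representation can be built up canvas by canvas.
    """
    state = np.zeros_like(query, dtype=float)
    for t, fmap in enumerate(canvas_features, start=1):
        attended = soft_attention_pool(fmap, query)
        state = state + (attended - state) / t       # running-mean update
    return state

# Toy usage: three hypothetical 7x7 canvases with 8-dim features.
rng = np.random.default_rng(0)
canvases = [rng.standard_normal((49, 8)) for _ in range(3)]
query = rng.standard_normal(8)
representation = combine_canvases(canvases, query)
print(representation.shape)  # (8,)
```

The running-mean update is deliberately the simplest stateful combiner; swapping it for a recurrent cell recovers the incremental, multi-scale aggregation the abstract describes.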
Daoyuan Chen, Lei Fan, Shuangshuang Wang, Xiaofan Yu, Bin Kang
Chuanbin Liu, Hongtao Xie, Zheng-Jun Zha, Lingyun Yu, Zhineng Chen, Yongdong Zhang
Yuxin Peng, Xiangteng He, Junjie Zhao
Rujia Li, Junya Liu, Zhen Yang, Xin Zhou, Zhijian Yin