JOURNAL ARTICLE

Attention-Guided Spatial Transformer Networks for Fine-Grained Visual Recognition

Dichao LiuYu WangJien Kato

Year: 2019 Journal:   IEICE Transactions on Information and Systems Vol: E102.D (12)Pages: 2577-2586   Publisher: Institute of Electronics, Information and Communication Engineers

Abstract

The aim of this paper is to propose effective attentional regions for fine-grained visual recognition. Based on the Spatial Transformers' capability of spatial manipulation within networks, we propose an extension model, the Attention-Guided Spatial Transformer Networks (AG-STNs). This model can guide the Spatial Transformers with hard-coded attentional regions at first. Then such guidance can be turned off, and the network model will adjust the region learning in terms of the location and scale. Such adjustment is conditioned to the classification loss so that it is actually optimized for better recognition results. With this model, we are able to successfully capture detailed attentional information. Also, the AG-STNs are able to capture attentional information in multiple levels, and different levels of attentional information are complementary to each other in our experiments. A fusion of them brings better results.

Keywords:
Computer science Transformer Artificial intelligence Pattern recognition (psychology) Electrical engineering Voltage

Metrics

2
Cited By
0.21
FWCI (Field Weighted Citation Impact)
38
Refs
0.56
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.