Attention-Augmented Memory Network for Image Multi-Label Classification

Wei Zhou; Yanke Hou; Dihu Chen; Haifeng Hu; Tao Su

doi:10.1145/3570166

ScienceGate Book Chapters

JOURNAL ARTICLE

Attention-Augmented Memory Network for Image Multi-Label Classification

Wei Zhou Yanke Hou Dihu Chen Haifeng Hu Tao Su

Year: 2022 Journal: ACM Transactions on Multimedia Computing Communications and Applications Vol: 19 (3)Pages: 1-24 Publisher: Association for Computing Machinery

DOI: 10.1145/3570166

Get Full-Text PDF Get Analytical Report

Abstract

The purpose of image multi-label classification is to predict all the object categories presented in an image. Some recent works exploit graph convolution network to capture the correlation between labels. Although promising results have been reported, these methods cannot learn salient object features in the images and ignore the correlation between channel feature maps. In addition, the current researches only learn the feature information within individual input image, but fail to mine the contextual information of various categories from the dataset to enhance the input feature representation. To address these issues, we propose an A ttention- A ugmented M emory N etwork ( AAMN ) model for the image multi-label classification task. Specifically, we first propose a novel categorical memory module to excavate the contextual information of various categories from the dataset to augment the current input feature. Secondly, we design a new channel-relation exploration module to capture the inter-channel relationship of features, so as to enhance the correlation between objects in the images. Thirdly, we develop a spatial-relation enhancement module to model second-order statistics of features and capture long-range dependencies between pixels in feature maps, so as to learn salient object features. Experimental results on standard benchmarks, including MS-COCO 2014, PASCAL VOC 2007, and VG-500, demonstrate the effectiveness and superiority of AAMN model, which outperforms current state-of-the-art methods.

Keywords:

Computer science Feature (linguistics) Artificial intelligence Categorical variable Pattern recognition (psychology) Salient Graph Pascal (unit) Relation (database) Pixel Convolutional neural network Correlation Image (mathematics) Exploit Data mining Machine learning Mathematics Theoretical computer science

Metrics

Cited By

1.36

FWCI (Field Weighted Citation Impact)

Refs

0.79

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Attention-Augmented Memory Network for Image Multi-Label Classification

Abstract

Metrics

Citation History

Topics

Related Documents

Multi-Label Image Classification by Feature Attention Network

Graph Attention Transformer Network for Multi-label Image Classification

Label-Guided Cross-Modal Attention Network for Multi-Label Aerial Image Classification

Double Attention Based on Graph Attention Network for Image Multi-Label Classification

Double Attention for Multi-Label Image Classification