JOURNAL ARTICLE

Attention-Augmented Memory Network for Image Multi-Label Classification

Wei Zhou, Yanke Hou, Dihu Chen, Haifeng Hu, Tao Su

Year: 2022 Journal: ACM Transactions on Multimedia Computing Communications and Applications Vol: 19 (3) Pages: 1-24 Publisher: Association for Computing Machinery

Abstract

The purpose of image multi-label classification is to predict all the object categories present in an image. Some recent works exploit graph convolutional networks to capture the correlation between labels. Although promising results have been reported, these methods cannot learn salient object features in the images and ignore the correlation between channel feature maps. In addition, current research only learns the feature information within an individual input image and fails to mine the contextual information of various categories from the dataset to enhance the input feature representation. To address these issues, we propose an Attention-Augmented Memory Network (AAMN) model for the image multi-label classification task. Specifically, we first propose a novel categorical memory module to excavate the contextual information of various categories from the dataset to augment the current input feature. Second, we design a new channel-relation exploration module to capture the inter-channel relationship of features, so as to enhance the correlation between objects in the images. Third, we develop a spatial-relation enhancement module to model second-order statistics of features and capture long-range dependencies between pixels in feature maps, so as to learn salient object features. Experimental results on standard benchmarks, including MS-COCO 2014, PASCAL VOC 2007, and VG-500, demonstrate the effectiveness and superiority of the AAMN model, which outperforms current state-of-the-art methods.
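As a rough illustration of what "capturing the inter-channel relationship of features" can look like, the sketch below implements a generic channel self-attention step in plain Python. It is not the AAMN channel-relation exploration module, whose internals the abstract does not describe; the names `softmax` and `channel_relation`, the dot-product affinity, and the residual sum are all assumptions chosen for illustration.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def channel_relation(features):
    """Generic channel self-attention (hypothetical, not the paper's module).

    features: list of C channels, each a flattened feature map of N floats.
    Returns (augmented features with the same shape, C x C attention weights).
    """
    n_channels = len(features)
    n_pixels = len(features[0])
    # C x C affinity: dot product between every pair of channel maps.
    affinity = [[sum(a * b for a, b in zip(ci, cj)) for cj in features]
                for ci in features]
    # Normalize each row so the weights over channels sum to 1.
    weights = [softmax(row) for row in affinity]
    out = []
    for i in range(n_channels):
        # Weighted mix of all channel maps for channel i.
        mixed = [sum(weights[i][j] * features[j][p] for j in range(n_channels))
                 for p in range(n_pixels)]
        # Residual connection keeps the original signal.
        out.append([x + m for x, m in zip(features[i], mixed)])
    return out, weights


# Toy input: 3 channels, 2 pixels each.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
augmented, weights = channel_relation(features)
```

Each row of `weights` describes how strongly one channel attends to every other channel, so channels whose maps respond to co-occurring objects reinforce each other in `augmented`.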

Keywords:
Computer science, Feature (linguistics), Artificial intelligence, Categorical variable, Pattern recognition (psychology), Salient, Graph, Pascal (unit), Relation (database), Pixel, Convolutional neural network, Correlation, Image (mathematics), Exploit, Data mining, Machine learning, Mathematics, Theoretical computer science

Metrics

Cited By: 11
FWCI (Field Weighted Citation Impact): 1.36
Refs: 72
Citation Normalized Percentile: 0.79

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Label Image Classification by Feature Attention Network

Zheng Yan, Weiwei Liu, Shiping Wen, Yin Yang

Journal: IEEE Access Year: 2019 Vol: 7 Pages: 98005-98013
JOURNAL ARTICLE

Graph Attention Transformer Network for Multi-label Image Classification

Jin Yuan, Shikai Chen, Yao Zhang, Zhongchao Shi, Xin Geng, Jianping Fan, Yong Rui

Journal: ACM Transactions on Multimedia Computing Communications and Applications Year: 2022 Vol: 19 (4) Pages: 1-16
JOURNAL ARTICLE

Label-Guided Cross-Modal Attention Network for Multi-Label Aerial Image Classification

Ying Chen, Ding Zhang, Tao Han, Xiaoliang Meng, Mianxin Gao, Teng Wang

Journal: IEEE Geoscience and Remote Sensing Letters Year: 2024 Vol: 21 Pages: 1-5
JOURNAL ARTICLE

Double Attention Based on Graph Attention Network for Image Multi-Label Classification

Wei Zhou, Zhiwu Xia, Peng Dou, Tao Su, Haifeng Hu

Journal: ACM Transactions on Multimedia Computing Communications and Applications Year: 2022 Vol: 19 (1) Pages: 1-23
JOURNAL ARTICLE

Double Attention for Multi-Label Image Classification

Haiying Zhao, Wei Zhou, Xiaogang Hou, Hui Zhu

Journal: IEEE Access Year: 2020 Vol: 8 Pages: 225539-225550