JOURNAL ARTICLE

A Spatial-temporal Attention Module for 3D Convolution Network in Action Recognition

Shengwei ZhouLiang BaiHaoran WangZhihong DengXiaoming ZhuGong Cheng

Year: 2019 Journal:   DEStech Transactions on Computer Science and Engineering   Publisher: Destech Publications

Abstract

Action recognition is a significant but challenging task in the field of computer vision. 3D convolutional neural network is one of the mainstream methods for action recognition because it can process three-dimensional information effectively. However, at present, the performance of 3D convolutional neural networks is not particularly prominent. The main reason is that the information of the video is mainly contained in the key areas of key frames in the video, yet the 3D convolutional neural network usually cannot extract the most critical information in the video effectively. Therefore, we propose a temporal attention and a spatial attention respectively, and combine them into a module called STAM to let models focus more on the key information. We introduced the STAM module into 3D ResNet, and conducted experiments on the UCF101 and HMDB51 datasets. The results demonstrate that our proposed attention module can improve the performance of 3D convolutional neural networks effectively.

Keywords:
Computer science Convolutional neural network Key (lock) Convolution (computer science) Artificial intelligence Action recognition Focus (optics) Process (computing) Task (project management) Pattern recognition (psychology) Action (physics) Artificial neural network Class (philosophy)

Metrics

1
Cited By
0.11
FWCI (Field Weighted Citation Impact)
12
Refs
0.46
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gait Recognition and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Convolution spatial-temporal attention network for EEG emotion recognition

Lei CaoB. X. YuYilin DongTianyu LiuJie Li

Journal:   Physiological Measurement Year: 2024 Vol: 45 (12)Pages: 125003-125003
BOOK-CHAPTER

Spatial-Temporal Co-attention Network for Action Recognition

Shuren ZhouXiangli Zeng

Communications in computer and information science Year: 2020 Pages: 302-312
JOURNAL ARTICLE

Spatial-temporal saliency action mask attention network for action recognition

Min JiangNa PanJun Kong

Journal:   Journal of Visual Communication and Image Representation Year: 2020 Vol: 71 Pages: 102846-102846
© 2026 ScienceGate Book Chapters — All rights reserved.