JOURNAL ARTICLE

Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network

Le WangJinliang ZangQilin ZhangZhenxing NiuGang HuaNanning Zheng

Year: 2018 Journal:   Sensors Vol: 18 (7)Pages: 1979-1979   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-aware Temporal Weighted CNN (ATW CNN) for action recognition in videos, which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW CNN framework is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experimental results on the UCF-101 and HMDB-51 datasets showed that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments.

Keywords:
Discriminative model Computer science Convolutional neural network Artificial intelligence Stochastic gradient descent Action recognition Weighting Pattern recognition (psychology) Visual attention Action (physics) Machine learning Artificial neural network Speech recognition Perception

Metrics

41
Cited By
3.90
FWCI (Field Weighted Citation Impact)
59
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction

Related Documents

BOOK-CHAPTER

Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition

Jinliang ZangLe WangZiyi LiuQilin ZhangGang HuaNanning Zheng

IFIP advances in information and communication technology Year: 2018 Pages: 97-108
JOURNAL ARTICLE

Spatio-Temporal Self-Attention Weighted VLAD Neural Network for Action Recognition

Shilei ChengMei XieZheng MaSiqi LiSong GuYang Feng

Journal:   IEICE Transactions on Information and Systems Year: 2020 Vol: E104.D (1)Pages: 220-224
JOURNAL ARTICLE

Double Attention Convolutional Neural Network for Driver Action Recognition

Ying AiJinxiang XiaKun SheQiuling Long

Journal:   2019 3rd International Conference on Electronic Information Technology and Computer Engineering (EITCE) Year: 2019
BOOK-CHAPTER

A Spatio-Temporal Convolutional Neural Network for Skeletal Action Recognition

Lizhang HuJinhua Xu

Lecture notes in computer science Year: 2017 Pages: 377-385
© 2026 ScienceGate Book Chapters — All rights reserved.