JOURNAL ARTICLE

Temporal–Spatial Mapping for Action Recognition

Xiaolin SongCuiling LanWenjun ZengJunliang XingXiaoyan SunJingyu Yang

Year: 2019 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 30 (3)Pages: 748-759   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Deep learning models have enjoyed great success for image related computer vision tasks such as image classification and object detection. For video related tasks such as human action recognition, however, the advancements are not as significant yet. The main challenge is the lack of effective and efficient models in modeling the rich temporal-spatial information in a video. We introduce a simple yet effective operation, termed temporal-spatial mapping, for capturing the temporal evolution of the frames by jointly analyzing all the frames of a video. We propose a video level 2D feature representation by transforming the convolutional features of all frames to a 2D feature map, referred to as VideoMap. With each row being the vectorized feature representation of a frame, the temporal-spatial features are compactly represented, while the temporal dynamic evolution is also well embedded. Based on the VideoMap representation, we further propose a temporal attention model within a shallow convolutional neural network to efficiently exploit the temporal-spatial dynamics. The experiment results show that the proposed scheme achieves state-of-the-art performance, with 4.2% accuracy gain over the temporal segment network, a competing baseline method, on the challenging human action benchmark dataset HMDB51.

Keywords:
Computer science Artificial intelligence Benchmark (surveying) Convolutional neural network Feature (linguistics) Pattern recognition (psychology) Representation (politics) Exploit Feature learning Frame (networking) Feature extraction Object (grammar) Computer vision Geography

Metrics

60
Cited By
3.85
FWCI (Field Weighted Citation Impact)
69
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Spatial-Temporal Attention for Action Recognition

Dengdi SunHanqing WuZhuanlian DingBin LuoJin Tang

Lecture notes in computer science Year: 2018 Pages: 854-864
BOOK-CHAPTER

Spatial-Temporal Neural Networks for Action Recognition

Chao JingPing WeiHongbin SunNanning Zheng

IFIP advances in information and communication technology Year: 2018 Pages: 619-627
JOURNAL ARTICLE

Spatial-temporal Ensemble Method for Action Recognition

Minseok SeoSang-Woo LeeDong‐Geol Choi

Journal:   The Journal of Korea Robotics Society Year: 2020 Vol: 15 (4)Pages: 385-391
© 2026 ScienceGate Book Chapters — All rights reserved.