JOURNAL ARTICLE

Spatio-temporal Video Autoencoder for Human Action Recognition

Abstract

The demand for automatic systems for action recognition has increased significantly due to the development of surveillance cameras with high sampling rates, low cost, small size and high resolution. These systems can effectively support human operators to detect events of interest in video sequences, reducing failures and improving recognition results. In this work, we develop and analyze a method to learn two-dimensional (2D) representations from videos through an autoencoder framework. A multi-stream network is used to incorporate spatial and temporal information for action recognition purposes. Experiments conducted on the challenging UCF101 and HMDB51 data sets indicate that our representation is capable of achieving competitive accuracy rates compared to the literature approaches.

Keywords:
Autoencoder Computer science Artificial intelligence Action recognition Representation (politics) Pattern recognition (psychology) Action (physics) Key (lock) Sampling (signal processing) Machine learning Computer vision Artificial neural network

Metrics

3
Cited By
0.32
FWCI (Field Weighted Citation Impact)
0
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Human Action Recognition Based on a Spatio-Temporal Video Autoencoder

Anderson SantosHélio Pedrini

Journal:   International Journal of Pattern Recognition and Artificial Intelligence Year: 2019 Vol: 34 (11)Pages: 2040001-2040001
BOOK-CHAPTER

Dual-View Spatio-Temporal Interactive Network for Video Human Action Recognition

Hanbo WuXin Ma

Lecture notes in computer science Year: 2025 Pages: 217-231
JOURNAL ARTICLE

Exploiting spatio‐temporal knowledge for video action recognition

Huigang ZhangLiuan WangJun Sun

Journal:   IET Computer Vision Year: 2022 Vol: 17 (2)Pages: 222-230
© 2026 ScienceGate Book Chapters — All rights reserved.