JOURNAL ARTICLE

2-D Skeleton-Based Action Recognition via Two-Branch Stacked LSTM-RNNs

Danilo AvolaMarco CascioLuigi CinqueGian Luca ForestiCristiano MassaroniEmanuele Rodolà

Year: 2019 Journal:   IEEE Transactions on Multimedia Vol: 22 (10)Pages: 2481-2496   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Action recognition in video sequences is an inter-esting field for many computer vision applications, includingbehaviour analysis, event recognition, and video surveillance.In this work, a method based on 2D skeleton and two-branchstacked Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) cells is proposed. Unlike 3D skeletons,usually generated by RGB-D cameras, the 2D skeletons adoptedin this work are reconstructed starting from RGB video streams,therefore allowing the use of the proposed approach in bothindoor and outdoor environments. Moreover, any case of missingskeletal data is managed by exploiting 3D-Convolutional NeuralNetworks (3D-CNNs). Comparative experiments with severalkey works on KTH and Weizmann datasets show that themethod described in this paper outperforms the current state-of-the-art. Additional experiments on UCF Sports and IXMASdatasets demonstrate the effectiveness of our method in thepresence of noisy data and perspective changes, respectively.Further investigations on UCF Sports, HMDB51, UCF101, andKinetics400 highlight how the combination between the proposedtwo-branch stacked LSTM and the 3D-CNN-based network canmanage missing skeleton information, greatly improving theoverall accuracy. Moreover, additional tests on KTH and UCFSports datasets also show the robustness of our approach in thepresence of partial body occlusions. Finally, comparisons on UT-Kinect and NTU-RGB+D datasets show that the accuracy of theproposed method is fully comparable to that of works based on3D skeletons.

Keywords:
Computer science RGB color model Artificial intelligence Robustness (evolution) Convolutional neural network Pattern recognition (psychology) Recurrent neural network Skeleton (computer programming) Action recognition Deep learning Artificial neural network Class (philosophy)

Metrics

91
Cited By
4.81
FWCI (Field Weighted Citation Impact)
131
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gait Recognition and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Two-Branch Stacked Transformer for 2D Skeleton-based Action Recognition

Yerassyl ZhalgasbayevNguyen Anh Tu

Journal:   2023 17th International Conference on Ubiquitous Information Management and Communication (IMCOM) Year: 2023 Pages: 1-4
JOURNAL ARTICLE

Skeleton-based Action Recognition with Two-Branch Graph Convolutional Networks

Zhi LiuQici XieYunhua LuXian Wang

Journal:   Journal of Physics Conference Series Year: 2021 Vol: 2030 (1)Pages: 012091-012091
JOURNAL ARTICLE

Skeleton-Based Dumbbell Fitness Action Recognition Using Two-Stream LSTM Network

Mingzhou ShangQian HuangYiming WangXiang BianChuanxu JiangJiwen Liu

Journal:   2022 7th International Conference on Image, Vision and Computing (ICIVC) Year: 2022 Pages: 31-36
© 2026 ScienceGate Book Chapters — All rights reserved.