JOURNAL ARTICLE

Depth Pooling Based Large-Scale 3-D Action Recognition With Convolutional Neural Networks

Pichao WangWanqing LiZhimin GaoChang TangPhilip Ogunbona

Year: 2018 Journal:   IEEE Transactions on Multimedia Vol: 20 (5)Pages: 1051-1061   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as dynamic depth images (DDI), dynamic depth normal images (DDNI), and dynamic depth motion normal images (DDMNI), for both isolated and continuous action recognition. These dynamic images are constructed from a segmented sequence of depth maps using hierarchical bidirectional rank pooling to effectively capture the spatial-temporal information. Specifically, DDI exploits the dynamics of postures over time, and DDNI and DDMNI exploit the 3-D structural information captured by depth maps. Upon the proposed representations, a convolutional neural network (ConvNet)-based method is developed for action recognition. The image-based representations enable us to fine-tune the existing ConvNet models trained on image data without training a large number of parameters from scratch. The proposed method achieved the state-of-art results on three large datasets, namely, the large-scale continuous gesture recognition dataset (means the Jaccard index 0.4109), the large-scale isolated gesture recognition dataset (59.21%), and the NTU RGB+D dataset (87.08% cross-subject and 84.22% cross-view) even though only the depth modality was used.

Keywords:
Computer science Artificial intelligence Convolutional neural network Jaccard index Pooling Pattern recognition (psychology) Gesture recognition RGB color model Exploit Feature extraction Pipeline (software) Computer vision Scale (ratio) Gesture

Metrics

176
Cited By
13.14
FWCI (Field Weighted Citation Impact)
85
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Diabetic Foot Ulcer Assessment and Management
Health Sciences →  Medicine →  Endocrinology, Diabetes and Metabolism
Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction
© 2026 ScienceGate Book Chapters — All rights reserved.