Skeleton based action recognition with convolutional neural network

Yong Du; Yun Fu; Liang Wang

doi:10.1109/acpr.2015.7486569

JOURNAL ARTICLE

Year: 2015 Pages: 579-583

Abstract

The temporal dynamics of postures are crucial for sequence-based action recognition. Human actions can be represented by the corresponding motions of an articulated skeleton. Most existing approaches to skeleton-based action recognition model the spatial-temporal evolution of actions using hand-crafted features. As a kind of hierarchically adaptive filter bank, the Convolutional Neural Network (CNN) performs well in representation learning. In this paper, we propose an end-to-end hierarchical architecture for skeleton-based action recognition with a CNN. First, we represent a skeleton sequence as a matrix by concatenating the joint coordinates at each instant into a vector and arranging those vectors in chronological order. The matrix is then quantized into an image and normalized to handle the variable-length problem. The final image is fed into a CNN model for feature extraction and recognition. Given the specific structure of such images, simple max-pooling plays an important role in both spatial feature selection and temporal frequency adjustment: it yields more discriminative joint information for different actions while also addressing the variable-frequency problem. Experimental results demonstrate that our method achieves state-of-the-art performance with high computational efficiency, in particular surpassing the existing result by more than 15 percentage points on the challenging ChaLearn gesture recognition dataset.
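
The sequence-to-image step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the single-channel (grayscale) output, the global min-max quantization, and the nearest-neighbour resampling along the time axis are all assumptions, since the abstract does not specify those details.

```python
from typing import List, Sequence

Row = List[float]  # concatenated joint coordinates for one instant


def sequence_to_matrix(frames: Sequence[Sequence[Sequence[float]]]) -> List[Row]:
    """Flatten each frame's joints [(x, y, z), ...] into one row, kept in time order."""
    return [[c for joint in frame for c in joint] for frame in frames]


def quantize(matrix: List[Row], levels: int = 256) -> List[List[int]]:
    """Map values to integer intensities in [0, levels - 1].

    Global min-max scaling is an assumption made for this sketch.
    """
    flat = [v for row in matrix for v in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # avoid division by zero on constant input
    return [[int(round((v - lo) / span * (levels - 1))) for v in row]
            for row in matrix]


def resize_time(image: List[List[int]], target_len: int) -> List[List[int]]:
    """Resample rows to a fixed length (nearest neighbour; an assumption)."""
    n = len(image)
    return [list(image[min(n - 1, (i * n) // target_len)])
            for i in range(target_len)]


def skeleton_to_image(frames, target_len: int = 8) -> List[List[int]]:
    """Hypothetical helper: sequence -> matrix -> quantized, fixed-length image."""
    return resize_time(quantize(sequence_to_matrix(frames)), target_len)


# Three frames, two joints each, with 3-D coordinates.
frames = [
    [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0)],
    [(0.5, 0.5, 0.5), (1.0, 1.0, 1.0)],
    [(1.0, 1.0, 1.0), (0.0, 0.0, 0.0)],
]
image = skeleton_to_image(frames, target_len=4)
```

Each row of `image` corresponds to one time step and each column to one joint coordinate, so sequences of any length end up as images of the same height before being passed to a CNN.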

Keywords:

Artificial intelligence; Computer science; Convolutional neural network; Discriminative model; Pattern recognition (psychology); Pooling; Feature extraction; Skeleton (computer programming); Feature (linguistics); Gesture recognition; Representation (politics); ENCODE; Computer vision; Gesture

Metrics

Cited By: 422

FWCI (Field Weighted Citation Impact): 12.52

Refs

Citation Normalized Percentile: 0.99

Is in top 1%

Is in top 10%

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Hand Gesture Recognition Systems

Physical Sciences → Computer Science → Human-Computer Interaction

Gait Recognition and Analysis

Physical Sciences → Engineering → Biomedical Engineering

Related Documents

Skeleton-based action recognition with convolutional neural networks

Skeleton Action Recognition Based on Double Residual Convolutional Neural Network

Human Action Recognition Based on Skeleton and Convolutional Neural Network

Two-Stream Convolutional Neural Network for Skeleton-Based Action Recognition

Skeleton-based Action Recognition with Multi-scale Spatial-temporal Convolutional Neural Network