Motion representation plays a vital role in human action recognition. In recent years, the application of deep learning to action recognition has become popular; however, extracting accurate motion features remains a great challenge. In this study, a novel feature representation that combines multi-scale spatial-temporal features is proposed. This descriptor contains spatial-temporal information for three modes, extracted from three input channels: RGB images, RGB difference images, and binary XOR images. Specifically, a network consisting of a convolutional neural network (CNN) and long short-term memory (LSTM) extracts spatial-temporal features from the RGB images and the RGB difference images, respectively. In addition, global motion information is extracted from the binary XOR images using a separate CNN. The features from the three channels are then combined into a new video feature representation. Finally, an extreme learning machine (ELM) is adopted as the classifier. Experimental results on the UCF-50 dataset show the superiority of the proposed method.
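The two motion channels can be derived directly from consecutive frames. The following is a minimal sketch of one plausible construction of the RGB difference and binary XOR images using NumPy; the binarization threshold and function names are illustrative assumptions, not details specified by the paper.

```python
import numpy as np

def rgb_difference(prev, curr):
    """RGB difference image: per-pixel absolute change between two
    consecutive uint8 frames of shape (H, W, 3), capturing
    short-term local motion."""
    return np.abs(curr.astype(np.int16) - prev.astype(np.int16)).astype(np.uint8)

def binary_xor(prev, curr, thresh=128):
    """Binary XOR image: binarize each frame's grayscale intensity,
    then XOR the two binary maps so that pixels whose state flipped
    between frames mark regions of global motion.
    The threshold of 128 is an illustrative assumption."""
    bin_prev = prev.mean(axis=2) > thresh
    bin_curr = curr.mean(axis=2) > thresh
    return np.logical_xor(bin_prev, bin_curr).astype(np.uint8)

# Example on two synthetic 8x8 RGB frames
prev = np.zeros((8, 8, 3), dtype=np.uint8)
curr = np.full((8, 8, 3), 200, dtype=np.uint8)
diff_img = rgb_difference(prev, curr)   # uniform difference of 200
xor_img = binary_xor(prev, curr)        # all pixels flipped -> all ones
```

In a full pipeline, each channel's image sequence would feed its own branch (CNN+LSTM for RGB and RGB difference, a separate CNN for binary XOR) before the fused features reach the ELM classifier.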