Human action recognition (HAR) has attracted the attention of researchers because of its widespread applicability. With the rapid development of deep learning technology, HAR has been greatly improved by deep features. However, many challenges remain, including a shortage of training samples, the effects of view variation, and ineffective spatial-temporal feature representations. To address these problems and further improve the accuracy of HAR, we propose a novel HAR method based on human skeletons and joints. First, a coordinate transformation is performed on the raw skeleton data to eliminate the influence of the camera position. Then, a data augmentation strategy is proposed to address the overfitting caused by an insufficient number of training samples. In our proposed method, a motion data structure named APoM, which is composed of the cross-frame distance vector, the specific angle, and the position vector, connects the movement of joints and the skeleton in the spatial and temporal dimensions and captures skeleton motion details. To evaluate the effectiveness of our method, experiments were conducted on two small-scale public datasets: Florence3D and UTKinect-Action3D. The experimental results show that the proposed method achieved competitive performance, with accuracies of 98.51% and 98.33%, respectively.
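To make the three APoM components concrete, the sketch below builds a simplified descriptor from a skeleton sequence: per-joint displacements between consecutive frames (cross-frame distance vectors), joint positions relative to a reference joint (position vectors, which also normalizes away a camera translation), and the angle between each joint's position vector and its next-frame displacement as one plausible choice for the "specific angle". The function name `apom_features`, the `(T, J, 3)` layout, and the particular angle definition are all assumptions, not the paper's exact formulation.

```python
import numpy as np

def apom_features(skeleton, ref_joint=0):
    """Simplified APoM-style descriptor (hypothetical layout, not the paper's exact definition).

    skeleton: array of shape (T, J, 3) -- T frames, J joints, 3-D coordinates.
    Returns (cross_frame, angle, position).
    """
    skeleton = np.asarray(skeleton, dtype=float)
    # Cross-frame distance vector: displacement of each joint between frames.
    cross_frame = skeleton[1:] - skeleton[:-1]                    # (T-1, J, 3)
    # Position vector: each joint relative to a reference joint (e.g. hip center),
    # which removes camera translation.
    position = skeleton - skeleton[:, ref_joint:ref_joint + 1, :]  # (T, J, 3)
    # Specific angle (assumed form): angle between a joint's position vector
    # and its displacement into the next frame.
    p, d = position[:-1], cross_frame
    cos = np.sum(p * d, axis=-1) / (
        np.linalg.norm(p, axis=-1) * np.linalg.norm(d, axis=-1) + 1e-8)
    angle = np.arccos(np.clip(cos, -1.0, 1.0))                    # (T-1, J)
    return cross_frame, angle, position
```

Stacking these three arrays over a sliding window would yield a fixed-size spatial-temporal input for a downstream classifier.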
Leiyang Xu, Qiang Wang, Xiaotian Lin, Lin Yuan, Xiang Ma
Qipeng Zhang, Tian Wang, Mingjie Zhang, Kexin Liu, Peng Shi, Hichem Snoussi