Jianyang Xie, Yanda Meng, Yitian Zhao, Anh Nguyen, Xiaoyun Yang, Yalin Zheng
Human action recognition is an essential topic in computer vision and image processing. Graph convolutional networks (GCNs) have attracted significant attention and achieved noteworthy performance in skeleton-based human action recognition. However, most previous graph-based works refine the skeleton topology without considering the types of the different joints and edges or the occurrence order of the frames, which makes them insufficient to represent intrinsic semantic information. To address this, we propose a dynamic semantic-based spatial-temporal graph convolution network (DS-STGCN) with two dynamic semantic modules, one for the spatial context and one for the temporal context. Specifically, the joint and edge types are encoded implicitly in the spatial module, and the occurrence order of the frames is encoded implicitly in the temporal module. Extensive experiments on four datasets (NTU RGB+D 60, NTU RGB+D 120, Kinetics-400, and FineGYM) show that the two proposed semantic modules bring consistent recognition improvements across various backbones, and that DS-STGCN surpasses state-of-the-art methods on these datasets. Notably, on the more challenging Kinetics-400 dataset, our model outperforms other state-of-the-art GCN-based methods by a large margin. The code has been released at https://github.com/davelailai/DS-STGCN.
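The following is a minimal sketch, not the authors' released implementation (which is available at the GitHub link above), illustrating the general idea the abstract describes: injecting semantic information into skeleton features via learnable embeddings for joint type (spatial semantics) and frame position (temporal semantics) before graph convolution. The class and variable names here are hypothetical.

```python
import torch
import torch.nn as nn

class SemanticEmbedding(nn.Module):
    """Hypothetical sketch: add learnable joint-type and frame-order
    embeddings to skeleton features, so downstream spatial-temporal
    graph convolutions can distinguish joint types and frame order."""

    def __init__(self, num_joints: int, num_frames: int, channels: int):
        super().__init__()
        # One learnable vector per joint type (broadcast over frames)
        # and one per frame position (broadcast over joints).
        self.joint_type = nn.Parameter(torch.zeros(1, channels, 1, num_joints))
        self.frame_order = nn.Parameter(torch.zeros(1, channels, num_frames, 1))
        nn.init.normal_(self.joint_type, std=0.02)
        nn.init.normal_(self.frame_order, std=0.02)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, frames, joints) skeleton feature tensor.
        return x + self.joint_type + self.frame_order

# Usage: embed semantics into features ahead of a GCN block.
x = torch.randn(8, 64, 32, 25)      # 8 clips, 64 channels, 32 frames, 25 joints
sem = SemanticEmbedding(num_joints=25, num_frames=32, channels=64)
print(sem(x).shape)                  # torch.Size([8, 64, 32, 25])
```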