Tianming Zhuang, Zhen Qin, Yi Ding, Fuhu Deng, Leduo Chen, Zhiguang Qin, Kim‐Kwang Raymond Choo
Human skeleton data, widely used in human activity recognition, is among the most representative biometric characteristics owing to its intuitiveness and visual interpretability. State-of-the-art approaches mainly focus on improving the modeling of spatial correlations within graph topologies. However, inter-frame motion representations are also of vital importance, and we argue that they deserve attention and exploration. Therefore, a temporal refinement module with a contrastive learning mechanism is proposed and fused as a complement to the conventional spatial graph convolution layer. In addition, to further exploit inter-frame variances and generalize the graph convolutional network (GCN) operation to the temporal dimension, a temporal-correlation matrix is introduced to effectively capture dynamic dependencies within frame pairs, enhancing semantic feature representation. Moreover, since GCN is a typical local operator that cannot fully model long-term relations along spatial and temporal variations, a spatial-temporal cascaded aggregation (STCA) module is designed to enlarge the receptive field. The overall recognition framework combines these three novelties and achieves remarkable performance on benchmark datasets (i.e., NTU RGB+D 60, NTU RGB+D 120, PKU-MMD, and Kinetics Skeleton 400). Extensive experiments demonstrate the effectiveness of the proposed framework, e.g., recognition accuracies of 90.9% and 96.8% on NTU RGB+D 60 and 87.9% and 88.9% on NTU RGB+D 120.
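The idea of generalizing a GCN-style aggregation to the temporal dimension via a temporal-correlation matrix can be sketched as follows. This is a minimal illustration, not the paper's exact formulation: the per-frame feature shapes, the dot-product affinity with softmax normalization, and the pooling of joints into a single per-frame vector are all assumptions made for clarity.

```python
import numpy as np

def softmax(z, axis=-1):
    # Numerically stable row-wise softmax.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def temporal_gcn_layer(x, w):
    """GCN-style update along the temporal axis (illustrative sketch).

    x: (T, C) per-frame skeleton features (joints assumed already pooled)
    w: (C, C_out) learnable projection
    Returns (T, C_out) features aggregated over correlated frames.
    """
    # Temporal-correlation matrix: pairwise frame-pair affinities,
    # scaled and normalized so each frame's weights sum to one.
    a = softmax(x @ x.T / np.sqrt(x.shape[1]))  # (T, T)
    # Aggregate features across correlated frames, then project + ReLU.
    return np.maximum(a @ x @ w, 0.0)

# Toy example: 4 frames, 8-dim features, projected to 16 dims.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))
w = rng.standard_normal((8, 16)) * 0.1
y = temporal_gcn_layer(x, w)
print(y.shape)  # (4, 16)
```

Each row of the temporal-correlation matrix weights how strongly every other frame contributes to the current frame's updated representation, mirroring how a spatial GCN aggregates over adjacent joints.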