JOURNAL ARTICLE

Skeletal Twins: Unsupervised Skeleton-Based Action Representation Learning

Haoyuan ZhangYonghong HouWenjing Zhang

Year: 2022 Journal:   2022 IEEE International Conference on Multimedia and Expo (ICME) Pages: 1-6

Abstract

In this paper, we investigate unsupervised representation learning for skeleton action recognition, and develop a simple yet effective framework: SKeletal Twins (SKT), which is capable of learning representations from unlabeled skeleton data. To be specific, we choose skeleton-specific spatial and temporal augmentations for spatio-temporal dynamics learning, then the augmented skeleton sequence is represented as a graph with both spatial and temporal edges so that the GCN-based twin encoders are able to encode human pose and joint's temporal motion. Barlow Twins' objective function is used to minimize the redundancy and keep similarity of different skeleton augmentations. However it ignores the instance-level consistency of the skeleton instance from different augmentations, thus an instance-level consistency-enhanced objective function is designed and jointly optimized, which boosts the representation learning. Extensive experiments verify that the proposed framework obtains the state-of-the-art results on the challenging NTU-60 and NTU-120 datasets.

Keywords:
Skeleton (computer programming) Computer science Artificial intelligence Feature learning Redundancy (engineering) Pattern recognition (psychology) Representation (politics) ENCODE Graph Unsupervised learning Encoder Consistency (knowledge bases) Action recognition Human skeleton Similarity (geometry) Image (mathematics) Theoretical computer science Class (philosophy)

Metrics

7
Cited By
0.41
FWCI (Field Weighted Citation Impact)
20
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gait Recognition and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.