JOURNAL ARTICLE

Self-supervised 3D Skeleton Action Representation Learning with Motion Consistency and Continuity

Yukun SuGuosheng LinQingyao Wu

Year: 2021 Journal:   2021 IEEE/CVF International Conference on Computer Vision (ICCV) Pages: 13308-13318

Abstract

Recently, self-supervised learning (SSL) has been proved very effective and it can help boost the performance in learning representations from unlabeled data in the image domain. Yet, very little is explored about its usefulness in 3D skeleton-based action recognition understanding. Directly applying existing SSL techniques for 3D skeleton learning, however, suffers from trivial solutions and imprecise representations. To tackle these drawbacks, we consider perceiving the consistency and continuity of motion at different playback speeds are two critical issues. To this end, we propose a novel SSL method to learn the 3D skeleton representation in an efficacious way. Specifically, by constructing a positive clip (speed-changed) and a negative clip (motion-broken) of the sampled action sequence, we encourage the positive pairs closer while pushing the negative pairs to force the network to learn the intrinsic dynamic motion consistency information. Moreover, to enhance the learning features, skeleton interpolation is further exploited to model the continuity of human skeleton data. To validate the effectiveness of the proposed method, extensive experiments are conducted on Kinetics, NTU60, NTU120, and PKUMMD datasets with several alternative network architectures. Experimental evaluations demonstrate the superiority of our approach and through which, we can gain significant performance improvement without using extra labeled data.

Keywords:
Consistency (knowledge bases) Computer science Artificial intelligence Motion (physics) Representation (politics) Skeleton (computer programming) Action recognition Interpolation (computer graphics) Motion capture Action (physics) Human skeleton Pattern recognition (psychology) Machine learning Computer vision Class (philosophy)

Metrics

68
Cited By
3.36
FWCI (Field Weighted Citation Impact)
69
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gait Recognition and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Self-supervised action representation learning from partial consistency skeleton sequences

Biyun LinYinwei Zhan

Journal:   Neural Computing and Applications Year: 2024 Vol: 36 (20)Pages: 12385-12395
JOURNAL ARTICLE

Self-Supervised 3D Action Representation Learning With Skeleton Cloud Colorization

Siyuan YangJun LiuShijian LuM.H. ErYongjian HuAlex C. Kot

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2023 Vol: 46 (1)Pages: 509-524
JOURNAL ARTICLE

Spatiotemporal consistency enhancement self-supervised representation learning for action recognition

Shuai BiZhengping HuMengyao ZhaoShufang LiZhe Sun

Journal:   Signal Image and Video Processing Year: 2022 Vol: 17 (4)Pages: 1485-1492
JOURNAL ARTICLE

Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond

Jiahang ZhangLilang LinShuai YangJiaying Liu

Journal:   International Journal of Computer Vision Year: 2026 Vol: 134 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.