Skeletal Twins: Unsupervised Skeleton-Based Action Representation Learning

Haoyuan Zhang; Yonghong Hou; Wenjing Zhang

doi:10.1109/icme52920.2022.9859595

ScienceGate Book Chapters

JOURNAL ARTICLE

Skeletal Twins: Unsupervised Skeleton-Based Action Representation Learning

Haoyuan Zhang Yonghong Hou Wenjing Zhang

Year: 2022 Journal: 2022 IEEE International Conference on Multimedia and Expo (ICME) Pages: 1-6

DOI: 10.1109/icme52920.2022.9859595

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we investigate unsupervised representation learning for skeleton action recognition, and develop a simple yet effective framework: SKeletal Twins (SKT), which is capable of learning representations from unlabeled skeleton data. To be specific, we choose skeleton-specific spatial and temporal augmentations for spatio-temporal dynamics learning, then the augmented skeleton sequence is represented as a graph with both spatial and temporal edges so that the GCN-based twin encoders are able to encode human pose and joint's temporal motion. Barlow Twins' objective function is used to minimize the redundancy and keep similarity of different skeleton augmentations. However it ignores the instance-level consistency of the skeleton instance from different augmentations, thus an instance-level consistency-enhanced objective function is designed and jointly optimized, which boosts the representation learning. Extensive experiments verify that the proposed framework obtains the state-of-the-art results on the challenging NTU-60 and NTU-120 datasets.

Keywords:

Skeleton (computer programming) Computer science Artificial intelligence Feature learning Redundancy (engineering) Pattern recognition (psychology) Representation (politics) ENCODE Graph Unsupervised learning Encoder Consistency (knowledge bases) Action recognition Human skeleton Similarity (geometry) Image (mathematics) Theoretical computer science Class (philosophy)

Metrics

Cited By

0.41

FWCI (Field Weighted Citation Impact)

Refs

0.67

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Gait Recognition and Analysis

Physical Sciences → Engineering → Biomedical Engineering

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Skeletal Twins: Unsupervised Skeleton-Based Action Representation Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

Hierarchical Contrast for Unsupervised Skeleton-Based Action Representation Learning

Unsupervised skeleton-based action representation learning via relation consistency pursuit

Hierarchical Transformer: Unsupervised Representation Learning for Skeleton-Based Human Action Recognition

Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding