Hierarchical Graph Convolutional Skeleton Transformer for Action Recognition

Ruwen Bai; Min Li; Bo Meng; Fengfa Li; Miao Jiang; Junxing Ren; Degang Sun

doi:10.1109/icme52920.2022.9859781

ScienceGate Book Chapters

JOURNAL ARTICLE

Hierarchical Graph Convolutional Skeleton Transformer for Action Recognition

Ruwen Bai Min Li Bo Meng Fengfa Li Miao Jiang Junxing Ren Degang Sun

Year: 2022 Journal: 2022 IEEE International Conference on Multimedia and Expo (ICME) Pages: 01-06

DOI: 10.1109/icme52920.2022.9859781

Get Full-Text PDF Get Analytical Report

Abstract

Graph convolutional networks (GCNs) have emerged as dom-inant methods for skeleton-based action recognition. How-ever, they still suffer from two problems, namely, neighbor-hood constraints and entangled spatiotemporal feature repre-sentations. Most studies have focused on improving the de-sign of graph topology to solve the first problem but they have yet to fully explore the latter. In this work, we design a dis-entangled spatiotemporal transformer (DSTT) block to over-come the above limitations of GCNs in three steps: (i) feature disentanglement for spatiotemporal decomposition; (ii) global spatiotemporal attention for capturing correlations in the global context; and (iii) local information enhancement for utilizing more local information. Thereon, we propose a novel architecture, named Hierarchical Graph Convolutional skeleton Transformer (HGCT), to employ the complementary advantages of GCN (i.e., local topology, temporal dynamics and hierarchy) and Transformer (i.e., global context and dy-namic attention). HGCT is lightweight and computationally efficient. Quantitative analysis demonstrates the superiority and good interpretability of HGCT.

Keywords:

Interpretability Computer science Transformer Artificial intelligence Action recognition Graph Theoretical computer science Pattern recognition (psychology) Topology (electrical circuits) Mathematics Engineering

Metrics

Cited By

3.04

FWCI (Field Weighted Citation Impact)

Refs

0.93

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Hand Gesture Recognition Systems

Physical Sciences → Computer Science → Human-Computer Interaction

Hierarchical Graph Convolutional Skeleton Transformer for Action Recognition

Abstract

Metrics

Citation History

Topics

Related Documents

Deformable graph convolutional transformer for skeleton-based action recognition

Hierarchical Graph Convolutional Network for Skeleton-Based Action Recognition

Structure-Aware Multi-scale Hierarchical Graph Convolutional Network for Skeleton Action Recognition

Language Guided Graph Transformer for Skeleton Action Recognition

A Graph Skeleton Transformer Network for Action Recognition