JOURNAL ARTICLE

Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition

Abstract

Skeleton-based human action recognition is becoming increasingly important in a variety of fields. Most existing works train a CNN- or GCN-based backbone to extract spatial-temporal features and use temporal average or max pooling to aggregate the information. However, these pooling methods fail to capture high-order dynamics information. To address this problem, we propose a plug-and-play module called Koopman pooling, a parameterized high-order pooling technique based on Koopman theory. The Koopman operator linearizes a nonlinear dynamical system, thus providing a way to represent the complex system through a dynamics matrix, which can be used for classification. We also propose an eigenvalue normalization method to encourage the learned dynamics to be non-decaying and stable. In addition, we show that the Koopman pooling framework can be easily extended to one-shot action recognition when combined with Dynamic Mode Decomposition. The proposed method is evaluated on three benchmark datasets: NTU RGB+D 60, NTU RGB+D 120, and NW-UCLA. Our experiments demonstrate that Koopman pooling significantly improves performance under both full-dataset and one-shot settings.
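
The idea of representing a feature sequence by a linear dynamics matrix can be illustrated with a small NumPy sketch. This is not the authors' implementation: it assumes a plain least-squares (DMD-style) fit of a matrix K with x_{t+1} ≈ K x_t, and the function names `fit_dynamics_matrix` and `rescale_to_unit_modulus` are hypothetical. In particular, the rescaling step is only one simple way to obtain non-decaying, non-exploding dynamics and should not be read as the eigenvalue normalization proposed in the paper.

```python
import numpy as np


def fit_dynamics_matrix(features):
    """Least-squares (DMD-style) fit of K such that x_{t+1} ~= K x_t.

    features: array of shape (T, d), one d-dimensional feature per frame.
    """
    X = features[:-1].T  # (d, T-1): states x_0 .. x_{T-2}
    Y = features[1:].T   # (d, T-1): states x_1 .. x_{T-1}
    return Y @ np.linalg.pinv(X)


def rescale_to_unit_modulus(K, eps=1e-12):
    """Hypothetical normalization: move every eigenvalue onto the unit circle.

    Assumes K is diagonalizable; illustrative only, not the paper's method.
    """
    eigvals, eigvecs = np.linalg.eig(K)
    unit = eigvals / np.maximum(np.abs(eigvals), eps)
    return (eigvecs @ np.diag(unit) @ np.linalg.inv(eigvecs)).real


# Toy sequence: a decaying 2-D rotation, x_{t+1} = 0.9 * R(theta) x_t.
theta = 0.3
K_true = 0.9 * np.array([[np.cos(theta), -np.sin(theta)],
                         [np.sin(theta), np.cos(theta)]])
states = [np.array([1.0, 0.0])]
for _ in range(19):
    states.append(K_true @ states[-1])
features = np.stack(states)  # shape (20, 2)

K_hat = fit_dynamics_matrix(features)    # recovers the generating dynamics
K_norm = rescale_to_unit_modulus(K_hat)  # same rotation, decay removed
pooled = K_hat.reshape(-1)               # flattened matrix as a sequence descriptor
```

In this sketch the flattened matrix `pooled` plays the role that an average- or max-pooled vector would play in a conventional pipeline: it is a fixed-size summary of the whole sequence, but one that encodes how features evolve from frame to frame rather than only their aggregate values.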

Keywords:
Pooling, Dynamic mode decomposition, Computer science, Artificial intelligence, Benchmark (surveying), Pattern recognition (psychology), Eigenvalues and eigenvectors, Machine learning

Metrics

Cited By: 32
FWCI (Field Weighted Citation Impact): 5.82
Refs: 106
Citation Normalized Percentile: 0.96

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Gait Recognition and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Asynchronous Joint-Based Temporal Pooling for Skeleton-Based Action Recognition

Shanaka Ramesh Gunasekara, Wanqing Li, Jie Yang, Philip Ogunbona

Journal: IEEE Transactions on Circuits and Systems for Video Technology, Year: 2024, Vol: 35 (1), Pages: 357-366

JOURNAL ARTICLE

Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition

Peng Wang, Yuanzhouhan Cao, Chunhua Shen, Lingqiao Liu, Heng Tao Shen

Journal: IEEE Transactions on Circuits and Systems for Video Technology, Year: 2016, Vol: 27 (12), Pages: 2613-2622

JOURNAL ARTICLE

Adaptive Koopman contrastive learning for skeleton-based action recognition

Xiaohang Yu, Hui Miao, Chen Pang, Lei Lyu

Journal: Neurocomputing, Year: 2025, Vol: 658, Pages: 131750-131750

JOURNAL ARTICLE

Representation Learning of Temporal Dynamics for Skeleton-Based Action Recognition

Yong Du, Yun Fu, Liang Wang

Journal: IEEE Transactions on Image Processing, Year: 2016, Vol: 25 (7), Pages: 3010-3022