JOURNAL ARTICLE

A Two-Pathway Convolutional Neural Network with Temporal Pyramid Network for Action Recognition

Abstract

In order to solve the problem of imperfect capture of visual rhythm in action recognition, this paper proposes a novel model which is a combination of a two-pathway network and temporal pyramid networks (TPNs). Specifically, our work involves two aspects, on the one hand, we integrate TPNs into the fast pathway and the slow pathway of SlowFast network to capture multi-level features, and then merge the prediction results of the two pathways in the final recognition stage, which boosts performance of our network by enhancing the semantics extraction at input layer and feature layer. On the other hand, we apply a ConvLSTM module to improve the capability of temporal modeling in TPN, which can further strengthen the capture of features in the long-term dimensions, and the advanced TPN promotes the fusion of temporal and spatial features. Experiments on the Kinetics-400 dataset demonstrate the superiority of our novel architecture combining two-pathway network and advanced TPN in action recognition.

Keywords:
Computer science Artificial intelligence Merge (version control) Convolutional neural network Feature extraction Pyramid (geometry) Action recognition Recurrent neural network Pattern recognition (psychology) Artificial neural network Information retrieval

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
23
Refs
0.18
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Gait Recognition and Analysis
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition

Peng WangYuanzhouhan CaoChunhua ShenLingqiao LiuHeng Tao Shen

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2016 Vol: 27 (12)Pages: 2613-2622
BOOK-CHAPTER

A Spatio-Temporal Convolutional Neural Network for Skeletal Action Recognition

Lizhang HuJinhua Xu

Lecture notes in computer science Year: 2017 Pages: 377-385
BOOK-CHAPTER

Attention-Based Temporal Weighted Convolutional Neural Network for Action Recognition

Jinliang ZangLe WangZiyi LiuQilin ZhangGang HuaNanning Zheng

IFIP advances in information and communication technology Year: 2018 Pages: 97-108
© 2026 ScienceGate Book Chapters — All rights reserved.