Yingying Chen, Yanfang Wang, Chang Li, Q. Li, Qian Huang
RGB video-based action recognition has many application scenarios because its rich appearance information supports accurate and robust performance. In recent years, convolutional neural networks have developed rapidly and achieved strong results in action recognition. However, they cannot adequately extract fine-grained information, and even when two modalities are used it is difficult for the learned spatio-temporal information to complement each other effectively. In this paper, we propose a dual-stream multi-scale fusion method. The method constructs different fine-grained representations of key features through a key feature extraction module and near-by fusion, further extracting and enhancing multi-scale information. In the multi-scale cross fusion, temporal gradients of motion information interact with the RGB video to strengthen complementarity between the modalities. The final result fuses multi-scale representations within modalities and higher-order similarities between modalities, yielding fine-grained learning of both appearance and motion. Compared with other commonly used methods, the proposed algorithm shows significant improvement on the UCF101 and HMDB51 datasets, achieving 94.12% and 72.55% accuracy, respectively.
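The abstract mentions two building blocks that can be illustrated concretely: temporal gradients (frame-to-frame differences) as a lightweight motion modality, and multi-scale pooling of feature maps before fusing the two streams. The sketch below is not the authors' implementation; the function names, the choice of simple average pooling, and the concatenation-based fusion are illustrative assumptions only.

```python
import numpy as np

def temporal_gradients(clip):
    # Approximate motion as frame-to-frame differences of an RGB clip.
    # clip: array of shape (T, H, W, C); returns (T-1, H, W, C).
    clip = clip.astype(np.float32)
    return clip[1:] - clip[:-1]

def multiscale_pool(feat, scales=(1, 2, 4)):
    # Average-pool a (H, W) feature map into an s x s grid for each scale,
    # then concatenate the flattened grids into one multi-scale vector.
    parts = []
    for s in scales:
        h = feat.shape[0] // s * s
        w = feat.shape[1] // s * s
        blocks = feat[:h, :w].reshape(s, h // s, s, w // s)
        parts.append(blocks.mean(axis=(1, 3)).ravel())
    return np.concatenate(parts)

# Toy example: a 4-frame clip of 8x8 RGB frames.
clip = np.random.rand(4, 8, 8, 3)
tg = temporal_gradients(clip)                  # shape (3, 8, 8, 3)

# Fuse one RGB channel and one temporal-gradient channel by simple
# concatenation of their multi-scale descriptors (a stand-in for the
# paper's cross-fusion step).
rgb_vec = multiscale_pool(clip[0, :, :, 0])    # 1 + 4 + 16 = 21 values
tg_vec = multiscale_pool(tg[0, :, :, 0])
fused = np.concatenate([rgb_vec, tg_vec])      # shape (42,)
```

Here each scale `s` summarizes the map at a different granularity, so the fused vector carries both coarse appearance statistics and finer local detail from the motion stream.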