JOURNAL ARTICLE

Combined 2D and 3D Convolution Residual Attention Network for Hand Gesture Recognition

Chang-Ting TsaiJian–Jiun Ding

Year: 2022 Journal:   2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) Pages: 104-108

Abstract

Hand gesture recognition is a classical problem in human-computer interaction research. In this paper, a learning-based model is proposed for hand gesture recognition. Our model receives RGB and depth channels input. To recognize the hand gesture, the segmentation of hand region is the important issue. At first, we apply patch embedding layer to encode all the frames as several patches. Then, these encoded patches are fed into 3D convolution network. The 3D convolution layer can simultaneously learn the spatial and temporal feature of the video. The 3D convolution network also contains attention block, which is used to enhance the crucial feature map value. Besides, the encoded patches pass through the local decoder to recover the depth frames of the video. This operation can preserve the depth information in encoded patches. At last, we perform the linear classifier for the output of 3D convolution network to get the result of hand gesture. Our method achieves 80.5% accuracy in the NV-Gesture dataset and 89.6% accuracy in the SKIG dataset.

Keywords:
Computer science Gesture Artificial intelligence Gesture recognition Computer vision RGB color model Convolution (computer science) Feature (linguistics) Convolutional neural network Embedding Segmentation Residual Pattern recognition (psychology) ENCODE Classifier (UML) Feature extraction Speech recognition Artificial neural network Algorithm

Metrics

2
Cited By
0.24
FWCI (Field Weighted Citation Impact)
18
Refs
0.36
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Hearing Impairment and Communication
Social Sciences →  Psychology →  Developmental and Educational Psychology
© 2026 ScienceGate Book Chapters — All rights reserved.