Combined 2D and 3D Convolution Residual Attention Network for Hand Gesture Recognition

Chang-Ting Tsai; Jian-Jiun Ding

doi:10.23919/apsipaasc55919.2022.9980075

Year: 2022
Journal: 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
Pages: 104-108

Abstract

Hand gesture recognition is a classical problem in human-computer interaction research. In this paper, a learning-based model is proposed for hand gesture recognition. The model takes RGB and depth channels as input. Segmenting the hand region is an important issue in recognizing a hand gesture. First, a patch embedding layer encodes all the frames as a set of patches. These encoded patches are then fed into a 3D convolution network, whose 3D convolution layers simultaneously learn the spatial and temporal features of the video. The 3D convolution network also contains an attention block, which enhances the crucial values in the feature maps. In addition, the encoded patches pass through a local decoder that recovers the depth frames of the video; this operation preserves the depth information in the encoded patches. Finally, a linear classifier is applied to the output of the 3D convolution network to obtain the gesture prediction. The method achieves 80.5% accuracy on the NV-Gesture dataset and 89.6% accuracy on the SKIG dataset.
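The abstract's statement that a 3D convolution learns spatial and temporal features at once can be illustrated with a small sketch. The code below is not the authors' network: it is a naive, single-channel 3D cross-correlation in plain Python, and the function name `conv3d_valid`, the toy clip, and the hand-picked kernel are all assumptions made for illustration. It shows only that a kernel with a temporal extent responds to change between frames, which a per-frame 2D kernel cannot see.

```python
from itertools import product


def conv3d_valid(video, kernel):
    """Naive single-channel 3D cross-correlation (stride 1, no padding).

    video  : nested list indexed as video[t][y][x]
    kernel : nested list indexed as kernel[dt][dy][dx]
    Returns a nested list of shape (T-kT+1, H-kH+1, W-kW+1).
    """
    T, H, W = len(video), len(video[0]), len(video[0][0])
    kT, kH, kW = len(kernel), len(kernel[0]), len(kernel[0][0])
    oT, oH, oW = T - kT + 1, H - kH + 1, W - kW + 1
    out = [[[0.0] * oW for _ in range(oH)] for _ in range(oT)]
    for t, y, x in product(range(oT), range(oH), range(oW)):
        acc = 0.0
        # The kernel window spans time (dt) as well as space (dy, dx).
        for dt, dy, dx in product(range(kT), range(kH), range(kW)):
            acc += video[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
        out[t][y][x] = acc
    return out


def frame(col):
    """Hypothetical 3x3 frame with one bright pixel in the middle row."""
    return [[1.0 if (y == 1 and x == col) else 0.0 for x in range(3)]
            for y in range(3)]


# Toy clip: the bright pixel moves one column to the right per frame.
video = [frame(0), frame(1), frame(2)]

# Hand-picked 2x1x1 temporal-difference kernel: next frame minus current.
kernel = [[[-1.0]], [[1.0]]]

response = conv3d_valid(video, kernel)
```

In `response`, positive values mark where the pixel arrives and negative values where it leaves, so the motion direction is encoded in a single feature map. In a trained network the kernel weights would be learned rather than fixed, and each layer would have many input and output channels.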

Keywords:

Computer science; Gesture; Artificial intelligence; Gesture recognition; Computer vision; RGB color model; Convolution (computer science); Feature (linguistics); Convolutional neural network; Embedding; Segmentation; Residual; Pattern recognition (psychology); ENCODE; Classifier (UML); Feature extraction; Speech recognition; Artificial neural network; Algorithm

Metrics

FWCI (Field Weighted Citation Impact): 0.24

Citation Normalized Percentile: 0.36

Topics

Hand Gesture Recognition Systems

Physical Sciences → Computer Science → Human-Computer Interaction

Human Pose and Action Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Hearing Impairment and Communication

Social Sciences → Psychology → Developmental and Educational Psychology

Related Documents

Convolution Neural Network based Hand Gesture Recognition System

Visual hand gesture recognition with convolution neural network

Hand gesture recognition based on convolution neural network

Res3ATN - Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos

Human hand gesture recognition using a convolution neural network