JOURNAL ARTICLE

Fusing multi-modal features for gesture recognition

Abstract

This paper proposes a novel multi-modal gesture recognition framework and introduces its application to continuous sign language recognition. A Hidden Markov Model is used to construct the audio feature classifier. A skeleton feature classifier is trained to provided complementary information based on the Dynamic Time Warping model. The confidence scores generated by two classifiers are firstly normalized and then combined to produce a weighted sum for the final recognition. Experimental results have shown that the precision and recall scores for 20 classes of our multi-modal recognition framework can achieve 0.8829 and 0.8890 respectively, which proves that our method is able to correctly reject false detection caused by single classifier. Our approach scored 0.12756 in mean Levenshtein distance and was ranked 1st in the Multi-modal Gesture Recognition Challenge in 2013.

Keywords:
Computer science Gesture recognition Modal Classifier (UML) Hidden Markov model Artificial intelligence Pattern recognition (psychology) Dynamic time warping Gesture Levenshtein distance Speech recognition Feature extraction Sign language Feature (linguistics)

Metrics

61
Cited By
7.71
FWCI (Field Weighted Citation Impact)
22
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Hand Gesture Recognition Systems
Physical Sciences →  Computer Science →  Human-Computer Interaction
Hearing Impairment and Communication
Social Sciences →  Psychology →  Developmental and Educational Psychology
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Stream Action Recognition Network Fusing Multi-Modal Features

彬彬 张

Journal:   Computer Science and Application Year: 2021 Vol: 11 (02)Pages: 451-460
JOURNAL ARTICLE

Multi-Modal Emotion Recognition by Fusing Correlation Features of Speech-Visual

Guanghui ChenXiaoping Zeng

Journal:   IEEE Signal Processing Letters Year: 2021 Vol: 28 Pages: 533-537
JOURNAL ARTICLE

ModDrop: Adaptive Multi-Modal Gesture Recognition

Natalia NeverovaChristian WolfGraham W. TaylorFlorian Nebout

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2015 Vol: 38 (8)Pages: 1692-1706
© 2026 ScienceGate Book Chapters — All rights reserved.