JOURNAL ARTICLE

Multi-modal Emotion Recognition Using Canonical Correlations and Acoustic Features

Abstract

The information of the psycho-physical state of the subject is becoming a valuable addition to the modern audio or video recognition systems. As well as enabling a better user experience, it can also assist in superior recognition accuracy of the base system. In the article, we present our approach to multi-modal (audio-video) emotion recognition system. For audio sub-system, a feature set comprised of prosodic, spectral and cepstrum features is selected and support vector classifier is used to produce the scores for each emotional category. For video sub-system a novel approach is presented, which does not rely on the tracking of specific facial landmarks and thus, eliminates the problems usually caused, if the tracking algorithm fails at detecting the correct area. The system is evaluated on the eNTERFACE database and the recognition accuracy of our audio-video fusion is compared to the published results in the literature.

Keywords:
Computer science Mel-frequency cepstrum Speech recognition Modal Artificial intelligence Classifier (UML) Cepstrum Feature extraction Pattern recognition (psychology) Emotion recognition Support vector machine Speaker recognition Feature (linguistics) Audio visual Multimedia

Metrics

34
Cited By
2.51
FWCI (Field Weighted Citation Impact)
17
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Audio-Visual Emotion Recognition System Using Multi-Modal Features

Anand HandaRashi AgarwalNarendra Kohli

Journal:   International Journal of Cognitive Informatics and Natural Intelligence Year: 2021 Vol: 15 (4)Pages: 1-14
JOURNAL ARTICLE

Audio-Visual Emotion Recognition System using Multi-Modal Features

Journal:   International Journal of Cognitive Informatics and Natural Intelligence Year: 2021 Vol: 15 (4)Pages: 0-0
JOURNAL ARTICLE

Multi-Modal Emotion Recognition Using Speech Features and Text-Embedding

Sung-Woo ByunJu-Hee KimSeok-Pil Lee

Journal:   Applied Sciences Year: 2021 Vol: 11 (17)Pages: 7967-7967
© 2026 ScienceGate Book Chapters — All rights reserved.