JOURNAL ARTICLE

Metric Learning Based Multimodal Audio-visual Emotion Recognition

Esam Ghaleb, Mirela Popa, Stylianos Asteriadis

Year: 2019   Journal: IEEE Multimedia   Pages: 1-1   Publisher: IEEE Computer Society

Abstract

People express their emotions through multiple channels, such as visual and audio ones. Consequently, automatic emotion recognition can benefit significantly from multimodal learning. Even though each modality exhibits unique characteristics, multimodal learning takes advantage of the complementary information of diverse modalities when measuring the same instance, resulting in an enhanced understanding of emotions. Yet the dependencies and relations between modalities are not fully exploited in audio–video emotion recognition. Furthermore, learning an effective metric through multimodality is a crucial goal for many applications in machine learning. Therefore, in this article, we propose multimodal emotion recognition metric learning (MERML), learned jointly to obtain a discriminative score and a robust representation in a latent space for both modalities. The learned metric is efficiently used through the radial basis function (RBF) based support vector machine (SVM) kernel. The evaluation of our framework shows strong performance, improving on the state-of-the-art results on the eNTERFACE and CREMA-D datasets.
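To illustrate the general idea of using a learned metric inside an RBF kernel, the following is a minimal sketch and not the authors' MERML implementation. It assumes the learned metric takes a Mahalanobis-style form M = L^T L, where the projection matrix L, the toy feature vectors, and the value of gamma are all hypothetical placeholders chosen for illustration. The abstract does not specify how the metric is parameterized or trained.

```python
import math


def project(L, x):
    """Apply a linear map L (given as a list of rows) to the vector x."""
    return [sum(l_ij * x_j for l_ij, x_j in zip(row, x)) for row in L]


def metric_rbf_kernel(L, x, y, gamma=1.0):
    """RBF kernel under the assumed metric M = L^T L:
    k(x, y) = exp(-gamma * ||L x - L y||^2)."""
    px, py = project(L, x), project(L, y)
    sq_dist = sum((a - b) ** 2 for a, b in zip(px, py))
    return math.exp(-gamma * sq_dist)


def gram_matrix(L, X, gamma=1.0):
    """Pairwise kernel values for all samples in X."""
    return [[metric_rbf_kernel(L, xi, xj, gamma) for xj in X] for xi in X]


# Hypothetical projection: keeps feature dimensions 0 and 2, ignores 1 and 3.
L_toy = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]

# Hypothetical fused audio-visual feature vectors (made-up numbers).
X_toy = [
    [0.0, 5.0, 0.0, -3.0],
    [0.0, -2.0, 0.0, 7.0],
    [1.0, 0.0, 1.0, 0.0],
]

K = gram_matrix(L_toy, X_toy, gamma=0.5)
```

In this toy setup the first two samples differ only in the dimensions that L_toy ignores, so the kernel treats them as identical. A Gram matrix of this kind could then be handed to any SVM implementation that accepts precomputed kernels, for example scikit-learn's SVC with kernel="precomputed".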

Keywords:
Computer science; Multimodal learning; Discriminative model; Modalities; Metric (unit); Multimodality; Artificial intelligence; Support vector machine; Modality (human–computer interaction); Machine learning; Representation (politics); Kernel (algebra); Pattern recognition (psychology); Speech recognition

Metrics

Cited By: 56
FWCI (Field Weighted Citation Impact): 6.87
Refs: 27
Citation Normalized Percentile: 0.96
Is in top 1%
Is in top 10%

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing