JOURNAL ARTICLE

Multimodal Emotion Recognition Using Modality-Wise Knowledge Distillation

Seonggyu Lee, Youngdo Ahn, Jong Won Shin

Year: 2025   Journal: Sensors   Vol: 25 (20)   Pages: 6341   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Multimodal emotion recognition (MER) aims to estimate emotional states utilizing multiple sensors simultaneously. Most previous MER models extract unimodal representations via modality-wise encoders and combine them into a multimodal representation to classify the emotion, and these models are trained with an objective defined on the final output of the MER system. If the encoder for a specific modality is optimized better than the others at some point in the training procedure, the parameters of the other encoders may not be sufficiently updated to provide optimal performance. In this paper, we propose an MER method using modality-wise knowledge distillation, which adapts the unimodal encoders using pre-trained unimodal emotion recognition models. Experimental results on the CREMA-D and IEMOCAP databases demonstrated that the proposed method outperformed previous approaches designed to overcome the optimization imbalance phenomenon and could also be combined effectively with those approaches.
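The modality-wise distillation idea described in the abstract can be sketched as follows: alongside the usual classification objective on the fused output, each unimodal encoder's predictions are pulled toward those of a pre-trained unimodal teacher for the same modality. This is a minimal NumPy sketch under assumed design choices; the temperature-softened KL form, the weighting factor `alpha`, and all names are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over the last axis, with temperature scaling."""
    z = np.asarray(logits, dtype=float) / temperature
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def kd_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened class distributions,
    scaled by T^2 as is conventional in knowledge distillation."""
    p = softmax(teacher_logits, temperature)   # teacher distribution (target)
    q = softmax(student_logits, temperature)   # student distribution
    kl = np.sum(p * (np.log(p) - np.log(q)), axis=-1)
    return float(kl.mean() * temperature ** 2)

def modality_wise_kd_loss(unimodal_logits, teacher_logits, task_loss, alpha=0.5):
    """Total objective = fused-output task loss + alpha * sum of per-modality
    distillation losses. Both dicts map a modality name (e.g. "audio", "video")
    to class logits of shape (batch, num_classes)."""
    kd = sum(kd_loss(unimodal_logits[m], teacher_logits[m])
             for m in unimodal_logits)
    return task_loss + alpha * kd
```

For example, with two modalities the per-modality terms keep both encoders receiving gradient signal even when the fused objective is dominated by the stronger modality, which is the imbalance the abstract describes.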

Metrics

Cited By: 0
FWCI (Field-Weighted Citation Impact): 0.00
Refs: 33
Citation Normalized Percentile: 0.41

Topics

- Emotion and Mood Recognition (Social Sciences → Psychology → Experimental and Cognitive Psychology)
- Sentiment Analysis and Opinion Mining (Physical Sciences → Computer Science → Artificial Intelligence)
- Face and Expression Recognition (Physical Sciences → Computer Science → Computer Vision and Pattern Recognition)
