LLM-Enhanced Multi-Teacher Knowledge Distillation for Modality-Incomplete Emotion Recognition in Daily Healthcare

Yuzhe Zhang; Huan Liu; Yang Xiao; Mohammed Amoon; Dalin Zhang; Di Wang; Shusen Yang; Chai Quek

doi:10.1109/jbhi.2024.3470338

JOURNAL ARTICLE

Year: 2024
Journal: IEEE Journal of Biomedical and Health Informatics
Vol: 29 (9)
Pages: 6406-6416
Publisher: Institute of Electrical and Electronics Engineers

Abstract

The critical importance of monitoring and recognizing human emotional states in healthcare has led to a surge in proposals for EEG-based multimodal emotion recognition in recent years. However, practical challenges arise in acquiring EEG signals in daily healthcare settings due to stringent data acquisition conditions, resulting in the issue of incomplete modalities. Existing studies have turned to knowledge distillation as a means to mitigate this problem by transferring knowledge from multimodal networks to unimodal ones. However, these methods are constrained by the use of a single teacher model to transfer integrated feature extraction knowledge, particularly concerning spatial and temporal features in EEG data. To address this limitation, we propose a multi-teacher knowledge distillation framework enhanced with a Large Language Model (LLM), aimed at facilitating effective feature learning in the student network by transferring knowledge of extracting integrated features. Specifically, we employ an LLM as the teacher for extracting temporal features and a graph convolutional neural network for extracting spatial features. To further enhance knowledge distillation, we introduce causal masking and a confidence indicator into the LLM to facilitate the transfer of the most discriminative features. Extensive testing on the DEAP and MAHNOB-HCI datasets demonstrates that our model outperforms existing methods in the modality-incomplete scenario. This study underscores the potential application of large models in this field.
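The multi-teacher idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a generic logit-based distillation loss, two hypothetical teachers (one standing in for the temporal LLM teacher, one for the spatial graph-convolutional teacher), and a hypothetical confidence score taken as each teacher's peak softened probability. The function names, the temperature value, and the weighting rule are all assumptions made for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def multi_teacher_kd_loss(student_logits, teacher_logits_list, temperature=2.0):
    """Confidence-weighted sum of per-teacher distillation losses.

    Assumption: a teacher's confidence is its peak softened probability.
    The paper's actual confidence indicator is not described in the abstract.
    """
    student_probs = softmax(student_logits, temperature)
    teacher_probs = [softmax(t, temperature) for t in teacher_logits_list]

    # Normalize the hypothetical confidences so the weights sum to 1.
    confidences = [max(p) for p in teacher_probs]
    total_conf = sum(confidences)
    weights = [c / total_conf for c in confidences]

    loss = sum(
        w * kl_divergence(p, student_probs)
        for w, p in zip(weights, teacher_probs)
    )
    # The T^2 factor is the usual rescaling for softened targets.
    return (temperature ** 2) * loss, weights


if __name__ == "__main__":
    temporal_teacher = [3.0, 0.0]  # hypothetical logits, two emotion classes
    spatial_teacher = [0.5, 0.0]
    student = [1.0, 0.2]
    loss, weights = multi_teacher_kd_loss(
        student, [temporal_teacher, spatial_teacher]
    )
    print("loss:", loss, "weights:", weights)
```

In this sketch the more confident teacher contributes more to the student's loss, and the loss vanishes when the student matches every teacher exactly.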

Keywords:

Modality (human–computer interaction); Computer science; Health care; Distillation; Artificial intelligence; Emotion recognition; Knowledge management; Human–computer interaction; Natural language processing; Chemistry

Metrics

FWCI (Field Weighted Citation Impact): 4.47

Citation Normalized Percentile: 0.92

Topics

Sentiment Analysis and Opinion Mining

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Multimodal Emotion Recognition Using Modality-Wise Knowledge Distillation

Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition

Modality- and Subject-Aware Emotion Recognition Using Knowledge Distillation

Focal Channel Knowledge Distillation for Multi-Modality Action Recognition

Cross-Modal Knowledge Distillation for Enhanced Unimodal Emotion Recognition