Multimodal emotion recognition has a wide range of applications in intelligent recommendation and human-computer interaction. In recent emotion recognition research, models based on recurrent neural networks can observe contextual semantic information and jointly infer emotion labels from that context, but they fail to capture key information and do not address the problem of network degradation. This paper therefore proposes a model that fuses Bi-LSTM, a multi-head attention mechanism, and residual connections (Att-BiLSTM). The Bi-LSTM structure performs contextual semantic inference, the multi-head attention mechanism emphasizes key information, and the residual connections alleviate the network degradation caused by overfitting. Att-BiLSTM achieves 62.1% precision and 61.8% recall on the IEMOCAP dataset, outperforming the existing comparison algorithms.
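The pipeline described above (Bi-LSTM for context encoding, multi-head self-attention for emphasizing key information, and a residual connection around the attention block) can be sketched roughly in PyTorch. This is an illustrative sketch only: the feature dimension, hidden size, head count, class count, and mean-pooling readout are assumptions, not the paper's actual configuration.

```python
import torch
import torch.nn as nn

class AttBiLSTM(nn.Module):
    """Illustrative sketch of a Bi-LSTM + multi-head attention + residual model.
    All hyperparameters below are assumed for demonstration."""

    def __init__(self, input_dim=100, hidden_dim=64, num_heads=4, num_classes=6):
        super().__init__()
        # Bi-LSTM encodes each utterance feature sequence with bidirectional context
        self.bilstm = nn.LSTM(input_dim, hidden_dim,
                              batch_first=True, bidirectional=True)
        # Multi-head self-attention re-weights time steps to emphasize key information
        self.attn = nn.MultiheadAttention(2 * hidden_dim, num_heads,
                                          batch_first=True)
        self.norm = nn.LayerNorm(2 * hidden_dim)
        self.fc = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, x):
        h, _ = self.bilstm(x)        # (batch, seq, 2*hidden): contextual encoding
        a, _ = self.attn(h, h, h)    # self-attention over the encoded sequence
        h = self.norm(h + a)         # residual connection around the attention block
        return self.fc(h.mean(dim=1))  # pooled logits over the (assumed) classes

model = AttBiLSTM()
logits = model(torch.randn(2, 10, 100))  # batch of 2 sequences, 10 steps, 100-dim
print(logits.shape)                      # torch.Size([2, 6])
```

The residual connection `h + a` lets the attention block learn a refinement of the Bi-LSTM encoding rather than replace it, which is the standard way such skip connections mitigate degradation in deeper stacks.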
Junfeng Zhang, Lining Xing, Zhen Tan, Hongsen Wang, Kesheng Wang
Yadi Wang, Xiaoding Guo, Xianhong Hou, Zhijun Miao, Xiaojin Yang, Jinkai Guo