JOURNAL ARTICLE

Residual Learning with Bi-LSTM and Multi-Head Attention for Multi-Modal Emotion Recognition

Abstract

Multimodal emotion recognition has a wide range of applications in the fields of intelligent recommendation and human-computer interaction. In recent research on emotion recognition, the model using recurrent neural network could observe the context semantic information and infer the emotion label jointly according to the context information, but it lacked the capture of key information and did not solve the problem of network degradation. Therefore, this paper designs a model based on Bi-LSTM, multi-head attention mechanism and residual connection fusion (Att-BiLSTM). The Bi-LSTM structure realizes the context semantic inference function, the multi-head attention mechanism realizes the emphasis on key information, and the residual connection alleviates the network degradation caused by the overfitting problem. Att-BiLSTM achieves 62.1% precision and 61.8% recall on the IEMOCAP dataset, which is better than the existing comparison algorithms.

Keywords:
Computer science Residual Overfitting Artificial intelligence Context (archaeology) Key (lock) Inference Recurrent neural network Machine learning Emotion recognition Artificial neural network

Metrics

1
Cited By
0.42
FWCI (Field Weighted Citation Impact)
6
Refs
0.60
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Multi-head attention fusion networks for multi-modal speech emotion recognition

Junfeng ZhangLining XingZhen TanHongsen WangKesheng Wang

Journal:   Computers & Industrial Engineering Year: 2022 Vol: 168 Pages: 108078-108078
BOOK-CHAPTER

Multi-modal Emotion Recognition with Temporal-Band Attention Based on LSTM-RNN

Jiamin LiuYuanqi SuYuehu Liu

Lecture notes in computer science Year: 2018 Pages: 194-204
JOURNAL ARTICLE

Multi-view and Attention-Based BI-LSTM for Weibo Emotion Recognition

Xiaofeng CaiZhifeng Hao

Journal:   Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018) Year: 2018
© 2026 ScienceGate Book Chapters — All rights reserved.