JOURNAL ARTICLE

[Research on bimodal emotion recognition algorithm based on multi-branch bidirectional multi-scale time perception].

Peiyun XueShiao WangJing BaiYan Qiang

Year: 2025 Journal:   PubMed Vol: 42 (3)Pages: 528-536   Publisher: National Institutes of Health

Abstract

Emotion can reflect the psychological and physiological health of human beings, and the main expression of human emotion is voice and facial expression. How to extract and effectively integrate the two modes of emotion information is one of the main challenges faced by emotion recognition. In this paper, a multi-branch bidirectional multi-scale time perception model is proposed, which can detect the forward and reverse speech Mel-frequency spectrum coefficients in the time dimension. At the same time, the model uses causal convolution to obtain temporal correlation information between different scale features, and assigns attention maps to them according to the information, so as to obtain multi-scale fusion of speech emotion features. Secondly, this paper proposes a two-modal feature dynamic fusion algorithm, which combines the advantages of AlexNet and uses overlapping maximum pooling layers to obtain richer fusion features from different modal feature mosaic matrices. Experimental results show that the accuracy of the multi-branch bidirectional multi-scale time sensing dual-modal emotion recognition model proposed in this paper reaches 97.67% and 90.14% respectively on the two public audio and video emotion data sets, which is superior to other common methods, indicating that the proposed emotion recognition model can effectively capture emotion feature information and improve the accuracy of emotion recognition.

Keywords:
Perception Computer science Scale (ratio) Speech recognition Emotion recognition Artificial intelligence Psychology Pattern recognition (psychology) Algorithm Neuroscience Geography Cartography

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Computing and Algorithms
Social Sciences →  Social Sciences →  Urban Studies
Educational Technology and Pedagogy
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Speech emotion recognition method based on time-aware bidirectional multi-scale network

Liyan ZhangJiaxin DuJiayan LiXinyu Wang

Journal:   Journal of Physics Conference Series Year: 2024 Vol: 2816 (1)Pages: 012102-012102
JOURNAL ARTICLE

Action recognition algorithm based on multi-scale and multi-branch features

Lei ZhangGuang-liang HAN

Journal:   Chinese Journal of Liquid Crystals and Displays Year: 2022 Vol: 37 (12)Pages: 1614-1625
© 2026 ScienceGate Book Chapters — All rights reserved.