JOURNAL ARTICLE

Emotion recognition using multi-modal features and CNN classification

Saba Noor Ayesha KhanumUpendra Kumar MummadiFahmina TaranumSyed Shabbeer AhmadImtiyaz KhanD. Shravani

Year: 2024 Journal:   AIP conference proceedings Vol: 3007 Pages: 030001-030001   Publisher: American Institute of Physics

Abstract

An emerging use of artificial intelligence is automatic emotion recognition. Facial expression identification is an intriguing and challenging problem in computer vision. In data science, one of the most difficult problems is speech emotion recognition. The technology that has been built consists of two stages: the first involves real-time facial and speech capture and the second is categorizing of emotions. Data collection, data analysis, and data visualization are the stages of automated emotion identification. Convolution neural networks are used in the proposed multimodal system to identify emotions from speech and face expressions. Each block in the sequence is made up of convolution layers and sub sampling layers. The most difficult of all the available datasets, FER2013, was used to train the model for face emotion recognition. The accuracy that has been attained for this task is 71%. To address the issue of data deficiency in speech emotion identification, four distinct datasets—CREMA-D, RAVDESS, SAVEE, and TESS were integrated. The accuracy achieved for this challenge is 88%. The suggested approach can recognize eight emotions in total namely "angry, calm, disgust, fear, happy, neutral, sad, and surprised" for both the speech and the face, respectively. Additional effects include batch normalization, early stopping, and dropouts for better performance.

Keywords:
Computer science Modal Artificial intelligence Emotion recognition Pattern recognition (psychology) Feature extraction Speech recognition

Metrics

1
Cited By
1.10
FWCI (Field Weighted Citation Impact)
16
Refs
0.65
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Audio-Visual Emotion Recognition System Using Multi-Modal Features

Anand HandaRashi AgarwalNarendra Kohli

Journal:   International Journal of Cognitive Informatics and Natural Intelligence Year: 2021 Vol: 15 (4)Pages: 1-14
JOURNAL ARTICLE

Audio-Visual Emotion Recognition System using Multi-Modal Features

Journal:   International Journal of Cognitive Informatics and Natural Intelligence Year: 2021 Vol: 15 (4)Pages: 0-0
JOURNAL ARTICLE

Multi-Modal Emotion Recognition Using Speech Features and Text-Embedding

Sung-Woo ByunJu-Hee KimSeok-Pil Lee

Journal:   Applied Sciences Year: 2021 Vol: 11 (17)Pages: 7967-7967
© 2026 ScienceGate Book Chapters — All rights reserved.