Abstract

Speech Emotion Recognition (SER) is an emerging field that involves recognizing emotions conveyed in speech. Emotions expressed through speech can greatly impact decision-making. This paper delves into the topic of speech emotion recognition (SER) and its focus on interpreting emotions conveyed through spoken language. The importance of SER lies in its potential to improve human-computer interaction, cognitive analysis, and psychiatric assessment. The study combines and preprocesses audio data from various datasets, such as RAVDESS, CREMA-D, TESS, and SAVEE, and uses log mel spectrograms to effectively extract features. Various methods including CNN models, and standard and optimized feature extraction techniques are used. The results suggest that SER has significant real-world applications and the approaches provided effectively identify emotional and voice signals.

Keywords:
Spectrogram Emotion recognition Computer science Speech recognition Feature extraction Field (mathematics) Focus (optics) Feature (linguistics) Emotion classification Sentiment analysis Natural language processing Emotion detection Artificial intelligence Linguistics

Metrics

1
Cited By
1.10
FWCI (Field Weighted Citation Impact)
16
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
EEG and Brain-Computer Interfaces
Life Sciences →  Neuroscience →  Cognitive Neuroscience
© 2026 ScienceGate Book Chapters — All rights reserved.