JOURNAL ARTICLE

Speech Emotion Recognition using Mel Frequency Cepstral Coefficient and SVM Classifier

Abstract

Speech Emotion Recognition is the forefront of Human Machine Interaction. In this paper we present a supervised learning approach for Speech Emotion Recognition (SER) using Mel Frequency Cepstral Coefficient (MFCC) feature extraction. Using dimensionality reduction and normalization we increase the efficiency of prediction accuracy. The developed model was trained and tested against the Berlin Database of Emotional Speech recorded by the Technical University Berlin. The model was classified using a Support Vector Machine (SVM). Using validation, the model resulted with highest accuracy of 92.16% against the test samples.

Keywords:
Mel-frequency cepstrum Normalization (sociology) Speech recognition Computer science Support vector machine Artificial intelligence Feature extraction Emotion recognition Pattern recognition (psychology) Dimensionality reduction Classifier (UML)

Metrics

13
Cited By
1.08
FWCI (Field Weighted Citation Impact)
7
Refs
0.78
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.