JOURNAL ARTICLE

Speech Emotion Recognition based on Multiple Feature Fusion

Abstract

In speech emotion recognition,the extraction of Mel-frequency Cepstral Coefficients (MFCC) will lose many useful spectral feature information,resulting in low recognition accuracy. Therefore,a method for combining MFCC and Mel spectrum for speech emotion recognition is proposed. First,M FCC are extracted from the audio signal,and then the Mel spectrum is extracted from the audio signal. Finally,the fused audio features are used to support vector machine classification. The experimental results in the EMODB data set show that the fusion of multiple features has higher classification accuracy than the classifier using single feature,and the proposed method can effectively improve the recognition accuracy.

Keywords:
Speech recognition Computer science Emotion recognition Feature (linguistics) Fusion Feature extraction Artificial intelligence Pattern recognition (psychology) Linguistics

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
20
Refs
0.18
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
© 2026 ScienceGate Book Chapters — All rights reserved.