JOURNAL ARTICLE

Emotion recognition from speech signals using digital features optimization by diversity measure fusion

Ashok Kumar Konduru, J. L. Mazher Iqbal

Year: 2023 Journal: Journal of Intelligent & Fuzzy Systems Vol: 46 (1) Pages: 2547-2572 Publisher: IOS Press

Abstract

Emotion recognition from speech signals plays a crucial role in human-computer interaction and behavioral studies. The task, however, presents significant challenges due to the high dimensionality and noisy nature of speech data. This article presents a comprehensive study and analysis of a novel approach, “Digital Features Optimization by Diversity Measure Fusion (DFOFDM)”, aimed at addressing these challenges. The paper begins by explaining the need for improved emotion recognition methods, followed by a detailed introduction to DFOFDM. The approach uses acoustic and spectral features extracted from speech signals, coupled with an optimized feature selection process based on a fusion of diversity measures. The study’s central method is a Cuckoo Search-based classification strategy tailored to this multi-label problem. The performance of the proposed DFOFDM approach is evaluated extensively. Emotion labels such as ‘Angry’, ‘Happy’, and ‘Neutral’ showed precision above 92%, while other emotions fell within the range of 87% to 90%. Recall was similar, with most emotions falling within the 90% to 95% range. The F-Score, another crucial metric, reflected comparable statistics for each label. Notably, the DFOFDM model showed resilience to label imbalance and noise in speech data, which is crucial for real-world applications. When compared with a contemporary model, “Transfer Subspace Learning by Least Square Loss (TSLSL)”, DFOFDM displayed superior results across various evaluation metrics, indicating a promising improvement in the field of speech emotion recognition. In terms of computational complexity, DFOFDM demonstrated effective scalability, providing a feasible solution for large-scale applications. Despite this effectiveness, the study acknowledges potential limitations of DFOFDM that might influence its performance on certain types of real-world data. The findings underline the potential of DFOFDM in advancing emotion recognition techniques and indicate the need for further research.
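
The abstract reports precision, recall, and F-Score separately for each emotion label. As a minimal sketch of how such per-label scores are commonly computed for a multi-label task, the Python snippet below counts true positives, false positives, and false negatives for each label. The function name `per_label_scores` and the toy data are illustrative assumptions; they are not taken from the paper and do not reproduce its evaluation protocol or results.

```python
from typing import Dict, List, Set


def per_label_scores(
    y_true: List[Set[str]],
    y_pred: List[Set[str]],
    labels: List[str],
) -> Dict[str, Dict[str, float]]:
    """Compute precision, recall, and F-score independently for each label.

    Each sample is a set of labels, so one utterance may carry several
    emotions (multi-label). Hypothetical helper, not the paper's code.
    """
    scores: Dict[str, Dict[str, float]] = {}
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if label in t and label in p)
        fp = sum(1 for t, p in pairs if label not in t and label in p)
        fn = sum(1 for t, p in pairs if label in t and label not in p)
        # Guard against division by zero when a label is never predicted
        # or never present in the ground truth.
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f_score = 2 * precision * recall / denom if denom else 0.0
        scores[label] = {
            "precision": precision,
            "recall": recall,
            "f_score": f_score,
        }
    return scores


# Toy data for illustration only (not results from the paper).
y_true = [{"Angry"}, {"Happy"}, {"Neutral"}, {"Angry"}]
y_pred = [{"Angry"}, {"Angry"}, {"Neutral"}, {"Angry"}]
scores = per_label_scores(y_true, y_pred, ["Angry", "Happy", "Neutral"])
for label, s in scores.items():
    print(label, s)
```

Reporting the scores per label, rather than as a single average, makes it visible when a rare emotion is recognized poorly, which is relevant to the label imbalance the abstract mentions.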

Keywords:
Computer science; Speech recognition; Artificial intelligence; Subspace topology; Metric (unit); Machine learning; Pattern recognition (psychology)

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.42
Refs: 33
Citation Normalized Percentile: 0.63

Topics

Emotion and Mood Recognition
Social Sciences → Psychology → Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences → Computer Science → Signal Processing
Music and Audio Processing
Physical Sciences → Computer Science → Signal Processing

Related Documents

JOURNAL ARTICLE

Emotion recognition from speech signals using new harmony features

Bin Yang, Marko Lugger

Journal: Signal Processing Year: 2009 Vol: 90 (5) Pages: 1415-1423

JOURNAL ARTICLE

Robotic Emotion Recognition Using Two-Level Features Fusion in Audio Signals of Speech

Chang Li

Journal: IEEE Sensors Journal Year: 2021 Vol: 22 (18) Pages: 17447-17454

JOURNAL ARTICLE

Emotion Recognition using Speech Signals

S. Harsha Vardhan, M. P. Rahu, Puttamreddy Kavyasri, A. Sraavani

Journal: International Journal of Advanced Research in Science Communication and Technology Year: 2022 Pages: 126-132