Emotion recognition from speech signals using digital features optimization by diversity measure fusion

Ashok Kumar Konduru; J. L. Mazher Iqbal

doi:10.3233/jifs-231263

ScienceGate Book Chapters

JOURNAL ARTICLE

Emotion recognition from speech signals using digital features optimization by diversity measure fusion

Ashok Kumar Konduru J. L. Mazher Iqbal

Year: 2023 Journal: Journal of Intelligent & Fuzzy Systems Vol: 46 (1)Pages: 2547-2572 Publisher: IOS Press

DOI: 10.3233/jifs-231263

Get Full-Text PDF Get Analytical Report

Abstract

Emotion recognition from speech signals serves a crucial role in human-computer interaction and behavioral studies. The task, however, presents significant challenges due to the high dimensionality and noisy nature of speech data. This article presents a comprehensive study and analysis of a novel approach, “Digital Features Optimization by Diversity Measure Fusion (DFOFDM)”, aimed at addressing these challenges. The paper begins by elucidating the necessity for improved emotion recognition methods, followed by a detailed introduction to DFOFDM. This approach employs acoustic and spectral features from speech signals, coupled with an optimized feature selection process using a fusion of diversity measures. The study’s central method involves a Cuckoo Search-based classification strategy, which is tailored for this multi-label problem. The performance of the proposed DFOFDM approach is evaluated extensively. Emotion labels such as ‘Angry’, ‘Happy’, and ‘Neutral’ showed a precision rate over 92%, while other emotions fell within the range of 87% to 90%. Similar performance was observed in terms of recall, with most emotions falling within the 90% to 95% range. The F-Score, another crucial metric, also reflected comparable statistics for each label. Notably, the DFOFDM model showed resilience to label imbalances and noise in speech data, crucial for real-world applications. When compared with a contemporary model, “Transfer Subspace Learning by Least Square Loss (TSLSL)”, DFOFDM displayed superior results across various evaluation metrics, indicating a promising improvement in the field of speech emotion recognition. In terms of computational complexity, DFOFDM demonstrated effective scalability, providing a feasible solution for large-scale applications. Despite its effectiveness, the study acknowledges the potential limitations of the DFOFDM, which might influence its performance on certain types of real-world data. The findings underline the potential of DFOFDM in advancing emotion recognition techniques, indicating the necessity for further research.

Keywords:

Computer science Speech recognition Artificial intelligence Subspace topology Metric (unit) Machine learning Pattern recognition (psychology)

Metrics

Cited By

0.42

FWCI (Field Weighted Citation Impact)

Refs

0.63

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Emotion recognition from speech signals using digital features optimization by diversity measure fusion

Abstract

Metrics

Citation History

Topics

Related Documents

Emotion recognition from speech signals using new harmony features

Robotic Emotion Recognition Using Two-Level Features Fusion in Audio Signals of Speech

Emotion Recognition from Speech Signals using Excitation Source and Spectral Features

Emotion recognition from physiological signals using fusion of wavelet based features

Emotion Recognition using Speech Signals