JOURNAL ARTICLE

Active Learning for Speech Emotion Recognition Using Deep Neural Network

Abstract

Deep neural networks (DNNs) have consistently advanced the state of the art in many fields, including speech emotion recognition. However, DNN-based solutions require vast amounts of labeled data for training. In speech emotion recognition, the cost and time needed to annotate data with emotional labels can be prohibitive. The available corpora typically contain a few thousand recordings collected from a limited number of speakers. As a result, models trained on such corpora fail to generalize to samples from new domains. This study explores practical solutions for training DNNs for speech emotion recognition with limited resources by using active learning (AL). We assume that data without emotional labels from a new domain are available and that we have the resources to annotate only a limited number of selected recordings with emotional labels. We actively select samples using greedy sampling (GS) and uncertainty-based methods, and evaluate performance on regression problems where the goal is to predict scores for arousal and valence. We show that active learning leads to competitive performance with limited training data.
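
The abstract does not say which greedy sampling variant is used, so the following is only a minimal sketch of one common form: farthest-first selection in feature space, where each newly chosen recording is the unlabeled sample farthest from everything already labeled or selected. The function name `greedy_sampling`, the feature arrays, and the `budget` parameter are hypothetical and introduced purely for illustration; they are not taken from the article.

```python
import numpy as np


def greedy_sampling(labeled, unlabeled, budget):
    """Return indices of `unlabeled` rows chosen by farthest-first selection.

    Assumption: diversity is measured by Euclidean distance between
    feature vectors. This is an illustrative sketch, not the article's code.
    """
    labeled = np.asarray(labeled, dtype=float)
    unlabeled = np.asarray(unlabeled, dtype=float)

    # Distance from every unlabeled sample to its nearest labeled sample.
    diffs = unlabeled[:, None, :] - labeled[None, :, :]
    min_dist = np.linalg.norm(diffs, axis=2).min(axis=1)

    selected = []
    for _ in range(min(budget, len(unlabeled))):
        # Pick the sample that is farthest from the current labeled pool.
        idx = int(np.argmax(min_dist))
        selected.append(idx)

        # Treat the chosen sample as labeled and refresh nearest distances.
        new_dist = np.linalg.norm(unlabeled - unlabeled[idx], axis=1)
        min_dist = np.minimum(min_dist, new_dist)

        # Exclude the chosen sample from later rounds.
        min_dist[idx] = -np.inf
    return selected


if __name__ == "__main__":
    labeled_feats = [[0.0, 0.0]]
    unlabeled_feats = [[1.0, 0.0], [5.0, 0.0], [2.0, 0.0], [10.0, 0.0]]
    print(greedy_sampling(labeled_feats, unlabeled_feats, budget=2))
```

In an active learning loop, the returned indices would be sent for arousal and valence annotation, after which the model is retrained. An uncertainty-based criterion would replace the distance score with some estimate of the model's prediction uncertainty for each unlabeled recording.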

Keywords:
Computer science; Emotion recognition; Speech recognition; Artificial intelligence; Artificial neural network; Valence (chemistry); Deep learning; Labeled data; Deep neural networks; Training set; Machine learning

Metrics

Cited By: 41
FWCI (Field Weighted Citation Impact): 3.28
Refs: 44
Citation Normalized Percentile: 0.93
Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
