JOURNAL ARTICLE

Convolutional Neural Network-Based Automatic Speech Emotion Recognition System for Malayalam

V. K. MuneerK. P. Mohamed BasheerRizwana Kallooravi Thandil

Year: 2023 Journal:   Indian Journal of Science and Technology Vol: 16 (46)Pages: 4410-4420   Publisher: Indian Society for Education and Environment

Abstract

<h2>Abstract</h2>\n<p><strong>Objectives:</strong> This research work focuses on developing a SER system using CNN and deep learning techniques for a low-resourced Dravidian Indian Language, Malayalam. The importance of speech as a powerful and natural medium of communication, capable of conveying a wide range of information about an individual's mental, behavioral, and emotional characteristics. With the increasing prevalence of human-machine interactions, the study of speech analysis has played a crucial role in bridging the gap between the physical and digital realms. Particularly, the field of emotion identification has gained popularity, as emotions are frequently expressed through speech cues. However, the scarcity of suitable datasets poses a challenge for researchers conducting experiments. <strong>Methods:</strong> In this paper, we address this challenge by employing Long Convolutional Neural Networks (CNN) to effectively recognize sentiments in voice recordings of Malayalam, a low-resource language. We manually construct datasets from audio clips of Malayalam movies and employ the Mel Frequency-Cepstral-Coefficient (MFCC) approach to extract features from the audio signals. <strong>Findings:</strong> By training, classifying, and testing our model using raw speech data from the dataset, the paper proposes a novel approach for recognizing emotions from voice signals processed in Malayalam with an average accuracy of 71%, indicating its ability to correctly predict emotions from vocal utterances in this under-resourced Language. <strong>Novelty:</strong> The novelty of this work lies in its dedication to addressing the challenges of emotion recognition in a low-resource language, the manual creation of datasets, and the successful adaptation of established techniques to a linguistic context where research is relatively scarce. These contributions collectively advance the field of speech emotion recognition and pave the way for further exploration in underrepresented languages.</p>\n<p><strong>Keywords</strong>: Speech emotion recognition, Malayalam, Natural Language Processing, MFCC, CNN</p>

Keywords:
Computer science Malayalam Keyword spotting Speech recognition Mel-frequency cepstrum Convolutional neural network Artificial intelligence Natural language processing Utterance Context (archaeology) Feature extraction

Metrics

1
Cited By
0.26
FWCI (Field Weighted Citation Impact)
25
Refs
0.60
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.