This paper presents a comparative study of two classifiers built for speech emotion recognition. Perceiving a person's emotions has always been an intriguing problem. Emotions can be expressed through facial expressions, speech, actions, and so forth, and speech is the most widely used form of communication. Speech is an elaborate signal that carries several kinds of information: the content of the message, the tone of the speaker, the language used, background noise, any musical sound, the speaker's emotions, and so on. Speech emotion recognition is becoming increasingly important with the advancement of voice user interface technology, which allows computers to interact with humans by analyzing speech to understand a person's instructions and carry out the requested tasks and commands. An emotion is always attached to a spoken utterance, but recognizing that emotion remains a difficult research problem, mainly because the way emotions are perceived from audio differs from person to person. I have created two models for speech emotion recognition, using Mel Frequency Cepstral Coefficients (MFCCs) for feature extraction from the audio files. The first model, built with a Multi-Layer Perceptron (MLP) classifier, achieved an accuracy of 57.29 percent. The second model, built with a Long Short-Term Memory (LSTM) network, achieved a higher accuracy of 92.88 percent. I used the RAVDESS dataset for classification.
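The MFCC-plus-MLP pipeline summarized above can be sketched as follows. This is an illustrative sketch, not the paper's actual code: the feature matrix here is synthetic random data standing in for MFCC vectors that, in the real pipeline, would be extracted from each RAVDESS audio file (e.g. with `librosa.feature.mfcc`), and the hyperparameters are assumptions for demonstration only.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Stand-in for MFCC features: in the real pipeline, each RAVDESS audio
# file would be reduced to a fixed-length vector of MFCCs (e.g. 40
# coefficients averaged over time). Here we synthesize such vectors.
n_samples, n_mfcc, n_emotions = 200, 40, 8  # RAVDESS labels 8 emotions
X = rng.normal(size=(n_samples, n_mfcc))
y = rng.integers(0, n_emotions, size=n_samples)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# MLP classifier over the MFCC vectors; layer size and iteration count
# are illustrative, not the configuration used in the paper.
clf = MLPClassifier(hidden_layer_sizes=(300,), max_iter=300,
                    random_state=0)
clf.fit(X_train, y_train)
pred = clf.predict(X_test)  # one predicted emotion label per utterance
```

The LSTM variant would instead keep the MFCCs as a time sequence per file and feed them to a recurrent network, which is what allows it to exploit the temporal structure the MLP discards.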