Emotion recognition from Persian speech with 1D Convolution neural network

SeyedMilad Ranaei Siadat; Ilia M. Voronkov; Alexander A. Kharlamov

doi:10.1109/cnn56452.2022.9912532

ScienceGate Book Chapters

JOURNAL ARTICLE

Emotion recognition from Persian speech with 1D Convolution neural network

SeyedMilad Ranaei Siadat Ilia M. Voronkov Alexander A. Kharlamov

Year: 2022 Pages: 152-157

DOI: 10.1109/cnn56452.2022.9912532

Get Full-Text PDF Get Analytical Report

Abstract

The problem of recognizing and classifying emotions in speech is one of the most relevant and significant research topics, however, hardly any studies have been conducted to date for a large number of languages to achieve the required accuracy. Expressing and recognizing emotions based on the signal of the human speech is one of the complex issues that is distinct from languages. This paper proposes a systematical and robust approach to implement an emotion recognition system for low resource languages such as Persian. To the best of our knowledge, this is the first SER work on the Persian language using deep learning techniques. Sharif Emotional Speech Database ShEMO with five basic emotions including anger, fear, happiness, sadness and surprise, as well as neutral state is identified as suitable candidate to evaluate a 1D Convolutional Neural Network (1DCNN) architecture. The data are first processed using Mel-Frequency Cepstral Coefficients (MFCC) feature extraction method and then feed MFCC as input feature to our neural network. Experimental results demonstrate that our proposed method achieves about 74% classification accuracy on ShEMO dataset.

Keywords:

Computer science Mel-frequency cepstrum Speech recognition Feature extraction Convolutional neural network Sadness Artificial intelligence Feature (linguistics) Artificial neural network Surprise Emotion classification Persian Natural language processing Pattern recognition (psychology) Convolution (computer science) Anger Psychology

Metrics

Cited By

0.25

FWCI (Field Weighted Citation Impact)

Refs

0.57

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Emotion recognition from Persian speech with 1D Convolution neural network

Abstract

Metrics

Citation History

Topics

Related Documents

Emotion Recognition from Persian Speech with Neural Network

Emotion Recognition of Manipuri Speech using Convolution Neural Network

Convolution neural network with multiple pooling strategies for speech emotion recognition

Emotion Recognition from Speech Audio Signals Using Convolution Neural Network Model Architectures

Speech Emotion Recognition Based on Dynamic Convolution Recurrent Neural Network