Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

Mohammed Sidi Yakoub; Sid‐Ahmed Selouani; Brahim-Fares Zaidi; Asma Bouchair

doi:10.60692/mx4ct-spa85

ScienceGate Book Chapters

JOURNAL ARTICLE

Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

Mohammed Sidi Yakoub Sid‐Ahmed Selouani Brahim-Fares Zaidi Asma Bouchair

Year: 2020 Journal: Greater South Information System

DOI: 10.60692/mx4ct-spa85

Get Full-Text PDF Get Analytical Report

Abstract

Abstract In this paper, we use empirical mode decomposition and Hurst-based mode selection (EMDH) along with deep learning architecture using a convolutional neural network (CNN) to improve the recognition of dysarthric speech. The EMDH speech enhancement technique is used as a preprocessing step to improve the quality of dysarthric speech. Then, the Mel-frequency cepstral coefficients are extracted from the speech processed by EMDH to be used as input features to a CNN-based recognizer. The effectiveness of the proposed EMDH-CNN approach is demonstrated by the results obtained on the Nemours corpus of dysarthric speech. Compared to baseline systems that use Hidden Markov with Gaussian Mixture Models (HMM-GMMs) and a CNN without an enhancement module, the EMDH-CNN system increases the overall accuracy by 20.72% and 9.95%, respectively, using a k -fold cross-validation experimental setup.

Keywords:

Convolutional neural network Pattern recognition (psychology) Preprocessor Hidden Markov model Mixture model Mode (computer interface) Mel-frequency cepstrum Artificial neural network Cepstrum

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.34

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Voice and Speech Disorders

Health Sciences → Medicine → Physiology

Phonocardiography and Auscultation Techniques

Health Sciences → Medicine → Pulmonary and Respiratory Medicine

Respiratory and Cough-Related Research

Health Sciences → Medicine → Pulmonary and Respiratory Medicine

Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

Abstract

Metrics

Topics

Related Documents

Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

Improving dysarthric speech recognition using empirical mode decomposition and convolutional neural network

Dysarthric Speech Recognition Using Variational Mode Decomposition and Convolutional Neural Networks

Dysarthric Speech Recognition Using Convolutional LSTM Neural Network

Residual Convolutional Neural Network-Based Dysarthric Speech Recognition