JOURNAL ARTICLE

CONVOLUTIONAL NEURAL NETWORK FOR ARABIC SPEECH RECOGNITION

Abstract

This work is focused on single word Arabic automatic speech recognition (AASR). Two techniques are used during the feature extraction phase; Log frequency spectral coefficients (MFSC) and Gammatone-frequency cepstral coefficients (GFCC) with their first and second-order derivatives. The convolutional neural network (CNN) is mainly used to execute feature learning and classification process. CNN achieved performance enhancement in automatic speech recognition (ASR). Local connectivity, weight sharing, and pooling are the crucial properties of CNNs that have the potential to improve ASR. We tested the CNN model using an Arabic speech corpus of isolated words. The used corpus is synthetically augmented by applying different transformations such as changing the pitch, the speed, the dynamic range, adding noise, and forward and backward shift in time. It was found that the maximum accuracy obtained when using GFCC with CNN is 99.77 %. The outcome results of this work are compared to previous reports and indicate that CNN achieved better performance in AASR.

Keywords:
Convolutional neural network Pooling Pattern recognition (psychology) Feature extraction Feature (linguistics) Arabic Word (group theory) Mel-frequency cepstrum

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.38
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Organizational and Employee Performance
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

CONVOLUTIONAL NEURAL NETWORK FOR ARABIC SPEECH RECOGNITION

Engy AbdelmaksoudArafa HassenNabila Bintey HassanMohamed Hesham Farouk

Journal:   The Egyptian Journal of Language Engineering /The Egyptian Journal of Language Engineering Year: 2020 Vol: 8 (1)Pages: 27-38
JOURNAL ARTICLE

Arabic Sign Language Recognition and Generating Arabic Speech Using Convolutional Neural Network

M. M. Kamruzzaman

Journal:   Wireless Communications and Mobile Computing Year: 2020 Vol: 2020 Pages: 1-9
BOOK-CHAPTER

Arabic Phonemes Recognition Using Convolutional Neural Network

Irwan MazlinZan Azma NasruddinWan Adilah Wan AdnanFariza Hanis Abdul Razak

Communications in computer and information science Year: 2019 Pages: 262-271
© 2026 ScienceGate Book Chapters — All rights reserved.