This paper introduces an automated speech recognition system for the Amazigh language, built on a deep learning model based on a convolutional neural network (CNN) with features extracted from spectrograms. The research focuses on recognizing 18 isolated words from a dataset of 2,000 audio files collected from native Amazigh speakers in Morocco's Rif region. To represent the speech signal, the system employs spectrograms that plot time on the x-axis and frequency on the y-axis, with amplitude encoded as the intensity value at each position. These spectrograms serve as input to the deep CNN: a 1D convolutional network of eight layers used for feature learning and recognition. The model extracts discriminative features from the spectrograms and outputs predictions over the eighteen classes. The proposed CNN achieves an accuracy of 94.77%, which highlights the effectiveness of this approach for automatic speech recognition of isolated words in the Amazigh language.
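The spectrogram representation described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the frame length, hop size, and window function are illustrative assumptions (25 ms frames with a 10 ms hop at 16 kHz), since the abstract does not specify the STFT parameters used.

```python
import numpy as np

def spectrogram(signal, frame_len=400, hop=160):
    """Magnitude spectrogram: rows index time, columns index frequency,
    and each cell's value is the amplitude at that (time, frequency) point.

    frame_len/hop correspond to 25 ms / 10 ms at 16 kHz; these are
    assumed values for illustration only.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping windowed frames
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    # One-sided FFT: frame_len // 2 + 1 frequency bins per frame
    return np.abs(np.fft.rfft(frames, axis=1))

# Example: 1 second of a 440 Hz tone sampled at 16 kHz
sr = 16000
t = np.arange(sr) / sr
sig = np.sin(2 * np.pi * 440 * t)
S = spectrogram(sig)
print(S.shape)  # (time frames, frequency bins)
```

In a pipeline like the one the paper describes, each audio file would be converted to such a time-frequency matrix and passed to the CNN, which learns discriminative patterns from it rather than from the raw waveform.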
Fatima Barkani, Hassan Satori, Mohamed Hamidi, Ouissam Zealouk, Naouar Laaidi