Phoneme based Kannada Speech Corpus for Automatic Speech Recognition System

N Praveen; Shashidhar Kini

doi:10.1109/icdcece53908.2022.9793010

ScienceGate Book Chapters

JOURNAL ARTICLE

Phoneme based Kannada Speech Corpus for Automatic Speech Recognition System

N Praveen Shashidhar Kini

Year: 2022 Journal: 2022 IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE) Pages: 1-5

DOI: 10.1109/icdcece53908.2022.9793010

Get Full-Text PDF Get Analytical Report

Abstract

Automatic speech recognition is a challenging task in languages which are not having many resources for research. The state-of-the-art technologies like Deep Neural Network approach require a standard dataset for training and testing the approach for a particular language. In this paper, development of a novel dataset for Kannada speech corpus is described. The speech dataset developed is useful for implementation of Automatic Kannada speech recognition system. The speech data for this dataset is crowd-sourced by using a website developed for the purpose. Since the data collection process is active in the internet the size of the dataset grows as more number of people contributes their voice. The system asks the user to enter their details when they open it for the first time. The user-data collected here is helpful to categorize the data in the dataset based on different parameters, which is the requirement for some types of ASR system. The phonetic representation of the word is stored in the database along with the numeric representation which optimizes the processing of speech to text conversion.

Keywords:

Computer science Speech recognition Speech corpus Artificial intelligence Task (project management) Natural language processing Audio mining Voice activity detection Speech processing Speech analytics Process (computing) Kannada Speech synthesis

Metrics

Cited By

0.71

FWCI (Field Weighted Citation Impact)

Refs

0.67

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Phoneme based Kannada Speech Corpus for Automatic Speech Recognition System

Abstract

Metrics

Citation History

Topics

Related Documents

Development of Kannada Speech Corpus for Continuous Speech Recognition

Phoneme-grapheme based speech recognition system

MLP based phoneme detectors for Automatic Speech Recognition

Phoneme based speech recognition

Automatic Isolated Kannada Speech Recognition System under Degraded Conditions