Creation and Instigation of Triphone based Big-Lexicon Speaker-Independent Continuous Speech Recognition Framework for Kannada Language

Praveen Kumar; Research Scholar; S Ouahabi; M Atounti; M Bellouki; A Madhavraj; A Ramakrishna; S Sinha; S Agrawal; A Jain; C Dugast; L Devillers; X Aubert; A Shrestha; A Mahmood; D Dimitriadis; E Bocchieri; P Praveen Kumar; G Yadava; H Jayanna; J Guglani; A Mishra; M Kalamani; M Krishnamorti; R Valarmati; P Upadyaya; O Faroq; M Abidi; Y Varshney; R Sharma; S Paladugu; M Al Amin; M Islam; S Kibria; M Rahman; D Povey; L Burget; M Agarwal; P Akyazi; K Feng; A Ghoshal; O Glembek; N Goel; M Karafit; A Rastrow; C Manasa; K Priya; D Gupta; S Young

doi:10.35940/ijitee.b1090.1292s19

JOURNAL ARTICLE

Creation and Instigation of Triphone based Big-Lexicon Speaker-Independent Continuous Speech Recognition Framework for Kannada Language

Praveen Kumar Research Scholar S Ouahabi M Atounti M Bellouki A Madhavraj A Ramakrishna S Sinha S Agrawal A Jain C Dugast L Devillers X Aubert A Shrestha A Mahmood D Dimitriadis E Bocchieri P Praveen Kumar G Yadava H Jayanna J Guglani A Mishra M Kalamani M Krishnamorti R Valarmati P Upadyaya O Faroq M Abidi Y Varshney R Sharma S Paladugu K Priya D Gupta M Al Amin M Islam S Kibria M Rahman D Povey L Burget M Agarwal P Akyazi K Feng A Ghoshal O Glembek N Goel M Karafit A Rastrow C Manasa K Priya D Gupta S Young

Year: 2019 Journal: International Journal of Innovative Technology and Exploring Engineering Vol: 9 (2S)Pages: 152-158 Publisher: Blue Eyes Intelligence Engineering and Sciences Publication

DOI: 10.35940/ijitee.b1090.1292s19

Get Full-Text PDF Get Analytical Report

Abstract

This paper proposes a framework that is intended to do the comparably accurate recognition of speech and in precise, continuous speech recognition (CSR) based on triphone modelling for Kannada dialect. For designing the proposed framework, the features from the speech data are obtained from the well-known feature extraction technique Mel-frequency cepstral coefficients (MFCC) and from its transformations, like, linear discriminant analysis (LDA) and maximum likelihood linear transforms (MLLT) are obtained from Kannada speech data files. At that point, the system is trained to evaluate the hidden Markov model (HMM) parameters for continuous speech (CS) data. The persistent Kannada speech information is gathered from 2600 speakers (1560 men and 1040women) of the age bunch in the scope of 14 years-80 years. The speech information is acquired from different geographical regions of the Karnataka (one of the 29 states situated in the southern part of India) state under degraded condition. It comprises of 21,551 words that spread 30 locales. The performance evaluation of both monophone and triphone models concerning word error rate (WER) is done and the obtained results are compared with the standard databases such as TIMIT and aurora4. A significant reduction in WER is obtained for triphone models. The speech recognition (SR) rate is verified for both offline and online recognition mode for all the speakers. The results reveal that the recognition rate (RR) for Kannada speech corpus has got a better improvement over the state-of-the-art existing databases.

Keywords:

Speech recognition Hidden Markov model Computer science Mel-frequency cepstrum Word error rate Kannada Lexicon Linear discriminant analysis Artificial intelligence Cepstrum Speech corpus Feature extraction Natural language processing Speech synthesis

Metrics

Cited By

0.16

FWCI (Field Weighted Citation Impact)

Refs

0.51

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Data Compression Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Creation and Instigation of Triphone based Big-Lexicon Speaker-Independent Continuous Speech Recognition Framework for Kannada Language

Abstract

Metrics

Citation History

Topics

Related Documents

Continuous Speech Recognition of Kannada language using triphone modeling

Continuous Speech Recognition System for Kannada Language with Triphone Modelling using HTK

Triphone Model Based Novel Kannada Continuous Speech Recognition System using Kaldi Tool

Development of Speaker-Independent Automatic Speech Recognition System for Kannada Language

Speaker Dependent Continuous Kannada Speech Recognition Using HMM