Statistical Parametric Speech Synthesis for Punjabi Language using Deep Neural Network

Harman Preet Singh; Parminder Singh; Manjot Kaur Gill

doi:10.52458/978-93-91842-08-6-41

ScienceGate Book Chapters

BOOK-CHAPTER

Statistical Parametric Speech Synthesis for Punjabi Language using Deep Neural Network

Harman Preet Singh Parminder Singh Manjot Kaur Gill

Year: 2021 Soft Computing Research Society eBooks Pages: 431-441

DOI: 10.52458/978-93-91842-08-6-41

Get Full-Text PDF Get Analytical Report

Abstract

In recent years, speech technology gets very advanced, due to which speech synthesis becomes an interesting area of study for researchers. Text-To-Speech (TTS) system generates the speech from the text by using a synthesized technique like concatenative, formant, articulatory, Statistical Parametric Speech Synthesis (SPSS) etc. The Deep Neural Network (DNN) based SPSS for the Punjabi language is used in this research work. The database used for this research works contains 674 audio files and a single text file containing 674 sentences. This database was created at the Language Technologies Institute at Carnegie Mellon University (CMU) provided under Festvox distribution. Ossian toolkit is used as a front-end for text processing. The two DNNs are modeled using the merlin toolkit. The duration DNN maps the linguistic and duration features of speech. The acoustic DNN maps the linguistic and acoustic features. The subjective evaluation using the Mean Opinion Score (MOS) shows that this TTS system has good quality of naturalness that is 80.2%.

Keywords:

Computer science Speech synthesis Naturalness Speech recognition Speech corpus Parametric statistics Duration (music) Mean opinion score Artificial neural network Formant Natural language processing Artificial intelligence Acoustics Engineering Mathematics Vowel

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.40

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Statistical Parametric Speech Synthesis for Punjabi Language using Deep Neural Network

Abstract

Metrics

Topics

Related Documents

Speech Enhancement for Punjabi Language Using Deep Neural Network

Text to Speech Synthesis System for Punjabi language using Statistical Parametric Speech Synthesis Technique

Statistical parametric speech synthesis using deep neural networks

Spell Checker for Punjabi Language Using Deep Neural Network

Statistical parametric speech synthesis using weighted multi-distribution deep belief network