JOURNAL ARTICLE

Integrated pronunciation learning for automatic speech recognition using probabilistic lexical modeling

Abstract

Standard automatic speech recognition (ASR) systems use phoneme-based pronunciation lexicon prepared by linguistic experts. When the hand crafted pronunciations fail to cover the vocabulary of a new domain, a grapheme-to-phoneme (G2P) converter is used to extract pronunciations for new words and then a phoneme- based ASR system is trained. G2P converters are typically trained only on the existing lexicons. In this paper, we propose a grapheme-based ASR approach in the framework of probabilistic lexical modeling that integrates pronunciation learning as a stage in ASR system training, and exploits both acoustic and lexical resources (not necessarily from the domain or language of interest). The proposed approach is evaluated on four lexical resource constrained ASR tasks and compared with the conventional two stage approach where G2P training is followed by ASR system development.

Keywords:
Pronunciation Computer science Grapheme Lexicon Artificial intelligence Natural language processing Speech recognition Vocabulary Probabilistic logic Domain (mathematical analysis) Linguistics

Metrics

4
Cited By
0.63
FWCI (Field Weighted Citation Impact)
28
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

PRONUNCIATION LEARNING USING AUTOMATIC SPEECH RECOGNITION

Rohit SharmaAmey MenonSahil KhairnarSanket BhagatShubham Rawat

Journal:   EPRA International Journal of Multidisciplinary Research (IJMR) Year: 2024 Pages: 565-568
JOURNAL ARTICLE

Automatic speech recognition and pronunciation learning

Shannon McCrocklinJohn M. Levis

Journal:   Language Teaching Year: 2025 Pages: 1-17
JOURNAL ARTICLE

Articulatory feature based continuous speech recognition using probabilistic lexical modeling

Ramya RasipuramMathew Magimai.-Doss

Journal:   Computer Speech & Language Year: 2015 Vol: 36 Pages: 233-259
© 2026 ScienceGate Book Chapters — All rights reserved.