JOURNAL ARTICLE

Automatic pronunciation verification for speech recognition

Abstract

Pronunciations for words are a critical component in an automated speech recognition system (ASR) as mis-recognitions may be caused by missing or inaccurate pronunciations. The need for high quality pronunciations has recently motivated data-driven techniques to generate them [1]. We propose a data-driven and language-independent framework for verification of such pronunciations to further improve the lexicon quality in ASR. New candidate pronunciations are verified by re-recognizing historical audio logs and examining the associated recognition costs. We build an additional pronunciation quality feature from word and pronunciation frequencies in logs. A machine learned classifier trained on these features achieves nearly 90% accuracy in labeling good vs bad pronunciations across all languages we tested. New pronunciations verified as good may be added to a dictionary, while bad pronunciations may be discarded or sent to experts for further evaluation. We simultaneously verify 5,000 to 30,000 new pronunciations within a few hours and show improvements in the ASR performance as a result of including pronunciations verified by this system.

Keywords:
Pronunciation Computer science Lexicon Artificial intelligence Speech recognition Natural language processing Classifier (UML) Feature (linguistics) Training set Quality (philosophy) Linguistics

Metrics

2
Cited By
0.31
FWCI (Field Weighted Citation Impact)
14
Refs
0.78
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Automatic Speech Recognition and Pronunciation Training

Wenqi Xiao

Journal:   Proceedings of the 2018 2nd International Conference on Education, Economics and Management Research (ICEEMR 2018) Year: 2018
JOURNAL ARTICLE

PRONUNCIATION LEARNING USING AUTOMATIC SPEECH RECOGNITION

Rohit SharmaAmey MenonSahil KhairnarSanket BhagatShubham Rawat

Journal:   EPRA International Journal of Multidisciplinary Research (IJMR) Year: 2024 Pages: 565-568
JOURNAL ARTICLE

Automatic speech recognition and pronunciation learning

Shannon McCrocklinJohn M. Levis

Journal:   Language Teaching Year: 2025 Pages: 1-17
JOURNAL ARTICLE

Improving English pronunciation via automatic speech recognition technology

Minjia LiuXiujuan ChenYiling MoZejia ChenXiaobin LiuMeiting HanMeihui LiManfei Xu

Journal:   International Journal of Innovation and Learning Year: 2018 Vol: 25 (2)Pages: 126-126
© 2026 ScienceGate Book Chapters — All rights reserved.