Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition

Kyong-Nim Lee; Minhwa Chung

doi:10.21437/interspeech.2004-576

ScienceGate Book Chapters

JOURNAL ARTICLE

Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition

Kyong-Nim Lee Minhwa Chung

Year: 2004 Pages: 1537-1540

DOI: 10.21437/interspeech.2004-576

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, we describe a pronunciation lexicon model which is especially useful for constructing morpheme-based pronunciation lexicon to improve the performance of a Korean LVCSR. There are a lot of pronunciation variations occurring at morpheme boundaries in continuous speech. For modeling of cross-morpheme pronunciation variations, we usually used a context-dependent multiple pronunciation lexicon with possible multiple phonetic transcriptions for each word. Since phonemic context together with morphological category and morpheme boundary information affect Korean pronunciation variations, we have distinguished phonological rules that can be applied to phonemes in withinmorpheme and cross-morpheme. However, pronunciation variations in morpheme boundaries are increasing the lexicon size; we have designed the optimized pronunciation lexicon which is decreasing the confusability and increasing pronunciation coverage. The results of Korean Broadcast News Transcription experiments show that a reduction of 18% in pronunciation lexicon size and an absolute reduction of 0.27% in WER from the same lexical entries were achieved by building a proposed pronunciation lexicon.

Keywords:

Pronunciation Lexicon Morpheme Computer science Context (archaeology) Speech recognition Natural language processing Artificial intelligence Linguistics Vocabulary History

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.03

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Phonetics and Phonology Research

Social Sciences → Psychology → Experimental and Cognitive Psychology

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition

Abstract

Metrics

Citation History

Topics

Related Documents

Modeling cross-morpheme pronunciation variations for korean large vocabulary continuous speech recognition

Morpheme-Based Modeling of Pronunciation Variation for Large Vocabulary Continuous Speech Recognition in Korean

Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition

Pronunciation modeling for large vocabulary conversational speech recognition

Hybrid pronunciation modeling for Arabic large vocabulary speech recognition