Korean large vocabulary continuous speech recognition using pseudomorpheme units

Oh‐Wook Kwon; Kyuwoong Hwang; Jun Park

doi:10.21437/eurospeech.1999-124

ScienceGate Book Chapters

JOURNAL ARTICLE

Korean large vocabulary continuous speech recognition using pseudomorpheme units

Oh‐Wook Kwon Kyuwoong Hwang Jun Park

Year: 1999 Pages: 483-486

DOI: 10.21437/eurospeech.1999-124

Get Full-Text PDF Get Analytical Report

Abstract

This paper presents a Korean large vocabulary continuous speech recognition system based on pseudomorpheme units. In Korean, an eojeol (word phrase) is a unit for spacing and a morpheme is the smallest unit with semantic meaning. If the eojeol is used as the dictionary and language modeling unit, the number of the unit becomes enormous. Instead we propose to use modified morpheme or pseudomorpheme as the basic recognition unit. We can recover the original eojeol by concatenating graphemes of pseudomorpheme components. We used a dictionary and language model with pseudomorpheme/part-ofspeech entries where each entry can have multiple pronunciations according to the morphology rule. With 32k-word vocabulary, the speaker-independent character, pseudomorpheme, and eojeol recognition accuracies on economy article database were 90.8%, 84.5%, and 81.3%, respectively.

Keywords:

Morpheme Computer science Vocabulary Phrase Speech recognition Natural language processing Artificial intelligence Word (group theory) Language model Character (mathematics) Unit (ring theory) Linguistics Mathematics

Metrics

Cited By

1.20

FWCI (Field Weighted Citation Impact)

Refs

0.81

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Phonetics and Phonology Research

Social Sciences → Psychology → Experimental and Cognitive Psychology

Korean large vocabulary continuous speech recognition using pseudomorpheme units

Abstract

Metrics

Citation History

Topics

Related Documents

Korean large vocabulary continuous speech recognition with morpheme-based recognition units

Large vocabulary speech recognition using subword units

Large vocabulary Korean continuous speech recognition using a one-pass algorithm

Large vocabulary continuous speech recognition using HTK

Large vocabulary continuous speech recognition using word graphs