Abstract

Large vocabulary speech recognition systems traditionally represent words in terms of subword units, usually phonemes.This paper investigates the potential of graphemes acting as subunits.In order to develop context dependent grapheme based speech recognizers several decision tree based clustering procedures are performed and compared to each other.Grapheme based speech recognizers in three languages -English, German, and Spanish -are trained and compared to their phoneme based counterparts.The results show that for languages with a close grapheme-to-phoneme relation, grapheme based modeling is as good as the phoneme based one.Furthermore, multilingual grapheme based recognizers are designed to investigate whether grapheme based information can be successfully shared among languages.Finally, some bootstrapping experiments for Swedish were performed to test the potential for rapid language deployment.

Keywords:
Grapheme Computer science Bootstrapping (finance) Natural language processing Speech recognition Artificial intelligence German Context (archaeology) Vocabulary Linguistics

Metrics

127
Cited By
5.37
FWCI (Field Weighted Citation Impact)
19
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.