JOURNAL ARTICLE

Syllable-based large vocabulary continuous speech recognition

Aravind GanapathirajuJ. HamakerJ. PiconeM. OrdowskiGeorge R. Doddington

Year: 2001 Journal:   IEEE Transactions on Speech and Audio Processing Vol: 9 (4)Pages: 358-366   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Most large vocabulary continuous speech recognition (LVCSR) systems in the past decade have used a context-dependent (CD) phone as the fundamental acoustic unit. We present one of the first robust LVCSR systems that uses a syllable-level acoustic unit for LVCSR on telephone-bandwidth speech. This effort is motivated by the inherent limitations in phone-based approaches-namely the lack of an easy and efficient way for modeling long-term temporal dependencies. A syllable unit spans a longer time frame, typically three phones, thereby offering a more parsimonious framework for modeling pronunciation variation in spontaneous speech. We present encouraging results which show that a syllable-based system exceeds the performance of a comparable triphone system both in terms of word error rate (WER) and complexity. The WER of the best syllabic system reported here is 49.1% on a standard Switchboard evaluation, a small improvement over the triphone system. We also report results on a much smaller recognition task, OGI Alphadigits, which was used to validate some of the benefits syllables offer over triphones. The syllable-based system exceeds the performance of the triphone system by nearly 20%, an impressive accomplishment since the alphadigits application consists mostly of phone-level minimal pair distinctions.

Keywords:
Computer science Speech recognition Syllable Vocabulary Word error rate Phone Pronunciation Syllabic verse Context (archaeology) Task (project management) Artificial intelligence Natural language processing Linguistics

Metrics

124
Cited By
4.83
FWCI (Field Weighted Citation Impact)
19
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Phonetics and Phonology Research
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

BOOK-CHAPTER

Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish

Piotr Majewski

Lecture notes in computer science Year: 2008 Pages: 397-401
JOURNAL ARTICLE

Continuous speech recognition using large vocabulary word spotting and CV syllable spotting

Noboru Sugamura

Journal:   International Conference on Acoustics, Speech, and Signal Processing Year: 2002 Vol: s89 86 Pages: 121-124
JOURNAL ARTICLE

Syllable-Phoneme based Continuous Speech Recognition

Sigita LaurinčiukaitėAntanas Lipeika

Journal:   Elektronika ir Elektrotechnika Year: 2006 Vol: 70 (6)Pages: 91-94
© 2026 ScienceGate Book Chapters — All rights reserved.