JOURNAL ARTICLE

Speech recognition using stochastic phonemic segment model based on phoneme segmentation

Abstract

This paper discusses speech recognition based on a new statistical phoneme segment model which is trained by phoneme parameters derived from automatically extracted phoneme segments. The proposed system operates as follows. In preprocessing before recognition, the phoneme boundaries are detected by segmentation. The phonemes are discriminated using a stochastic phoneme segment model, and a phoneme segment lattice with scores is constructed. Next the speech recognition is performed by matching of symbol sequences to dictionary items. The segmentation system that is employed can infer phoneme boundaries with high accuracy. This helps to eliminate unnecessary parameters, leaving the feature parameters which are effective in separating phonemes. In other words, the phoneme recognition problem in continuous speech can be reduced to a discrimination problem and thus a speaker-independent model can be constructed from a relatively small number of training data. The stochastic phoneme segment model is trained with training samples extracted from a phoneme-balanced word set of 4920 words uttered by 10 speakers. In a recognition experiment with 6709 words uttered by 63 nontraining speakers, a recognition rate of 92.6% was obtained as the average for all speakers, using a word dictionary of 212 words. © 2000 Scripta Technica, Syst Comp Jpn, 31(10): 89–98, 2000

Keywords:
Speech recognition Computer science Segmentation Preprocessor Speech segmentation Pattern recognition (psychology) Feature (linguistics) Word recognition Word (group theory) Artificial intelligence Matching (statistics) Word error rate Set (abstract data type) Natural language processing Mathematics Linguistics

Metrics

4
Cited By
0.00
FWCI (Field Weighted Citation Impact)
9
Refs
0.12
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

A stochastic segment model for phoneme-based continuous speech recognition

Mari OstendorfSalim Roukos

Journal:   IEEE Transactions on Acoustics Speech and Signal Processing Year: 1989 Vol: 37 (12)Pages: 1857-1869
JOURNAL ARTICLE

Speech/Non-Speech Segmentation Based on Phoneme Recognition Features

Janez ŽibertNikola PavešićFrance Mihelič

Journal:   EURASIP Journal on Advances in Signal Processing Year: 2006 Vol: 2006 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.