JOURNAL ARTICLE

A stochastic segment model for phoneme-based continuous speech recognition

Abstract

Developing accurate and robust phonetic models for the different speech sounds is a major challenge for high performance continuous speech recognition. In this paper, we introduce a new approach, called the stochastic segment model, for modelling a variable-length phonetic segment X, an L-long sequence of feature vectors. The stochastic segment model consists of 1) time-warping the variable-length segment X into a fixed-length segment Y called a resampled segment, and 2) a joint density function of the parameters of the resampled segment Y, which in this work is assumed Gaussian. In this paper, we describe the stochastic segment model, the recognition algorithm, and the iterative training algorithm for estimating segment models from continuous speech. For speaker-dependent continuous speech recognition, the segment model reduces the word error rate by one third over a hidden Markov phonetic model.
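The two components named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the warping rule, so linear-in-time nearest-frame selection is assumed here, and a single full-covariance Gaussian over the stacked frames of Y is assumed as well. The names `resample_segment`, `gaussian_log_density`, `segment_log_score`, and the resampled length `m` are hypothetical.

```python
import numpy as np

def resample_segment(X, m):
    """Warp an (L, d) segment X into a fixed-length (m, d) segment Y.

    Assumption: linear-in-time sampling that picks, for each of the m
    output positions, the input frame whose interval contains its centre.
    """
    X = np.asarray(X, dtype=float)
    L = X.shape[0]
    idx = np.floor((np.arange(m) + 0.5) * L / m).astype(int)
    return X[idx]

def gaussian_log_density(y, mean, cov):
    """Log density of a multivariate Gaussian evaluated at vector y."""
    diff = y - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)  # squared Mahalanobis distance
    return -0.5 * (y.size * np.log(2.0 * np.pi) + logdet + maha)

def segment_log_score(X, m, mean, cov):
    """Score a variable-length segment under one phone's segment model.

    mean has shape (m * d,) and cov has shape (m * d, m * d): the joint
    Gaussian is over all parameters of the resampled segment Y.
    """
    Y = resample_segment(X, m)
    return gaussian_log_density(Y.reshape(-1), mean, cov)
```

In such a setup a recognizer would compare `segment_log_score` across candidate phone models for a hypothesized segment; how segment boundaries are searched and how the models are trained from continuous speech are described in the paper itself and are not reproduced here.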

Keywords:
Hidden Markov model, Speech recognition, Computer science, Dynamic time warping, Pattern recognition (psychology), Feature (linguistics), Variable (mathematics), Image warping, Acoustic model, Word error rate, Artificial intelligence, Speech processing, Algorithm, Mathematics

Metrics

Cited By: 17
FWCI (Field Weighted Citation Impact): 0.38
Refs: 10
Citation Normalized Percentile: 0.71

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

A stochastic segment model for phoneme-based continuous speech recognition

Mari Ostendorf, Salim Roukos

Journal: IEEE Transactions on Acoustics, Speech, and Signal Processing; Year: 1989; Vol: 37 (12); Pages: 1857-1869
JOURNAL ARTICLE

Speech recognition using stochastic phonemic segment model based on phoneme segmentation

Chieko Furuichi, Katsura Aizawa, Inoue Kazuhiko

Journal: Systems and Computers in Japan; Year: 2000; Vol: 31 (10); Pages: 89-98