JOURNAL ARTICLE

The SSI large-vocabulary speaker-independent continuous speech recognition system

Abstract

The Speech Systems Incorporated (SSI) commercial, large-vocabulary, speaker-independent, continuous speech recognition system is described. The system utilizes a novel approach to speech representation: a two-stage encoding of speech, with an intervening compression of acoustic frames (segmentation) between the encoding stages, and a linguistic decoding process suitable for large, variable-duration segments. Binary decision trees trained using the maximum mutual information (MMI) criterion serve as encoders. The features used in encoding are listed, and their ability to discriminate the phonetic content of the speech is analyzed. Recognition results are given for a speaker-independent continuous speech, grammar-constrained radiology reporting product, and for an isolated-word grammar of high perplexity.< >

Keywords:
Speech recognition Computer science Perplexity Audio mining Vocabulary Encoding (memory) Speech coding Voice activity detection Speaker recognition Artificial intelligence Encoder Linear predictive coding Grammar Natural language processing Speech processing Linguistics Language model

Metrics

4
Cited By
0.44
FWCI (Field Weighted Citation Impact)
8
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.