JOURNAL ARTICLE

Speaker adaptation for large vocabulary speech recognition systems using speaker Markov models

Gerhard Rigoll

Year: 1989 Journal:   International Conference on Acoustics, Speech, and Signal Processing Pages: 5-8 vol.1

Abstract

An alternative approach to speaker adaptation for a large-vocabulary hidden-Markov-model-based speech recognition system is described. The goal of this investigation was to train the IBM speech recognition system with only five minutes of speech data from a new speaker instead of the usual 20 minutes without the recognition rate dropping by more than 1-2%. The approach is based on the use of a stochastic model representing the different properties of the new speaker and an old speaker for which the full training set of 20 minutes is available. It is called a speaker Markov model. It is shown how the parameters of such a model can be derived and how it can be used for transforming the training set of the old speaker in order to use it in addition to the short training set of the new speaker. The adaptation algorithm was tested with 12 speakers. The average recognition rate dropped from 96.4% to 95.2% for a 5000-word vocabulary task. The decoding time increased by a factor of 1.35; this factor is often 3-5 if other adaptation algorithms are used.< >

Keywords:
Speech recognition Hidden Markov model Computer science Vocabulary Speaker recognition Speaker diarisation Adaptation (eye) Set (abstract data type) Task (project management) Markov model Artificial intelligence Decoding methods Word error rate Word (group theory) Natural language processing Markov chain Linguistics Machine learning Algorithm Engineering

Metrics

19
Cited By
9.32
FWCI (Field Weighted Citation Impact)
8
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Data Compression Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.