JOURNAL ARTICLE

Rapid acoustic model development using Gaussian mixture clustering and language adaptation

Abstract

This work presents techniques for improved cross-language transfer of speech recognition systems to new, previously undeveloped, languages. Such techniques are particularly useful for target languages where minimal amounts of training data are available. We describe a novel method to produce a language-independent system by combining acoustic models from a number of source languages. This intermediate language-independent acoustic model is used to bootstrap a target-language system by applying language adaptation. For our experiments we use acoustic models of seven source languages to develop a target Greek acoustic model. We show that our technique significantly outperforms a system trained from scratch when less than 8 hours of read speech is available.

Keywords:
Computer science Adaptation (eye) Language model Cache language model Acoustic model Cluster analysis Speech recognition Natural language processing Artificial intelligence Scratch Speech processing Natural language Universal Networking Language Programming language

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
8
Refs
0.02
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Language identification using Gaussian mixture model tokenization

Pedro A. Torres‐CarrasquilloDouglas A. ReynoldsJ.R. Deller

Journal:   IEEE International Conference on Acoustics Speech and Signal Processing Year: 2002 Pages: I-757
JOURNAL ARTICLE

Language identification using Gaussian mixture model tokenization

Torres-CarrasquilloReynoldsDeller

Journal:   IEEE International Conference on Acoustics Speech and Signal Processing Year: 2002 Pages: I-I
© 2026 ScienceGate Book Chapters — All rights reserved.