JOURNAL ARTICLE

Language model adaptation using word clustering

Abstract

Building a stochastic language model (LM) for speech recognition requires a large corpus of target tasks. For some tasks no enough large corpus is available and this is an obstacle to achieving high recognition accuracy. In this paper, we propose a methodforbuildinganLMwithahigherpredictionpowerusing large corpora from different tasks rather than an LM estimated from a small corpus for a specific target task. In our experiment, weusedtranscriptionsofairuniversitylecturesandarticlesfrom Nikkei newspaper and compared an existing interpolation-based method and our new method. The results show that our new method reduces perplexity by 9.71%.

Keywords:
Perplexity Computer science Artificial intelligence Language model Task (project management) Cluster analysis Natural language processing Adaptation (eye) Speech recognition Word (group theory) Interpolation (computer graphics) Newspaper Obstacle Linguistics

Metrics

2
Cited By
0.38
FWCI (Field Weighted Citation Impact)
6
Refs
0.66
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Vari-gram language model based on word clustering

Lichi Yuan

Journal:   Journal of Central South University Year: 2012 Vol: 19 (4)Pages: 1057-1062
JOURNAL ARTICLE

A stochastic language model using dependency and its improvement by word clustering

Shinsuke MoriMakoto Nagao

Journal:   Proceedings of the 17th international conference on Computational linguistics - Year: 1998 Vol: 2 Pages: 898-904
© 2026 ScienceGate Book Chapters — All rights reserved.