JOURNAL ARTICLE

Language identification through large vocabulary continuous speech recognition

Abstract

In recent years, automatic language identification has become an increasingly important component in practical spoken language systems, and much attention has been devoted to various competing approaches. In this paper, we are concerned with the automatic identification of languages that may be highly similar in nature, such as the various dialects of Chinese. Our approach differs from many recent successful systems by exploiting a fusion of feature scores readily available from a large vocabulary speech recognition system. We show that such features are able to distinguish among the similar sounding dialects of Chinese, and experiments on a nine language corpus show promising performance on a three way identification task.

Keywords:
Computer science Identification (biology) Vocabulary Natural language processing Language identification Task (project management) Feature (linguistics) Component (thermodynamics) Artificial intelligence Speech recognition Spoken language Natural language Linguistics

Metrics

3
Cited By
0.38
FWCI (Field Weighted Citation Impact)
13
Refs
0.72
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Phonetics and Phonology Research
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.