Abstract

This paper describes preliminary results of automatic recognition of Korean broadcast-news speech. We have been working on flexible vocabulary isolated-word speech recognition, and the same HMM models are used for broadcast-news continuous speech recognition. The recognizer is trained by using phonetically balanced isolated words speech, rather than the broadcast news speech itself. In this research, we use several different lexica to investigate the recognition performance according to the length of the words. We also propose a long-distance bigram language model, which can be used at the first stage of the search, so that it can reduce the recognition errors caused by earlier pruning of correct hypothesis.

Keywords:
Bigram Computer science Speech recognition Hidden Markov model Vocabulary Artificial intelligence Pruning Word (group theory) Language model Natural language processing Speech analytics Word error rate Acoustic model Speech processing Trigram Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
3
Refs
0.02
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Automatic speech summarization applied to English broadcast news speech

Chiori HoriSadaoki FuruiRob MalkinHua YuAlex Waibel

Journal:   IEEE International Conference on Acoustics Speech and Signal Processing Year: 2002 Pages: I-9
JOURNAL ARTICLE

Automatic speech summarization applied to English broadcast news speech

HoriFuruiMalkinHua YuWaibel

Journal:   IEEE International Conference on Acoustics Speech and Signal Processing Year: 2002 Pages: I-I
© 2026 ScienceGate Book Chapters — All rights reserved.