Large vocabulary continuous speech recognition using HTK

Philip C. Woodland; J. J. Odell; V. Valtchev; S.J. Young

doi:10.1109/icassp.1994.389562

ScienceGate Book Chapters

JOURNAL ARTICLE

Large vocabulary continuous speech recognition using HTK

Philip C. Woodland J. J. Odell V. Valtchev S.J. Young

Year: 2002 Vol: ii Pages: II/125-II/128

DOI: 10.1109/icassp.1994.389562

Get Full-Text PDF Get Analytical Report

Abstract

HTK is a portable software toolkit for building speech recognition systems using continuous density hidden Markov models developed by the Cambridge University Speech Group. One particularly successful type of system uses mixture density tied-state triphones. We have used this technique for the 5 k/20 k word ARPA Wall Street Journal (WSJ) task. We have extended our approach from using word-internal gender independent modelling to use decision tree based state clustering, cross-word triphones and gender dependent models. Our current systems can be run with either bigram or trigram language models using a single pass dynamic network decoder. Systems based on these techniques were included in the November 1993 ARPA WSJ evaluation, and gave the lowest error rate reported on the 5 k word bigram, 5 k word trigram and 20 k word bigram "hub" tests and the second lowest error rate on the 20 k word trigram "hub" test.< >

Keywords:

Bigram Trigram Computer science Word (group theory) Word error rate Speech recognition Vocabulary Hidden Markov model Artificial intelligence Natural language processing Mathematics

Metrics

248

Cited By

17.46

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and dialogue systems

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Large vocabulary continuous speech recognition using HTK

Abstract

Metrics

Citation History

Topics

Related Documents

Large vocabulary continuous speech recognition using word graphs

Vietnamese large vocabulary continuous speech recognition

Vietnamese large vocabulary continuous speech recognition

Korean large vocabulary continuous speech recognition using pseudomorpheme units

Large-vocabulary speaker-independent continuous speech recognition using HMM