Optimized large vocabulary WFST speech recognition system

Yuhong Guo; Li Ta; Yujing Si; Jielin Pan; Yonghong Yan

doi:10.1109/fskd.2012.6234200

ScienceGate Book Chapters

JOURNAL ARTICLE

Optimized large vocabulary WFST speech recognition system

Yuhong Guo Li Ta Yujing Si Jielin Pan Yonghong Yan

Year: 2012 Vol: 2008 Pages: 1243-1247

DOI: 10.1109/fskd.2012.6234200

Get Full-Text PDF Get Analytical Report

Abstract

Speech recognition decoder is an important part of large vocabulary speech recognition application. The speed and the accuracy is the main concern of its application. Recently, weighted finite state transducers (WFST) has become the dominant description of decoding network. However, the large memory and time cost of constructing the final WFST decoding network is the bottleneck of this technique. The goal of this article is to construct a tight, flexible WFST decoding network as well as a fast, scalable decoder. A tight representation of silence in speech is proposed and the decoding algorithm with improved pruning strategies is also suggested. The experimental results show that the proposed network presentation will cut off 37% memory cost and 19% time cost of constructing the final decoding network. And with the decoding strategies of WFST feature specified beams the proposed decoder's efficiency and accuracy are also significantly improved.

Keywords:

Decoding methods Computer science Pruning Vocabulary Bottleneck Feature (linguistics) Speech recognition Scalability Construct (python library) Artificial intelligence Speech coding Pattern recognition (psychology) Algorithm

Metrics

Cited By

1.14

FWCI (Field Weighted Citation Impact)

Refs

0.82

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Algorithms and Data Compression

Physical Sciences → Computer Science → Artificial Intelligence

Network Packet Processing and Optimization

Physical Sciences → Computer Science → Hardware and Architecture

Optimized large vocabulary WFST speech recognition system

Abstract

Metrics

Citation History

Topics

Related Documents

The Titech large vocabulary WFST speech recognition system

A tail-sharing WFST composition algorithm for large vocabulary speech recognition

Vocabulary Expansion for the Sub-word WFST-Based Automatic Speech Recognition System

Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data

Large vocabulary continuous speech recognition based on WFST structured classifiers and deep bottleneck features