Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data

Shinji Watanabe; Takaaki Hori; Atsushi Nakamura

doi:10.21437/interspeech.2010-127

ScienceGate Book Chapters

JOURNAL ARTICLE

Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data

Shinji Watanabe Takaaki Hori Atsushi Nakamura

Year: 2010 Pages: 346-349

DOI: 10.21437/interspeech.2010-127

Get Full-Text PDF Get Analytical Report

Abstract

This paper describes a discriminative approach that further advances the framework for Weighted Finite State Transducer (WFST) based decoding. The approach introduces additional linear models for adjusting the scores of a decoding graph composed of conventional information source models (e.g., hidden Markov models and N-gram models), and reviews the WFSTbased decoding process as a linear classifier for structured data (e.g., sequential multiclass data). The difficulty with the approach is that the number of dimensions of the additional linear models becomes very large in proportion to the number of arcs in a WFST, and our previous study only applied it to a small task (TIMIT phoneme recognition). This paper proposes a training method for a large-scale linear classifier employed in WFSTbased decoding by using a distributed perceptron algorithm. The experimental results show that the proposed approach was successfully applied to a large vocabulary continuous speech recognition task, and achieved an improvement compared with the performance of the minimum phone error based discriminative training of acoustic models. Index Terms: speech recognition, weighted finite state transducer, linear classifier, distributed perceptron, large vocabulary continuous speech recognition

Keywords:

Discriminative model Hidden Markov model Computer science Classifier (UML) Speech recognition Decoding methods Pattern recognition (psychology) Vocabulary Artificial intelligence TIMIT Multilayer perceptron Perceptron Artificial neural network Algorithm

Metrics

Cited By

4.41

FWCI (Field Weighted Citation Impact)

Refs

0.95

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data

Abstract

Metrics

Citation History

Topics

Related Documents

Large vocabulary continuous speech recognition based on WFST structured classifiers and deep bottleneck features

Optimized large vocabulary WFST speech recognition system

A fully data parallel WFST-based large vocabulary continuous speech recognition on a graphics processing unit

WFST-based Large Vocabulary Continuous Speech Decoder for Service Robots

The Titech large vocabulary WFST speech recognition system