A fully data parallel WFST-based large vocabulary continuous speech recognition on a graphics processing unit

Jike Chong; Ekaterina Gonina; Youngmin Yi; Kurt Keutzer

doi:10.21437/interspeech.2009-343

ScienceGate Book Chapters

JOURNAL ARTICLE

A fully data parallel WFST-based large vocabulary continuous speech recognition on a graphics processing unit

Jike Chong Ekaterina Gonina Youngmin Yi Kurt Keutzer

Year: 2009 Pages: 1183-1186

DOI: 10.21437/interspeech.2009-343

Get Full-Text PDF Get Analytical Report

Abstract

Tremendous compute throughput is becoming available in personal desktop and laptop systems through the use of graphics processing units (GPUs). However, exploiting this resource requires re-architecting an application to fit a data parallel programming model. The complex graph traversal routines in the inference process for large vocabulary continuous speech recognition (LVCSR) have been considered by many as unsuitable for extensive parallelization. We explore and demonstrate a fully data parallel implementation of a speech inference engine on NVIDIA’s GTX280 GPU. Our implementation consists of two phases compute-intensive observation probability computation phase and communication-intensive graph traversal phase. We take advantage of dynamic elimination of redundant computation in the compute-intensive phase while maintaining close-to-peak execution efficiency. We also demonstrate the importance of exploring application-level trade-offs in the communication-intensive graph traversal phase to adapt the algorithm to data parallel execution on GPUs. On 3.1 hours of speech data set, we achieve more than 11× speedup compared to a highly optimized sequential implementation on Intel Core i7 without sacrificing accuracy.

Keywords:

Computer science Tree traversal Speedup Parallel computing Inference Graphics processing unit Graph traversal General-purpose computing on graphics processing units Graph CUDA Graphics Theoretical computer science Algorithm Artificial intelligence Operating system

Metrics

Cited By

7.24

FWCI (Field Weighted Citation Impact)

Refs

0.98

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

A fully data parallel WFST-based large vocabulary continuous speech recognition on a graphics processing unit

Abstract

Metrics

Citation History

Topics

Related Documents

H- and C-level WFST-based large vocabulary continuous speech recognition on Graphics Processing Units

Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data

A large vocabulary parallel processing continuous speech recognition system

Optimized large vocabulary WFST speech recognition system

WFST-based Large Vocabulary Continuous Speech Decoder for Service Robots