FPGA-Based Low-Power Speech Recognition with Recurrent Neural Networks

Minjae Lee; Kyuyeon Hwang; Jin-Hwan Park; Sung‐Wook Choi; Sungho Shin; Wonyong Sung

doi:10.1109/sips.2016.48

ScienceGate Book Chapters

JOURNAL ARTICLE

FPGA-Based Low-Power Speech Recognition with Recurrent Neural Networks

Minjae Lee Kyuyeon Hwang Jin-Hwan Park Sung‐Wook Choi Sungho Shin Wonyong Sung

Year: 2016 Pages: 230-235

DOI: 10.1109/sips.2016.48

Get Full-Text PDF Get Analytical Report

Abstract

In this paper, a neural network based real-time speech recognition (SR) system is developed using an FPGA for very low-power operation. The implemented system employs two recurrent neural networks (RNNs), one is a speech-tocharacter RNN for acoustic modeling (AM) and the other is for character-level language modeling (LM). The system also employs a statistical word-level LM to improve the recognition accuracy. The results of the AM, the character-level LM, and the word-level LM are combined using a fairly simple N-best search algorithm instead of the hidden Markov model (HMM) based network. The RNNs are implemented using massively parallel processing elements (PEs) for low latency and high throughput. The weights are quantized to 6 bits to store all of them in the on-chip memory of an FPGA. The proposed algorithm is implemented on a Xilinx XC7Z045, and the system can operate much faster than real-time.

Keywords:

Computer science Field-programmable gate array Recurrent neural network Hidden Markov model Speech recognition Artificial neural network Latency (audio) Language model Low latency (capital markets) Massively parallel Time delay neural network Artificial intelligence Pattern recognition (psychology) Parallel computing Computer hardware

Metrics

Cited By

10.99

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Neural Networks and Applications

Physical Sciences → Computer Science → Artificial Intelligence

FPGA-Based Low-Power Speech Recognition with Recurrent Neural Networks

Abstract

Metrics

Citation History

Topics

Related Documents

sEMG-based Gesture Recognition with Spiking Neural Networks on Low-power FPGA

sEMG-Based Gesture Recognition with Spiking Neural Networks on Low-Power FPGA

sEMG-based Gesture Recognition with Spiking Neural Networks on Low-power FPGA

Speech recognition with deep recurrent neural networks

Speech recognition with hierarchical recurrent neural networks