Autoregressive convolutional neural networks (CNNs) have been widely exploited for sequence generation tasks such as audio synthesis, language modeling, and neural machine translation. WaveNet is a deep autoregressive CNN composed of several stacked layers of dilated convolution that is used for sequence generation. While WaveNet produces state-of-the-art audio generation results, the naive inference implementation is quite slow; it takes a few minutes to generate just one second of audio on a high-end GPU. In this work, we develop the first accelerator platform~\textit{FastWave} for autoregressive convolutional neural networks, and address the associated design challenges. We design the Fast-Wavenet inference model in Vivado HLS and perform a wide range of optimizations including fixed-point implementation, array partitioning, and pipelining. Our model uses a fully parameterized parallel architecture for fast matrix-vector multiplication that enables per-layer customized latency fine-tuning for further throughput improvement. Our experiments comparatively assess the trade-off between throughput and resource utilization for various optimizations. Our best WaveNet design on the Xilinx XCVU13P FPGA, which uses only on-chip memory, achieves 66$\times$ faster generation speed than a CPU implementation and 11$\times$ faster generation speed than a GPU implementation.
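The core optimizations named above (array partitioning plus a pipelined, unrolled matrix-vector multiply) can be sketched in HLS-style C++. This is a minimal illustration, not the paper's actual kernel: the dimensions, pragma placement, and function name `matvec` are all hypothetical, and fixed-point types are replaced with `float` for brevity.

```cpp
#include <cassert>

// Hypothetical layer dimensions; the paper's per-layer sizes are not
// given in the abstract.
constexpr int ROWS = 4;
constexpr int COLS = 4;

// HLS-style matrix-vector multiply: partitioning W along its columns
// and x completely lets the unrolled inner loop read all operands in
// one cycle, so the pipelined outer loop can emit one output per cycle
// once the pipeline fills. Pragmas are shown as comments so the sketch
// also compiles as plain C++.
void matvec(const float W[ROWS][COLS], const float x[COLS],
            float y[ROWS]) {
// #pragma HLS ARRAY_PARTITION variable=W complete dim=2
// #pragma HLS ARRAY_PARTITION variable=x complete
ROW_LOOP:
    for (int r = 0; r < ROWS; ++r) {
// #pragma HLS PIPELINE II=1
        float acc = 0.0f;
    COL_LOOP:
        for (int c = 0; c < COLS; ++c) {
// #pragma HLS UNROLL
            acc += W[r][c] * x[c];
        }
        y[r] = acc;
    }
}
```

In an actual Vivado HLS flow, the unroll factor of the inner loop is the per-layer parallelism knob the abstract alludes to: larger factors trade DSP and BRAM usage for lower latency on that layer.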