JOURNAL ARTICLE

PRArch: Pattern-Based Reconfigurable Architecture for Deep Neural Network Acceleration

Abstract

Quantization is now widely used for Deep Neural Network (DNN) inference acceleration. While mixed-precision quantization achieve better compression rate as well as better accuracy compared to fixed-precision quantization, it is non-trivial and costly to make hardware accelerator like systolic array to support mixed-precision. In this paper, we propose a Pattern-based Mixed-precision Quantization algorithm, namely PMQ, to transform mixed-precision kernels into fix-precision which is hardware friendly, and we further observe the pattern-based sparsity existing in the high parts of transformed kernels, leading to a novel aggregated sparse kernel convolution. Based on the PMQ, we propose an accelerator PRArch supporting mixed-precision convolution neural networks using fix-precision systolic array with minimal overhead. Experiments on several typical convolution networks show a speedup of 1.86x in average compared to coarse-grained quantization accelerator Eyeriss using the same computation chip area, and the accuracy drop less than 1% without fine-tuning.

Keywords:
Quantization (signal processing) Computer science Speedup Acceleration Kernel (algebra) Hardware acceleration Algorithm Artificial neural network Convolution (computer science) Inference Convolutional neural network Overhead (engineering) Parallel computing Computation Computer engineering Field-programmable gate array Artificial intelligence Computer hardware Mathematics

Metrics

1
Cited By
0.10
FWCI (Field Weighted Citation Impact)
21
Refs
0.45
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.