JOURNAL ARTICLE

RISC-V vector processor for acceleration of machine learning algorithms

Abstract

In this paper we present an RTL implementation of a 32-bit parametrizable vector processor for acceleration of algorithms working in fixed-point arithmetic. The processor uses the latest RISC-V vector extension ISA specification and is deployed and tested on a Zynq Soc using Avnet Zedboard. Our microarchitecture exploits the inherent parallelism in algorithms by splitting execution across multiple vector lanes and enabling chaining of vector instructions. To provide the required number of read/write ports for instruction chaining, the vector register bank uses the double-pumping technique in combination with an XOR-based approach. First, the microarchitecture of the system is explained in detail, and the results of the implementation on the Zedboard are presented for some different processor configurations. We then compared the performance of the implemented design with some different modern processor cores.

Keywords:
Computer science Microarchitecture Parallel computing Chaining Instruction set Vector processor Acceleration Reduced instruction set computing Instructions per cycle Very long instruction word Bit array Computer architecture Algorithm Computer hardware Central processing unit

Metrics

9
Cited By
2.27
FWCI (Field Weighted Citation Impact)
6
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Parallel Computing and Optimization Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
Embedded Systems Design Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
Error Correcting Code Techniques
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.