JOURNAL ARTICLE

A Full SW-HW Demonstration of GEMM Accelerators with RISC-V Instruction Extensions

Abstract

Many deep learning accelerators have been proposed and designed in both academia and industry for executing deep neural networks with better power efficiency. Recently, many studies focus on developing a system-on-chip including both host processor and accelerators. In this paper, we demonstrate a full software-hardware stack for accelerating deep learning benchmarks using a co-processor attached to RISC-V core. To do so, we extend the RISC-V instruction set and modified the compilation stack to show significant end-to-end performance boost compared to the CPU-only processing scenario.

Keywords:
Computer science Parallel computing Reduced instruction set computing Instruction set Computer architecture Programming language

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
5
Refs
0.02
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Particle accelerators and beam dynamics
Physical Sciences →  Engineering →  Aerospace Engineering
Particle Accelerators and Free-Electron Lasers
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Particle Detector Development and Performance
Physical Sciences →  Physics and Astronomy →  Nuclear and High Energy Physics
© 2026 ScienceGate Book Chapters — All rights reserved.