Joseph Peter V. R., Anitha S., Anusooya P. K., Jawahar E., Nithesh S., Sairamsiva S. K.
The increasing computational requirements of deep learning models have raised the demand for specialized hardware architectures that deliver high performance at low energy cost. General-purpose processors frequently fall short because of their high power consumption, low throughput, and inability to meet real-time processing demands. To overcome these obstacles, this work introduces a hardware-efficient multiplier design for a deep learning processing unit (DPU). The proposed architecture combines low-power arithmetic circuits, parallel processing units, and optimized dataflow mechanisms to improve performance and energy efficiency. Neural network core operations, such as matrix computations and activation functions, are performed by dedicated hardware blocks. An efficient on-chip memory hierarchy minimizes data movement, lowering both latency and power consumption. Simulation results obtained with industry-standard very large-scale integration (VLSI) tools show, relative to traditional processors, a 25% decrease in latency, a 40% increase in computational throughput, and a 30% reduction in power consumption. The architecture's scalability and modularity ensure compatibility with a variety of deep learning applications, including edge computing, autonomous systems, and internet of things (IoT) devices.
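As an illustrative sketch only (the paper's hardware design is not reproduced here), the two neural network core operations the abstract assigns to dedicated hardware blocks can be expressed in software: a matrix computation built from explicit multiply-accumulate (MAC) steps, which is exactly the operation a hardware multiplier unit accelerates, and an activation function (ReLU is assumed here, since the abstract does not name one).

```python
# Hypothetical software model of the DPU's core operations; function
# names and the choice of ReLU are illustrative assumptions.

def mac_matmul(a, b):
    """Matrix product of a (m x k) and b (k x n) as explicit MAC steps."""
    m, k, n = len(a), len(b), len(b[0])
    out = [[0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0
            for p in range(k):
                acc += a[i][p] * b[p][j]  # one MAC: a multiply feeding an adder
            out[i][j] = acc
    return out

def relu(matrix):
    """Elementwise ReLU activation: max(0, x)."""
    return [[max(0, x) for x in row] for row in matrix]

# One dense layer: matrix computation followed by activation.
layer_out = relu(mac_matmul([[1, -2], [3, 4]], [[5, 6], [7, 8]]))
```

In a DPU, the inner MAC loop is what the parallel processing units unroll across dedicated multiplier blocks, so the efficiency of the multiplier circuit directly bounds throughput and power.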