Deep neural network inference on energy-harvesting tiny devices has emerged as a solution for sustainable edge intelligence. However, compact models optimized for continuously powered systems may become suboptimal when deployed on intermittently powered systems. This paper presents the pruning criterion, pruning strategy, and prototype implementation of iPrune, the first framework that incorporates intermittency into neural network pruning to produce compact models adaptable to intermittent systems. The pruned models are deployed and evaluated on a Texas Instruments device under various power strengths and across several TinyML applications. Compared to an energy-aware pruning framework, iPrune speeds up intermittent inference by 1.1 to 2 times while achieving comparable model accuracy.