Large-scale pre-trained models bring significant gains to many speech-related tasks. However, it remains challenging to deploy these large models when the computing power of terminal devices is limited. Pruning is an effective way to reduce both memory footprint and computation cost. The imperfect evaluation criteria of existing pruning methods and their complex fine-tuning procedures lead to a relatively high loss of accuracy. To address these problems, we propose a structured pruning method that introduces the upper confidence bound (UCB) of importance scores to assess the potential of each model component more accurately. In addition, we introduce a set of learnable pruning threshold parameters that can be trained via stochastic gradient descent, thereby reducing hyper-parameter tuning. We apply our method to HuBERT models on the automatic speech recognition (ASR) task. Our results show that across all pruning granularities and pruning ratios, our method yields higher accuracy and larger inference speedup ratios. At 60% sparsity, the accuracy of our method degrades by only 0.63%.
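The sketch below illustrates, under stated assumptions, the two ideas mentioned in the abstract: an upper-confidence-bound importance score and a pruning threshold trained by SGD through a soft mask. The exact UCB formulation, the straight-through mask, and all names (`ucb_importance`, `ThresholdMask`, `beta`, `temperature`) are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

def ucb_importance(mean_score, var_score, n_obs, beta=1.0):
    # UCB of the importance score: running mean plus an uncertainty bonus,
    # so components with few or noisy observations are not pruned too early.
    # (The bonus form and beta are assumptions for illustration.)
    return mean_score + beta * torch.sqrt(var_score / (n_obs + 1e-8))

class ThresholdMask(nn.Module):
    # Differentiable pruning mask: components whose UCB importance falls
    # below a learnable threshold are zeroed; the threshold itself is a
    # parameter updated jointly with the model by stochastic gradient descent.
    def __init__(self, init_threshold=0.0, temperature=0.1):
        super().__init__()
        self.threshold = nn.Parameter(torch.full((1,), init_threshold))
        self.temperature = temperature

    def forward(self, importance):
        soft = torch.sigmoid((importance - self.threshold) / self.temperature)
        hard = (soft > 0.5).float()
        # Straight-through estimator: hard 0/1 mask in the forward pass,
        # gradients flow through the soft sigmoid in the backward pass.
        return hard + (soft - soft.detach())

# Example: masking attention heads of one transformer layer.
num_heads = 12
mask_mod = ThresholdMask()
mean_s = torch.rand(num_heads)         # running mean of per-head importance
var_s = torch.rand(num_heads) * 0.01   # running variance of per-head importance
scores = ucb_importance(mean_s, var_s, n_obs=100.0)
head_mask = mask_mod(scores)           # multiply into per-head outputs
```

In such a scheme, a sparsity target would typically be enforced through an auxiliary regularization term on the mask, so the thresholds adapt per granularity (heads, FFN units, layers) without manual tuning.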