Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency

Yanyang Li; Fuli Luo; Runxin Xu; Songfang Huang; Fei Huang; Liwei Wang

doi:10.60692/e0ekr-sea54

ScienceGate Book Chapters

JOURNAL ARTICLE

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency

Yanyang Li Fuli Luo Runxin Xu Songfang Huang Fei Huang Liwei Wang

Year: 2022 Journal: Greater South Information System

DOI: 10.60692/e0ekr-sea54

Get Full-Text PDF Get Analytical Report

Abstract

Structured pruning has been extensively studied on monolingual pre-trained language models and is yet to be fully evaluated on their multilingual counterparts.This work investigates three aspects of structured pruning on multilingual pre-trained language models: settings, algorithms, and efficiency.Experiments on nine downstream tasks show several counterintuitive phenomena: for settings, individually pruning for each language does not induce a better result; for algorithms, the simplest method performs the best; for efficiency, a fast model does not imply that it is also small.To facilitate the comparison on all sparsity levels, we present Dynamic Sparsification, a simple approach that allows training the model once and adapting to different model sizes at inference.We hope this work fills the gap in the study of structured pruning on multilingual pre-trained models and sheds light on future research.

Keywords:

Pruning Counterintuitive Language model Simple (philosophy) Machine translation Work (physics)

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.32

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Language and cultural evolution

Social Sciences → Social Sciences → Cultural Studies

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency

Abstract

Metrics

Topics

Related Documents

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency

Structured Pruning for Efficient Generative Pre-trained Language Models

Learnable Sparsity Structured Pruning for Acoustic Pre-trained Models

Syntactic multilingual probing of pre-trained language models of code