Parameter-Efficient Fine-Tuning without Introducing New Latency

Baohao Liao; Yan Meng; Christof Monz

doi:10.18653/v1/2023.acl-long.233

ScienceGate Book Chapters

JOURNAL ARTICLE

Parameter-Efficient Fine-Tuning without Introducing New Latency

Baohao Liao Yan Meng Christof Monz

Year: 2023 Pages: 4242-4260

DOI: 10.18653/v1/2023.acl-long.233

Get Full-Text PDF Get Analytical Report

Abstract

Parameter-efficient fine-tuning (PEFT) of pre-trained language models has recently demonstrated remarkable achievements, effectively matching the performance of full fine-tuning while utilizing significantly fewer trainable parameters, and consequently addressing the storage and communication constraints. Nonetheless, various PEFT methods are limited by their inherent characteristics. In the case of sparse fine-tuning, which involves modifying only a small subset of the existing parameters, the selection of fine-tuned parameters is task- and domain-specific, making it unsuitable for federated learning. On the other hand, PEFT methods with adding new parameters typically introduce additional inference latency. In this paper, we demonstrate the feasibility of generating a sparse mask in a task-agnostic manner, wherein all downstream tasks share a common mask. Our approach, which relies solely on the magnitude information of pre-trained parameters, surpasses existing methodologies by a significant margin when evaluated on the GLUE benchmark. Additionally, we introduce a novel adapter technique that directly applies the adapter to pre-trained parameters instead of the hidden representation, thereby achieving identical inference speed to that of full fine-tuning. Through extensive experiments, our proposed method attains a new state-of-the-art outcome in terms of both performance and storage efficiency, storing only 0.03% parameters of full fine-tuning.

Keywords:

Computer science Fine-tuning Inference Benchmark (surveying) Latency (audio) Adapter (computing) Task (project management) Language model Artificial intelligence Machine learning Computer engineering Computer hardware

Metrics

Cited By

5.88

FWCI (Field Weighted Citation Impact)

Refs

0.95

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Parameter-Efficient Fine-Tuning without Introducing New Latency

Abstract

Metrics

Citation History

Topics

Related Documents

Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting

Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting

6 LLM Fine-Tuning: Instruction and Parameter-Efficient Fine-Tuning (PEFT)

Bad-Tuning: Backdooring Vision Transformer Parameter-Efficient Fine-Tuning

Semantic Hierarchical Prompt Tuning for Parameter-Efficient Fine-Tuning