Towards Adaptive Prefix Tuning for Parameter-Efficient Language Model Fine-tuning

Zhen-Ru Zhang; Chuanqi Tan; Haiyang Xu; Chengyu Wang; Jun Huang; Songfang Huang

doi:10.18653/v1/2023.acl-short.107

JOURNAL ARTICLE

Year: 2023 Pages: 1239-1248

Abstract

Fine-tuning all parameters of a large pre-trained language model for each downstream task is prohibitively expensive. Parameter-efficient fine-tuning, which optimizes only a few task-specific parameters while keeping the pre-trained model frozen, has therefore attracted attention. In this work, we focus on prefix tuning, which optimizes only continuous prefix vectors (i.e., pseudo tokens) inserted into the Transformer layers. Based on the observation that the learned syntactic and semantic representations vary considerably across layers, we argue that an adaptive prefix can be tailored to each layer better than a fixed one, making fine-tuning more effective and efficient. We therefore propose Adaptive Prefix Tuning (APT), which adjusts the prefix at both the fine-grained token level and the coarse-grained layer level with a gate mechanism. Experiments on the SuperGLUE and NER datasets show the effectiveness of APT. In addition, using the gate as a probe, we validate the efficiency and effectiveness of the variable prefix.
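
The abstract does not give the gate equations, so the following is only a minimal sketch of how a token-level and a layer-level gate could rescale one layer's prefix. The function `gated_prefix`, the projection `w_gate`, the scalar `layer_scale`, and the mean-pooling of the hidden states are all illustrative assumptions, not the paper's definitions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gated_prefix(prefix, hidden, w_gate, layer_scale):
    """Rescale one layer's prefix with a token-level and a layer-level gate.

    prefix:      (prefix_len, d_model) trainable prefix vectors for this layer
    hidden:      (seq_len, d_model) hidden states entering this layer
    w_gate:      (d_model, prefix_len) hypothetical trainable gate projection
    layer_scale: hypothetical trainable scalar for this layer
    """
    # Token-level (fine-grained) gate: one weight in (0, 1) per pseudo token.
    # Computing it from the mean-pooled input is an assumption of this sketch.
    token_gate = sigmoid(hidden.mean(axis=0) @ w_gate)   # (prefix_len,)
    # Layer-level (coarse-grained) gate: one scalar rescales the whole prefix.
    return layer_scale * token_gate[:, None] * prefix    # (prefix_len, d_model)

# Toy dimensions and random values, for illustration only.
rng = np.random.default_rng(0)
prefix_len, seq_len, d_model = 4, 6, 8
prefix = rng.normal(size=(prefix_len, d_model))
hidden = rng.normal(size=(seq_len, d_model))
w_gate = rng.normal(size=(d_model, prefix_len))
gated = gated_prefix(prefix, hidden, w_gate, layer_scale=0.5)
```

In this sketch the pre-trained weights are untouched: only `prefix`, `w_gate`, and `layer_scale` would be trained, and each layer would hold its own copy of all three.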

Keywords:

Prefix; Computer science; Security token; Fine-tuning; Language model; Transformer; Focus (optics); Semantics (computer science); Artificial intelligence; Programming language; Voltage

Metrics

FWCI (Field Weighted Citation Impact): 15.84

Citation Normalized Percentile: 0.99

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Sensi-Bert: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient Language Model

Explore parameter efficient fine-tuning methods on Large Language Model

Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification

Efficient Differentially Private Fine-Tuning with QLoRA and Prefix Tuning for Large Language Models

Parameter-efficient fine-tuning of large language models using semantic knowledge tuning