Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters

Junyi Peng; Themos Stafylakis; Rongzhi Gu; Oldřich Plchot; Ladislav Mošner; Lukáš Burget; Jaň Černocký

doi:10.1109/icassp49357.2023.10094795

ScienceGate Book Chapters

JOURNAL ARTICLE

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters

Junyi Peng Themos Stafylakis Rongzhi Gu Oldřich Plchot Ladislav Mošner Lukáš Burget Jaň Černocký

Year: 2023 Pages: 1-5

DOI: 10.1109/icassp49357.2023.10094795

Get Full-Text PDF Get Analytical Report

Abstract

Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various downstream tasks. However, most fine-tuning approaches update all the parameters of the pre-trained model, which becomes prohibitive as the model size grows and sometimes results in over-fitting on small datasets. In this paper, we conduct a comprehensive analysis of applying parameter-efficient transfer learning (PETL) methods to reduce the required learnable parameters for adapting to speaker verification tasks. Specifically, during the fine-tuning process, the pre-trained models are frozen, and only lightweight modules inserted in each Transformer block are trainable (a method known as adapters). Moreover, to boost the performance in a cross-language low-resource scenario, the Transformer model is further tuned on a large intermediate dataset before directly fine-tuning it on a small dataset. With updating fewer than 4% of parameters, (our proposed) PETL-based methods achieve comparable performances with full fine-tuning methods (Vox1-O: 0.55%, Vox1-E: 0.82%, Vox1-H:1.73%).

Keywords:

Transformer Computer science Transfer of learning Artificial intelligence Fine-tuning Speech recognition Machine learning Voltage Engineering

Metrics

Cited By

4.34

FWCI (Field Weighted Citation Impact)

Refs

0.93

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters

Abstract

Metrics

Citation History

Topics

Related Documents

Efficient Integrated Features Based on Pre-trained Models for Speaker Verification

Efficient Adapter Tuning of Pre-Trained Speech Models for Automatic Speaker Verification

UniPET-SPK: A Unified Framework for Parameter-Efficient Tuning of Pre-Trained Speech Models for Robust Speaker Verification

Parameter-efficient transfer learning of prompts and adapters on vision-language models

SR-HuBERT : An Efficient Pre-Trained Model for Speaker Verification