Unsupervised Speaker Verification Using Pre-Trained Model and Label Correction

Zhicong Chen; Jie Wang; Wenxuan Hu; Lin Li; Qingyang Hong

doi:10.1109/icassp49357.2023.10094610

ScienceGate Book Chapters

JOURNAL ARTICLE

Unsupervised Speaker Verification Using Pre-Trained Model and Label Correction

Zhicong Chen Jie Wang Wenxuan Hu Lin Li Qingyang Hong

Year: 2023 Pages: 1-5

DOI: 10.1109/icassp49357.2023.10094610

Get Full-Text PDF Get Analytical Report

Abstract

Recently, the fine-tuning pre-trained model framework has emerged as a promising paradigm for speech-processing tasks. In this study, we present a novel strategy for unsupervised speaker verification using the Sub-structure of Pre-Trained Model (Sub-PTM), which consists of a CNN-based feature extractor and several Transformer blocks. To obtain the initial pseudo labels, we utilize Infomap to perform clustering on the representations extracted from the Sub-PTM. The generated pseudo labels are then leveraged to train a speaker verification model containing a Sub-PTM and a downstream network. We also propose an Online and Offline Label Correction (OAO-LC) method to alleviate the effects of incorrect pseudo labels. By incorporating these techniques, our system achieves competitive results compared to the supervised baseline.

Keywords:

Computer science Artificial intelligence Cluster analysis Transformer Feature extraction Speech recognition Pattern recognition (psychology) Feature (linguistics) Speaker verification Extractor Speaker recognition

Metrics

Cited By

2.30

FWCI (Field Weighted Citation Impact)

Refs

0.87

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Unsupervised Speaker Verification Using Pre-Trained Model and Label Correction

Abstract

Metrics

Citation History

Topics

Related Documents

PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification

SR-HuBERT : An Efficient Pre-Trained Model for Speaker Verification

An iVector extractor using pre-trained neural networks for speaker verification

Improving Noise Robustness in Self-supervised Pre-trained Model for Speaker Verification

Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction