JOURNAL ARTICLE

DistillCSE: Distilled Contrastive Learning for Sentence Embeddings

Abstract

This paper proposes DistillCSE, a framework that performs contrastive learning under the self-training paradigm with knowledge distillation. The potential advantage of DistillCSE is its self-enhancing feature: by using a base model to provide additional supervision signals, a stronger model can be learned through knowledge distillation. However, a vanilla DistillCSE built on the standard implementation of knowledge distillation achieves only marginal improvements. Quantitative analyses reveal the reason: owing to the nature of contrastive learning, the teacher model's logits exhibit relatively high variance. To mitigate the issue induced by this high variance, the paper proposes two simple yet effective solutions for knowledge distillation: a Group-P shuffling strategy that acts as an implicit regularizer, and averaging the logits from multiple teacher components. Experiments on standard benchmarks demonstrate that the proposed DistillCSE outperforms many strong baselines and achieves new state-of-the-art performance.
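To make the two remedies concrete, here is a minimal PyTorch sketch (illustrative only, not the authors' released code): it averages in-batch similarity logits from several teacher components and then applies a Group-P-style shuffle before distilling into the student. All names (contrastive_logits, group_p_shuffle, tau, p) are our own illustrative choices, the shuffle body reflects one plausible reading of the strategy (sort, permute within groups of size p, undo the sort), and the paper presents the two remedies as separate solutions; they are combined here only for compactness.

    import torch
    import torch.nn.functional as F

    def contrastive_logits(anchors, candidates, tau=0.05):
        # SimCSE-style in-batch similarity logits: row i's positive is
        # column i; the remaining columns act as negatives.
        a = F.normalize(anchors, dim=-1)
        c = F.normalize(candidates, dim=-1)
        return a @ c.T / tau  # shape: (batch, batch)

    def group_p_shuffle(logits, p=4):
        # Hypothetical reading of Group-P shuffling: sort each row's logits,
        # randomly permute values inside consecutive groups of size p, then
        # undo the sort. Logits of similar magnitude swap places, perturbing
        # the teacher's fine-grained ranking while preserving its coarse
        # structure -- an implicit regularizer.
        sorted_vals, order = logits.sort(dim=-1, descending=True)
        n = logits.size(-1)
        for start in range(0, n, p):
            idx = torch.randperm(min(p, n - start)) + start
            sorted_vals[..., start:start + len(idx)] = sorted_vals[..., idx]
        # Scatter shuffled values back to their pre-sort column positions.
        return torch.zeros_like(logits).scatter_(-1, order, sorted_vals)

    def distill_loss(student_pair, teacher_pairs, tau=0.05, p=4):
        # student_pair and each teacher pair are (anchor_embs, positive_embs)
        # tensors of shape (batch, dim) from the respective encoders.
        s_logits = contrastive_logits(*student_pair, tau)
        with torch.no_grad():
            # Averaging logits over several teacher components damps the
            # high variance of any single teacher's in-batch similarities.
            t_logits = torch.stack(
                [contrastive_logits(a, c, tau) for a, c in teacher_pairs]
            ).mean(dim=0)
            t_logits = group_p_shuffle(t_logits, p)
        # KL divergence between teacher and student similarity distributions.
        return F.kl_div(F.log_softmax(s_logits, dim=-1),
                        F.softmax(t_logits, dim=-1),
                        reduction="batchmean")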

Keywords:
Contrastive learning, Knowledge distillation, Sentence embeddings, Regularization, Variance, Machine learning, Natural language processing, Artificial intelligence, Computer science

Metrics

Cited by: 4
FWCI (Field-Weighted Citation Impact): 1.02
References: 51
Citation Normalized Percentile: 0.77

Topics

Domain Adaptation and Few-Shot Learning (Physical Sciences → Computer Science → Artificial Intelligence)
Sentiment Analysis and Opinion Mining (Physical Sciences → Computer Science → Artificial Intelligence)
Topic Modeling (Physical Sciences → Computer Science → Artificial Intelligence)