JOURNAL ARTICLE

Unsupervised Sentence Embedding Model Based on Contrastive Learning

Abstract

The unsupervised sentence embedding model of the contrastive learning framework SimCSE uses dropout noise as a data expansion method, which often defaults to having sentences of the same length to have more similar semantic information, and the random nature of dropout may lead to loss of semantic information or large differences due to sentence embedding. For this reason, we propose two agent tasks random deletion as well as R-Dropout to solve these problems. We conducted experiments on the text semantic similarity task on the publicly available datasets STS12-16, STS B, and SICK-R. The experimental results show that our proposed sentence embedding model improves the average Spearman correlation coefficient to 77.67 %, compared with the benchmark models IS-BERT base , CT-BERT base , and SimCSE- We also used the SenEval toolkit to evaluate the quality of sentence embed dings generated by the model, and used sentence embeddings as features of migration tasks MR, SUBJ, MPQA, TREC, and MRPC for classification tasks using SenEval, and the experimental results showed that our proposed sentence embedding model achieves better performance in the accuracy of classification in all cases.

Keywords:
Sentence Dropout (neural networks) Computer science Artificial intelligence Embedding Natural language processing Benchmark (surveying) Semantic similarity Machine learning

Metrics

1
Cited By
0.26
FWCI (Field Weighted Citation Impact)
34
Refs
0.54
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Instance Smoothed Contrastive Learning for Unsupervised Sentence Embedding

Hongliang HeJunlei ZhangZhenzhong LanYue Zhang

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2023 Vol: 37 (11)Pages: 12863-12871
JOURNAL ARTICLE

Prefix Data Augmentation for Contrastive Learning of Unsupervised Sentence Embedding

Chunchun WangShu Lv

Journal:   Applied Sciences Year: 2024 Vol: 14 (7)Pages: 2880-2880
BOOK-CHAPTER

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples

Hao WangYong Dou

Lecture notes in computer science Year: 2023 Pages: 419-431
© 2026 ScienceGate Book Chapters — All rights reserved.