Zero-Shot Text Classification via Self-Supervised Tuning

Chaoqun Liu; Wenxuan Zhang; Guizhen Chen; Xiaobao Wu; Anh Tuan Luu; Chip-Hong Chang; Lidong Bing

doi:10.18653/v1/2023.findings-acl.110

ScienceGate Book Chapters

JOURNAL ARTICLE

Zero-Shot Text Classification via Self-Supervised Tuning

Chaoqun Liu Wenxuan Zhang Guizhen Chen Xiaobao Wu Anh Tuan Luu Chip-Hong Chang Lidong Bing

Year: 2023 Pages: 1743-1761

DOI: 10.18653/v1/2023.findings-acl.110

Get Full-Text PDF Get Analytical Report

Abstract

Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pre-trained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning.

Keywords:

Computer science Artificial intelligence Paragraph Sentence Natural language processing Language model Inference Machine learning Code (set theory) Zero (linguistics)

Metrics

Cited By

1.79

FWCI (Field Weighted Citation Impact)

Refs

0.84

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Zero-Shot Text Classification via Self-Supervised Tuning

Abstract

Metrics

Citation History

Topics

Related Documents

Preserving Zero-shot Capability in Supervised Fine-tuning for Multi-label Text Classification

Zero-Shot Text Classification with Self-Training

Zero-shot Text Classification via Reinforced Self-training

A weakly supervised textual entailment approach to zero-shot text classification

Zero-Shot Turkish Text Classification