Retrieval-Augmented Few-shot Text Classification

Guoxin Yu; Lemao Liu; Haiyun Jiang; Shuming Shi; Xiang Ao

doi:10.18653/v1/2023.findings-emnlp.447

Year: 2023 Pages: 6721-6735

Abstract

Retrieval-augmented methods are successful in the standard scenario where the retrieval space is sufficient; in the few-shot scenario with limited retrieval space, however, this paper shows that putting them into practice is non-trivial. First, it is impossible to retrieve semantically similar examples using an off-the-shelf metric, so it is crucial to learn a task-specific retrieval metric. Second, our preliminary experiments demonstrate that it is difficult to optimize a plausible metric by minimizing the standard cross-entropy loss. In-depth analyses show quantitatively that minimizing the cross-entropy loss suffers from weak supervision signals and severe gradient vanishing during optimization. To address these issues, we introduce two novel training objectives, EM-L and R-L, which provide more task-specific guidance to the retrieval metric through the EM algorithm and a ranking-based loss, respectively. Extensive experiments on 10 datasets demonstrate the superior performance of the proposed retrieval-augmented methods.
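The abstract contrasts a cross-entropy objective with a ranking-based objective for training the retrieval metric. The sketch below is a generic illustration of that contrast, not the paper's EM-L or R-L formulation, which the abstract does not specify. It assumes a dot-product retrieval score, a softmax over retrieved examples, and a pairwise hinge loss with a hypothetical `margin` parameter; all function names are invented for illustration.

```python
import math

def dot(u, v):
    # Assumed retrieval metric: plain dot product between embeddings.
    return sum(a * b for a, b in zip(u, v))

def softmax(scores):
    # Numerically stable softmax over a list of scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def class_probs(query, examples):
    # examples: list of (embedding, label) pairs from the retrieval space.
    # Each label's probability is the total retrieval weight of its examples.
    weights = softmax([dot(query, vec) for vec, _ in examples])
    probs = {}
    for w, (_, label) in zip(weights, examples):
        probs[label] = probs.get(label, 0.0) + w
    return probs

def cross_entropy(query, examples, gold):
    # Standard cross-entropy on the retrieval-weighted label distribution.
    # Assumes at least one retrieved example carries the gold label.
    return -math.log(class_probs(query, examples)[gold])

def pairwise_ranking_loss(query, examples, gold, margin=1.0):
    # Generic hinge loss: every gold-label example should outscore every
    # other example by at least `margin` (a hypothetical hyperparameter).
    pos = [dot(query, v) for v, y in examples if y == gold]
    neg = [dot(query, v) for v, y in examples if y != gold]
    pairs = [(p, n) for p in pos for n in neg]
    if not pairs:
        return 0.0
    return sum(max(0.0, margin - p + n) for p, n in pairs) / len(pairs)

# Toy retrieval space with made-up 2-d embeddings.
examples = [([1.0, 0.0], "pos"), ([0.0, 1.0], "neg"), ([1.0, 1.0], "pos")]
query = [1.0, 0.0]
print(class_probs(query, examples))
print(cross_entropy(query, examples, "pos"))
print(pairwise_ranking_loss(query, examples, "pos", margin=2.0))
```

In this toy setup the cross-entropy only sees the summed weight per label, whereas the ranking loss compares individual retrieved examples directly, which is one way a ranking objective can give the metric more example-level guidance.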

Keywords:

Computer science; Ranking (information retrieval); Metric (unit); Entropy (arrow of time); Information retrieval; Task (project management); Image retrieval; Artificial intelligence; Machine learning; Data mining; Image (mathematics)

Metrics

FWCI (Field Weighted Citation Impact): 1.79

Citation Normalized Percentile: 0.85

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Text Augmented Correlation Transformer For Few-shot Classification & Segmentation

MalMixer: Few-Shot Malware Classification with Retrieval-Augmented Semi-Supervised Learning

Retrieval-style In-context Learning for Few-shot Hierarchical Text Classification

Zero-/Few-Shot Anomaly Classification for Transistor Using Multimodal CLIP Retrieval Augmented

Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation