CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models

Aitor Ormazabal; Mikel Artetxe; Eneko Agirre

doi:10.18653/v1/2023.emnlp-main.180

ScienceGate Book Chapters

JOURNAL ARTICLE

CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models

Aitor Ormazabal Mikel Artetxe Eneko Agirre

Year: 2023 Pages: 2961-2974

DOI: 10.18653/v1/2023.emnlp-main.180

Get Full-Text PDF Get Analytical Report

Abstract

Methods for adapting language models (LMs) to new tasks and domains have traditionally assumed white-box access to the model, and work by modifying its parameters. However, this is incompatible with a recent trend in the field, where the highest quality models are only available as black-boxes through inference APIs. Even when the model weights are available, the computational cost of fine-tuning large LMs can be prohibitive for most practitioners. In this work, we present a lightweight method for adapting large LMs to new domains and tasks, assuming no access to their weights or intermediate activations. Our approach fine-tunes a small white-box LM and combines it with the large black-box LM at the probability level through a small network, learned on a small validation set. We validate our approach by adapting a large LM (OPT-30B) to several domains and a downstream task (machine translation), observing improved performance in all cases, of up to 9%, while using a domain expert 23x smaller.

Keywords:

Computer science Black box Machine translation Language model White box Inference Set (abstract data type) Task (project management) Field (mathematics) Domain (mathematical analysis) Artificial intelligence Machine learning Programming language

Metrics

Cited By

0.51

FWCI (Field Weighted Citation Impact)

Refs

0.68

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models

Abstract

Metrics

Citation History

Topics

Related Documents

Black-box Membership Inference Attacks against Fine-tuned Diffusion Models

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

Email Augmentation: A Comparison of Fine-tuned Small Language Models

Empowering ESG Insights in Vietnamese Through Fine-Tuned Language Models

BLADE: Enhancing Black-Box Large Language Models with Small Domain-Specific Models