LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

Dongfu Jiang; Xiang Ren; Bill Lin

doi:10.18653/v1/2023.acl-long.792

ScienceGate Book Chapters

JOURNAL ARTICLE

LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

Dongfu Jiang Xiang Ren Bill Lin

Year: 2023 Pages: 14165-14178

DOI: 10.18653/v1/2023.acl-long.792

Get Full-Text PDF Get Analytical Report

Abstract

We present LLM-Blender, an ensembling framework designed to attain consistently superior performance by leveraging the diverse strengths of multiple open-source large language models (LLMs). Our framework consists of two modules: PairRanker and GenFuser, addressing the observation that optimal LLMs for different examples can significantly vary. PairRanker employs a specialized pairwise comparison method to distinguish subtle differences between candidate outputs. It jointly encodes the input text and a pair of candidates, using cross-attention encoders to determine the superior one. Our results demonstrate that PairRanker exhibits the highest correlation with ChatGPT-based ranking. Then, GenFuser aims to merge the top-ranked candidates, generating an improved output by capitalizing on their strengths and mitigating their weaknesses. To facilitate large-scale evaluation, we introduce a benchmark dataset, MixInstruct, which is a mixture of multiple instruction datasets featuring oracle pairwise comparisons. Our LLM-Blender significantly outperform individual LLMs and baseline methods across various metrics, establishing a substantial performance gap.

Keywords:

Pairwise comparison Computer science Oracle Merge (version control) Ranking (information retrieval) Benchmark (surveying) Machine learning Artificial intelligence Generative model Data mining Generative grammar Information retrieval

Metrics

Cited By

20.18

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Text Readability and Simplification

Physical Sciences → Computer Science → Artificial Intelligence

LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

Abstract

Metrics

Citation History

Topics

Related Documents

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting

Generative AI: foundational models. Natural Language Processing (NLP) and LARGE Language Models (LLM)

LLM – Large Language Models

Enrich Humanoids With Large Language Models (LLM)

LLM-IE: a python package for biomedical generative information extraction with large language models