Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation

Jizhi Zhang; Keqin Bao; Yang Zhang; Wenjie Wang; Fuli Feng; Xiangnan He

doi:10.1145/3604915.3608860

ScienceGate Book Chapters

JOURNAL ARTICLE

Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation

Jizhi Zhang Keqin Bao Yang Zhang Wenjie Wang Fuli Feng Xiangnan He

Year: 2023 Pages: 993-999

DOI: 10.1145/3604915.3608860

Get Full-Text PDF Get Analytical Report

Abstract

The remarkable achievements of Large Language Models (LLMs) have led to the emergence of a novel recommendation paradigm — Recommendation via LLM (RecLLM). Nevertheless, it is important to note that LLMs may contain social prejudices, and therefore, the fairness of recommendations made by RecLLM requires further investigation. To avoid the potential risks of RecLLM, it is imperative to evaluate the fairness of RecLLM with respect to various sensitive attributes on the user side. Due to the differences between the RecLLM paradigm and the traditional recommendation paradigm, it is problematic to directly use the fairness benchmark of traditional recommendation. To address the dilemma, we propose a novel benchmark called Fairness of Recommendation via LLM (FaiRLLM). This benchmark comprises carefully crafted metrics and a dataset that accounts for eight sensitive attributes1 in two recommendation scenarios: music and movies. By utilizing our FaiRLLM benchmark, we conducted an evaluation of ChatGPT and discovered that it still exhibits unfairness to some sensitive attributes when generating recommendations. Our code and dataset can be found at https://github.com/jizhi-zhang/FaiRLLM.

Keywords:

Benchmark (surveying) Computer science Dilemma Code (set theory) Recommender system Artificial intelligence Machine learning Programming language

Metrics

111

Cited By

28.35

FWCI (Field Weighted Citation Impact)

Refs

1.00

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Recommender Systems and Techniques

Physical Sciences → Computer Science → Information Systems

Radiomics and Machine Learning in Medical Imaging

Health Sciences → Medicine → Radiology, Nuclear Medicine and Imaging

Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation

Abstract

Metrics

Citation History

Topics

Related Documents

Item-side Fairness of Large Language Model-based Recommendation System

Fairness identification of large language models in recommendation

Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach

GenRec: Large Language Model for Generative Recommendation

Large Language Model-based Recommendation System Agents