GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models

Zhang, Tao; Zeng, Ziqian; Xiao, Yuxiang; Zhuang, Huiping; Chen, Cen; Foulds, James; Pan, Shimei

doi:10.13016/m2lani-bx9l

ScienceGate Book Chapters

JOURNAL ARTICLE

GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models

Zhang, Tao Zeng, Ziqian Xiao, Yuxiang Zhuang, Huiping Chen, Cen Foulds, James Pan, Shimei

Year: 2024 Journal: Maryland Shared Open Access Repository (USMAI Consortium)

DOI: 10.13016/m2lani-bx9l

Get Full-Text PDF Get Analytical Report

Abstract

Large Language Models (LLMs) are prone to generating content that exhibits gender biases, raising significant ethical concerns. Alignment, the process of fine-tuning LLMs to better align with desired behaviors, is recognized as an effective approach to mitigate gender biases. Although proprietary LLMs have made significant strides in mitigating gender bias, their alignment datasets are not publicly available. The commonly used and publicly available alignment dataset, HH-RLHF, still exhibits gender bias to some extent. There is a lack of publicly available alignment datasets specifically designed to address gender bias. Hence, we developed a new dataset named GenderAlign, aiming at mitigating a comprehensive set of gender biases in LLMs. This dataset comprises 8k single-turn dialogues, each paired with a "chosen" and a "rejected" response. Compared to the "rejected" responses, the "chosen" responses demonstrate lower levels of gender bias and higher quality. Furthermore, we categorized the gender biases in the "rejected" responses of GenderAlign into 4 principal categories. The experimental results show the effectiveness of GenderAlign in reducing gender bias in LLMs.

Keywords:

Gender bias Set (abstract data type) Process (computing) Gender disparity Gender discrimination Gender analysis

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.56

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Genetic diversity and population structure

Life Sciences → Biochemistry, Genetics and Molecular Biology → Genetics

Genetics and Plant Breeding

Life Sciences → Agricultural and Biological Sciences → Plant Science

Genetic Mapping and Diversity in Plants and Animals

Life Sciences → Biochemistry, Genetics and Molecular Biology → Genetics

GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models

Abstract

Metrics

Topics

Related Documents

GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models

Locating and Mitigating Gender Bias in Large Language Models

Evaluating and Mitigating Gender Bias in Generative Large Language Models

EXPLORING GENDER BIAS IN LARGE LANGUAGE MODELS

Unveiling Gender Bias in Large Language Models