Debiasing Pre-Trained Language Models via Efficient Fine-Tuning

Michael Gira; Ruisu Zhang; Kangwook Lee

doi:10.18653/v1/2022.ltedi-1.8

JOURNAL ARTICLE

Year: 2022

Abstract

An explosion in the popularity of transformer-based language models (such as GPT-3, BERT, RoBERTa, and ALBERT) has opened the doors to new machine learning applications involving language modeling, text generation, and more. However, recent scrutiny reveals that these language models contain inherent biases towards certain demographics reflected in their training data. While research has tried mitigating this problem, existing approaches either fail to remove the bias completely, degrade performance ("catastrophic forgetting"), or are costly to execute. This work examines how to reduce gender bias in a GPT-2 language model by fine-tuning less than 1% of its parameters. Through quantitative benchmarks, we show that this is a viable way to reduce prejudice in pre-trained language models while remaining cost-effective at scale.
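To make the "less than 1% of its parameters" figure concrete, the sketch below counts parameters for the smallest public GPT-2 configuration (12 layers, hidden size 768, vocabulary 50,257, context length 1,024) and reports what share a small subset represents. The choice of subset here (layer-norm parameters plus positional embeddings) is an assumption made for illustration; the abstract does not say which parameters are fine-tuned, and the function names are hypothetical.

```python
# Rough parameter accounting for the smallest public GPT-2 configuration.
# Assumption: the output head shares weights with the token embedding.
N_LAYER, D_MODEL, VOCAB, N_CTX = 12, 768, 50257, 1024


def gpt2_param_counts(n_layer=N_LAYER, d=D_MODEL, vocab=VOCAB, n_ctx=N_CTX):
    """Return the number of parameters in each component group."""
    layer_norm = 2 * d                             # gain + bias
    attention = (d * 3 * d + 3 * d) + (d * d + d)  # QKV projection + output projection
    mlp = (d * 4 * d + 4 * d) + (4 * d * d + d)    # expand to 4d, project back to d
    return {
        "token_embedding": vocab * d,
        "position_embedding": n_ctx * d,
        "attention": n_layer * attention,
        "mlp": n_layer * mlp,
        "layer_norm": (2 * n_layer + 1) * layer_norm,  # two per block + final norm
    }


def trainable_fraction(counts, trainable_groups):
    """Share of all parameters that belong to the groups left trainable."""
    total = sum(counts.values())
    trainable = sum(counts[group] for group in trainable_groups)
    return trainable / total


counts = gpt2_param_counts()
# Hypothetical subset: everything else would stay frozen during fine-tuning.
fraction = trainable_fraction(counts, ["layer_norm", "position_embedding"])
print(f"total parameters: {sum(counts.values()):,}")
print(f"trainable share:  {fraction:.2%}")
```

Under these assumptions the model has 124,439,808 parameters and the chosen subset covers 824,832 of them, or about 0.66%. Freezing the remaining groups is what keeps the cost of such fine-tuning low.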

Keywords:

Language model; Computer science; Debiasing; Forgetting; Popularity; Scrutiny; Doors; Machine learning; Artificial intelligence; Extrapolation; Overfitting; Natural language processing; Cognitive psychology; Psychology; Artificial neural network

Metrics

FWCI (Field Weighted Citation Impact): 5.68

Citation Normalized Percentile: 0.94

Topics

Topic Modeling (Physical Sciences → Computer Science → Artificial Intelligence)

Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)

Speech Recognition and Synthesis (Physical Sciences → Computer Science → Artificial Intelligence)

Related Documents

Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models

Efficient Fine-Tuning for Low-Resource Tibetan Pre-trained Language Models

Parameter-efficient fine-tuning of large-scale pre-trained language models

Pruning Pre-trained Language Models Without Fine-Tuning

Data-Efficient Fine-Tuning for Pre-Trained Language Model