ChatGPT Based Data Augmentation for Improved Parameter-Efficient   Debiasing of LLMs

Pengrui Han; Rafal Kocielnik; Adhithya Saravanan; Roy Jiang; Or; Sharir; Anima Anandkumar

arXiv:2402.11764·cs.CL·September 17, 2024·1 cites

ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

Pengrui Han, Rafal Kocielnik, Adhithya Saravanan, Roy Jiang, Or, Sharir, Anima Anandkumar

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel method using ChatGPT to generate synthetic data for debiasing large language models efficiently, improving fairness across multiple bias categories with minimal retraining.

Contribution

It introduces two prompting strategies for synthetic data generation, demonstrating superior debiasing performance and generalizability compared to existing datasets.

Findings

01

Synthetic data outperforms existing debiasing datasets.

02

Debiasing preserves the internal knowledge of LLMs.

03

Approach effectively mitigates intersectional biases.

Abstract

Large Language models (LLMs), while powerful, exhibit harmful social biases. Debiasing is often challenging due to computational costs, data constraints, and potential degradation of multi-task language capabilities. This work introduces a novel approach utilizing ChatGPT to generate synthetic training data, aiming to enhance the debiasing of LLMs. We propose two strategies: Targeted Prompting, which provides effective debiasing for known biases but necessitates prior specification of bias in question; and General Prompting, which, while slightly less effective, offers debiasing across various categories. We leverage resource-efficient LLM debiasing using adapter tuning and compare the effectiveness of our synthetic data to existing debiasing datasets. Our results reveal that: (1) ChatGPT can efficiently produce high-quality training data for debiasing other LLMs; (2) data produced via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

barryhpr/syntheticdebiasing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Storage Technologies · Brain Tumor Detection and Classification · Artificial Intelligence in Healthcare

MethodsAdapter