Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation
Emily Dinan, Angela Fan, Adina Williams, Jack Urbanek, Douwe Kiela,, Jason Weston

TL;DR
This paper investigates gender bias in dialogue generation, analyzes its amplification in models, and proposes techniques like data augmentation and bias-controlled training to reduce bias while maintaining dialogue quality.
Contribution
It introduces effective bias mitigation methods for dialogue models, especially in highly biased datasets like LIGHT, and demonstrates their success through multiple evaluation metrics.
Findings
Bias mitigation techniques reduce gendered language in generated dialogues.
Combined methods are more effective than individual ones.
Models maintain engagement while generating less biased responses.
Abstract
Models often easily learn biases present in the training data, and their predictions directly reflect this bias. We analyze gender bias in dialogue data, and examine how this bias is actually amplified in subsequent generative chit-chat dialogue models. We measure gender bias in six existing dialogue datasets, and focus on the most biased one, the multi-player text-based fantasy adventure dataset LIGHT, as a testbed for our bias mitigation techniques. The LIGHT dataset is highly imbalanced with respect to gender, containing predominantly male characters, likely because it is entirely collected by crowdworkers and reflects common biases that exist in fantasy or medieval settings. We consider three techniques to mitigate gender bias: counterfactual data augmentation, targeted data collection, and bias controlled training. We show that our proposed techniques mitigate gender bias in LIGHT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
