An Analysis of the Effects of Decoding Algorithms on Fairness in Open-Ended Language Generation
Jwala Dhamala, Varun Kumar, Rahul Gupta, Kai-Wei Chang, Aram Galstyan

TL;DR
This paper systematically analyzes how different decoding algorithms influence fairness in open-ended language generation, revealing significant impacts on bias, diversity, and sentiment, and offers recommendations for fairer decoding practices.
Contribution
It provides the first comprehensive study of decoding algorithm effects on fairness in language models, highlighting trade-offs and proposing standardized evaluation methods.
Findings
Fairness varies significantly with decoding hyper-parameters.
More diverse outputs tend to contain more negative sentiment.
Recommendations for balancing fairness, diversity, and quality.
Abstract
Several prior works have shown that language models (LMs) can generate text containing harmful social biases and stereotypes. While decoding algorithms play a central role in determining properties of LM generated text, their impact on the fairness of the generations has not been studied. We present a systematic analysis of the impact of decoding algorithms on LM fairness, and analyze the trade-off between fairness, diversity and quality. Our experiments with top-, top- and temperature decoding algorithms, in open-ended language generation, show that fairness across demographic groups changes significantly with change in decoding algorithm's hyper-parameters. Notably, decoding algorithms that output more diverse text also output more texts with negative sentiment and regard. We present several findings and provide recommendations on standardized reporting of decoding details in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Computational and Text Analysis Methods
