Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts

Arshiya Aggarwal; Jiao Sun; Nanyun Peng

arXiv:2212.01700·cs.CL·December 6, 2022·1 cites

Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts

Arshiya Aggarwal, Jiao Sun, Nanyun Peng

PDF

Open Access 1 Repo

TL;DR

This paper proposes a method for evaluating biases in NLG systems using syntactically-diverse prompts, which leads to more reliable and tone-invariant bias assessments compared to fixed templates.

Contribution

It introduces a paraphrasing approach to generate syntactically-diverse prompts for bias evaluation, improving robustness over traditional fixed-template methods.

Findings

01

Syntactic variation affects bias measurement outcomes.

02

Some structures induce more toxic content, others less biased.

03

Robust evaluation benefits from tone-invariant, diverse prompts.

Abstract

We present a robust methodology for evaluating biases in natural language generation(NLG) systems. Previous works use fixed hand-crafted prefix templates with mentions of various demographic groups to prompt models to generate continuations for bias analysis. These fixed prefix templates could themselves be specific in terms of styles or linguistic structures, which may lead to unreliable fairness conclusions that are not representative of the general trends from tone varying prompts. To study this problem, we paraphrase the prompts with different syntactic structures and use these to evaluate demographic bias in NLG systems. Our results suggest similar overall bias trends but some syntactic structures lead to contradictory conclusions compared to past works. We show that our methodology is more robust and that some syntactic structures prompt more toxic content while others could…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arshiyaaggarwal/robust-nlg-bias-eval
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification