Prompting Large Language Models for Counterfactual Generation: An Empirical Study
Yongqi Li, Mayi Xu, Xin Miao, Shen Zhou, Tieyun Qian

TL;DR
This paper systematically evaluates large language models' ability to generate counterfactuals across various natural language understanding tasks, identifying strengths, weaknesses, and influencing factors such as prompt design and alignment techniques.
Contribution
It introduces a comprehensive evaluation framework for counterfactual generation by LLMs and analyzes the impact of different factors like prompt design and alignment methods.
Findings
LLMs are promising but face challenges in complex tasks like relation extraction.
Alignment techniques can enhance counterfactual generation capabilities.
Increasing model size alone does not improve counterfactual generation.
Abstract
Large language models (LLMs) have made remarkable progress in a wide range of natural language understanding and generation tasks. However, their ability to generate counterfactuals has not been examined systematically. To bridge this gap, we present a comprehensive evaluation framework on various types of NLU tasks, which covers all key factors in determining LLMs' capability of generating counterfactuals. Based on this framework, we 1) investigate the strengths and weaknesses of LLMs as the counterfactual generator, and 2) disclose the factors that affect LLMs when generating counterfactuals, including both the intrinsic properties of LLMs and prompt designing. The results show that, though LLMs are promising in most cases, they face challenges in complex tasks like RE since they are bounded by task-specific performance, entity constraints, and inherent selection bias. We also find…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
MethodsCounterfactuals Explanations · ALIGN
