ClarifyDelphi: Reinforced Clarification Questions with Defeasibility   Rewards for Social and Moral Situations

Valentina Pyatkin; Jena D. Hwang; Vivek Srikumar; Ximing Lu; Liwei; Jiang; Yejin Choi; Chandra Bhagavatula

arXiv:2212.10409·cs.CL·June 1, 2023·5 cites

ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations

Valentina Pyatkin, Jena D. Hwang, Vivek Srikumar, Ximing Lu, Liwei, Jiang, Yejin Choi, Chandra Bhagavatula

PDF

Open Access 2 Repos

TL;DR

ClarifyDelphi is an interactive system that learns to ask clarification questions to better understand social and moral contexts, using reinforcement learning with defeasibility rewards to improve moral judgment divergence.

Contribution

It introduces a reinforcement learning framework with defeasibility rewards for generating questions that reveal diverging moral judgments based on contextual answers.

Findings

01

Generated questions are more relevant and informative.

02

Questions lead to greater divergence in moral judgments.

03

System outperforms baseline question-generation methods.

Abstract

Context is everything, even in commonsense moral reasoning. Changing contexts can flip the moral judgment of an action; "Lying to a friend" is wrong in general, but may be morally acceptable if it is intended to protect their life. We present ClarifyDelphi, an interactive system that learns to ask clarification questions (e.g., why did you lie to your friend?) in order to elicit additional salient contexts of a social or moral situation. We posit that questions whose potential answers lead to diverging moral judgments are the most informative. Thus, we propose a reinforcement learning framework with a defeasibility reward that aims to maximize the divergence between moral judgments of hypothetical answers to a question. Human evaluation demonstrates that our system generates more relevant, informative and defeasible questions compared to competitive baselines. Our work is ultimately…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPsychology of Moral and Emotional Judgment · Ethics in Business and Education · Misinformation and Its Impacts

MethodsFLIP