Single-Turn Debate Does Not Help Humans Answer Hard   Reading-Comprehension Questions

Alicia Parrish; Harsh Trivedi; Ethan Perez; Angelica Chen and; Nikita Nangia; Jason Phang; Samuel R. Bowman

arXiv:2204.05212·cs.CL·April 14, 2022

Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions

Alicia Parrish, Harsh Trivedi, Ethan Perez, Angelica Chen and, Nikita Nangia, Jason Phang, Samuel R. Bowman

PDF

Open Access

TL;DR

This paper investigates whether single-turn debates with explanations help humans answer difficult reading comprehension questions, finding that explanations do not improve accuracy but selected snippets do.

Contribution

The study introduces a debate-style dataset with explanations for both correct and incorrect answers to evaluate their effectiveness in aiding human comprehension.

Findings

01

Explanations did not improve human accuracy in answering questions.

02

Providing human-selected text snippets increased accuracy.

03

The results suggest improvements for future debate-based data collection.

Abstract

Current QA systems can generate reasonable-sounding yet false answers without explanation or evidence for the generated answer, which is especially problematic when humans cannot readily check the model's answers. This presents a challenge for building trust in machine learning systems. We take inspiration from real-world situations where difficult questions are answered by considering opposing sides (see Irving et al., 2018). For multiple-choice QA examples, we build a dataset of single arguments for both a correct and incorrect answer option in a debate-style set-up as an initial step in training models to produce explanations for two candidate answers. We use long contexts -- humans familiar with the context write convincing explanations for pre-selected correct and incorrect answers, and we test if those explanations allow humans who have not read the full context to more accurately…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Software Engineering Research