I Could've Asked That: Reformulating Unanswerable Questions

Wenting Zhao; Ge Gao; Claire Cardie; Alexander M. Rush

arXiv:2407.17469 · cs.CL · July 25, 2024

TL;DR

This paper introduces CouldAsk, a benchmark for evaluating how well language models can reformulate questions that a given document cannot answer. It finds that current models rarely produce effective reformulations.

Contribution

The paper presents CouldAsk, a benchmark composed of existing and new document-grounded question answering datasets, and evaluates state-of-the-art open-source and proprietary LLMs on it, showing that they rarely generate successful reformulations.

Findings

01 GPT-4 reformulates questions successfully 26% of the time

02 Llama2-7B reformulates questions successfully 12% of the time

03 Most unsuccessful reformulations are mere rephrasings or identical questions

Abstract

When seeking information from unfamiliar documents, users frequently pose questions that cannot be answered by the documents. While existing large language models (LLMs) identify these unanswerable questions, they do not assist users in reformulating their questions, thereby reducing their overall utility. We curate CouldAsk, an evaluation benchmark composed of existing and new datasets for document-grounded question answering, specifically designed to study reformulating unanswerable questions. We evaluate state-of-the-art open-source and proprietary LLMs on CouldAsk. The results demonstrate the limited capabilities of these models in reformulating questions. Specifically, GPT-4 and Llama2-7B successfully reformulate questions only 26% and 12% of the time, respectively. Error analysis shows that 62% of the unsuccessful reformulations stem from the models merely rephrasing the questions…
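As a rough illustration of how numbers like those above could be tallied, the sketch below computes a success rate and the share of failed reformulations that are identical or near-identical to the original question. The record fields (`original`, `reformulated`, `answerable`) and the token-overlap threshold are assumptions made for illustration. They are not CouldAsk's actual schema or metric, and the sketch simplifies "success" to answerability alone.

```python
from dataclasses import dataclass


@dataclass
class Attempt:
    # Hypothetical record; field names are assumptions, not the benchmark's schema.
    original: str
    reformulated: str
    answerable: bool  # can the reformulated question be answered from the document?


def _tokens(text: str) -> set[str]:
    # Lowercase, drop question marks, split on whitespace.
    return set(text.lower().replace("?", " ").split())


def is_rephrase(a: Attempt, threshold: float = 0.8) -> bool:
    """Crude proxy for 'mere rephrasing': Jaccard token overlap >= threshold."""
    x, y = _tokens(a.original), _tokens(a.reformulated)
    if not x or not y:
        return False
    return len(x & y) / len(x | y) >= threshold


def summarize(attempts: list[Attempt]) -> dict[str, float]:
    failures = [a for a in attempts if not a.answerable]
    n = len(attempts)
    return {
        "success_rate": (n - len(failures)) / n if n else 0.0,
        "rephrase_share_of_failures": (
            sum(is_rephrase(a) for a in failures) / len(failures) if failures else 0.0
        ),
    }
```

A stricter notion of success would also check that the reformulated question stays relevant to what the user originally asked. That check is left out of this sketch.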

Code & Models

Repositories

wenting-zhao/couldask (Official)

Taxonomy

Methods: Attention Is All You Need · Adam · Label Smoothing · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Dense Connections