Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Nishant Balepur, Abhilasha Ravichander, Rachel Rudinger

TL;DR
This paper investigates whether large language models can answer multiple-choice questions using only the choices, revealing that they often infer questions or use group dynamics rather than memorization, challenging assumptions about their reasoning abilities.
Contribution
The study demonstrates that LLMs can perform reasonably well on choices-only MCQA by inferring questions and group choice patterns, highlighting the need for more robust evaluation methods.
Findings
LLMs outperform majority baseline in most choices-only MCQA cases.
Choices-only accuracy is not solely due to memorization.
LLMs can sometimes infer the original question from choices.
Abstract
Multiple-choice question answering (MCQA) is often used to evaluate large language models (LLMs). To see if MCQA assesses LLMs as intended, we probe if LLMs can perform MCQA with choices-only prompts, where models must select the correct answer only from the choices. In three MCQA datasets and four LLMs, this prompt bests a majority baseline in 11/12 cases, with up to 0.33 accuracy gain. To help explain this behavior, we conduct an in-depth, black-box analysis on memorization, choice dynamics, and question inference. Our key findings are threefold. First, we find no evidence that the choices-only accuracy stems from memorization alone. Second, priors over individual choices do not fully explain choices-only accuracy, hinting that LLMs use the group dynamics of choices. Third, LLMs have some ability to infer a relevant question from choices, and surprisingly can sometimes even match the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsArtificial Intelligence in Law · Legal Education and Practice Innovations · Dispute Resolution and Class Actions
