Leveraging Large Language Models for Multiple Choice Question Answering
Joshua Robinson, Christopher Michael Rytting, David Wingate

TL;DR
This paper explores how large language models can better answer multiple choice questions by explicitly comparing options, introducing the concept of multiple choice symbol binding (MCSB), and demonstrating improved performance across diverse datasets.
Contribution
The paper introduces the MCSB ability as crucial for effective MCQA and shows that models with high MCSB significantly improve accuracy using a natural prompting approach.
Findings
Models with high MCSB outperform traditional methods.
Natural prompting reduces computational costs.
Performance gap with SOTA is largely closed.
Abstract
While large language models (LLMs) like GPT-3 have achieved impressive results on multiple choice question answering (MCQA) tasks in the zero, one, and few-shot settings, they generally lag behind the MCQA state of the art (SOTA). MCQA tasks have traditionally been presented to LLMs like cloze tasks. An LLM is conditioned on a question (without the associated answer options) and its chosen option is the one assigned the highest probability after normalization (for length, etc.). A more natural prompting approach is to present the question and answer options to the LLM jointly and have it output the symbol (e.g., "A") associated with its chosen answer option. This approach allows the model to explicitly compare answer options, reduces computational costs, and mitigates the effects of tokenization scheme and answer option representations on answer selection. For the natural approach to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · 15 Ways to Contact How can i speak to someone at Delta Airlines · {Dispute@FaQ-s}How to file a dispute with Expedia? · Adam · Attention Dropout · Layer Normalization · Cosine Annealing
