It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension
Sagi Shaier, Lawrence E Hunter, Katharina von der Wense

TL;DR
This paper demonstrates that simple input reordering and emphasis techniques, such as concatenating tokens, significantly improve reading comprehension performance of large language models, often surpassing larger models.
Contribution
It introduces a straightforward method of input emphasis and ordering that enhances model accuracy, challenging the assumption that complex techniques are necessary.
Findings
Reordering context before questions boosts accuracy by up to 31%.
Emphasizing context outperforms question emphasis in improving results.
Simple token concatenation yields up to 36% accuracy gains, enabling smaller models to outperform larger ones.
Abstract
Natural language processing has seen rapid progress over the past decade. Due to the speed of developments, some practices get established without proper evaluation. Considering one such case and focusing on reading comprehension, we ask our first research question: 1) How does the order of inputs -- i.e., question and context -- affect model performance? Additionally, given recent advancements in input emphasis, we ask a second research question: 2) Does emphasizing either the question, the context, or both enhance performance? Experimenting with 9 large language models across 3 datasets, we find that presenting the context before the question improves model performance, with an accuracy increase of up to . Furthermore, emphasizing the context yields superior results compared to question emphasis, and in general, emphasizing parts of the input is particularly effective for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEducation and Critical Thinking Development · Reading and Literacy Development
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
