SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers
Shruti Singh, Nandan Sarkar, Arman Cohan

TL;DR
SciDQA is a rigorously curated dataset of 2,937 scientific QA pairs sourced from peer reviews and author responses, designed to challenge LLMs in deep, multi-document scientific comprehension involving reasoning across various complex materials.
Contribution
The paper introduces SciDQA, a new scientific reading comprehension dataset with high-quality, naturally derived questions that require multi-document reasoning and are sourced from expert peer reviews and author answers.
Findings
LLMs show significant performance gaps on SciDQA
Questions require reasoning across figures, tables, and supplementary materials
Dataset enables evaluation of deep scientific understanding in LLMs
Abstract
Scientific literature is typically dense, requiring significant background knowledge and deep comprehension for effective engagement. We introduce SciDQA, a new dataset for reading comprehension that challenges LLMs for a deep understanding of scientific articles, consisting of 2,937 QA pairs. Unlike other scientific QA datasets, SciDQA sources questions from peer reviews by domain experts and answers by paper authors, ensuring a thorough examination of the literature. We enhance the dataset's quality through a process that carefully filters out lower quality questions, decontextualizes the content, tracks the source document across different versions, and incorporates a bibliography for multi-document question-answering. Questions in SciDQA necessitate reasoning across figures, tables, equations, appendices, and supplementary materials, and require multi-document reasoning. We evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
