WildSci: Advancing Scientific Reasoning from In-the-Wild Literature
Tengxiao Liu, Deepak Nathani, Zekun Li, Kevin Yang, William Yang Wang

TL;DR
WildSci introduces a large, automatically generated dataset of scientific questions across multiple disciplines, enabling improved reasoning in LLMs through reinforcement learning and scalable training methods.
Contribution
The paper presents WildSci, a novel dataset of scientific questions from literature, and demonstrates its effectiveness in training LLMs for scientific reasoning across diverse domains.
Findings
Reinforcement learning improves model performance on scientific benchmarks.
WildSci enables scalable training for complex scientific reasoning tasks.
Models show domain-specific performance and generalization trends.
Abstract
Recent progress in large language model (LLM) reasoning has focused on domains like mathematics and coding, where abundant high-quality data and objective evaluation metrics are readily available. In contrast, progress in LLM reasoning models remains limited in scientific domains such as medicine and materials science due to limited dataset coverage and the inherent complexity of open-ended scientific questions. To address these challenges, we introduce WildSci, a new dataset of domain-specific science questions automatically synthesized from peer-reviewed literature, covering 9 scientific disciplines and 26 subdomains. By framing complex scientific reasoning tasks in a multiple-choice format, we enable scalable training with well-defined reward signals. We further apply reinforcement learning to finetune models on these data and analyze the resulting training dynamics, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Materials Science · Multimodal Machine Learning Applications
