Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement

Mingyu Xu; Cheng Fang; Keyue Jiang; Yuqian Zheng; Yanghua Xiao; Baojian Zhou; Qifang Zhao; Suhang Zheng; Xiuwen Zhu; Jiyang Tang; Yongchi Zhao; Yijia Luo; Zhiqi Bai; Yuchi Xu; Wenbo Su; Wei Wang; Bing Zhao; Lin Qu; Xiaoxiao Xu

arXiv:2601.01562 · cs.AI · January 21, 2026

Open Access · 3 Models · 1 Dataset

TL;DR

Logics-STEM is a reasoning model fine-tuned on a large, high-quality STEM dataset with a failure-driven post-training approach. It improves on the next-best 8B-scale model by an average of 4.68% on STEM benchmarks.

Contribution

The paper introduces Logics-STEM, which combines large-scale dataset construction with failure-driven post-training to enhance LLM reasoning on STEM tasks.

Findings

01 Achieved an average improvement of 4.68% over the next-best 8B-scale model on STEM-related benchmarks.

02 Developed a 10M-scale, high-quality dataset with a 5-stage curation process.

03 Demonstrated the effectiveness of data-algorithm co-design for enhancing reasoning.

Abstract

We present Logics-STEM, a state-of-the-art reasoning model fine-tuned on Logics-STEM-SFT-Dataset, a high-quality and diverse dataset at 10M scale that represents one of the largest-scale open-source long chain-of-thought corpora. Logics-STEM targets reasoning tasks in the domains of Science, Technology, Engineering, and Mathematics (STEM), and exhibits exceptional performance on STEM-related benchmarks with an average improvement of 4.68% over the next-best model at 8B scale. We attribute the gains to our data-algorithm co-design engine, in which data and algorithm are jointly optimized to fit a gold-standard distribution behind reasoning. Data-wise, the Logics-STEM-SFT-Dataset is constructed from a meticulously designed data curation engine with 5 stages to ensure quality, diversity, and scalability: annotation, deduplication, decontamination, distillation, and stratified sampling.…
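
To make three of the five curation stages named in the abstract more concrete, the following is a minimal sketch of deduplication, decontamination, and stratified sampling. It is not the authors' pipeline: the abstract gives no implementation details, so the exact-match hashing, the word 5-gram overlap rule, the record fields `question` and `domain`, and every function name here are assumptions chosen for illustration. Annotation and distillation are omitted because they would require a labeling or teacher model.

```python
import hashlib
import random
from collections import defaultdict


def normalize(text):
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(text.lower().split())


def deduplicate(records):
    """Keep the first record for each normalized question (exact-match dedup, an assumption)."""
    seen, kept = set(), []
    for rec in records:
        key = hashlib.sha256(normalize(rec["question"]).encode("utf-8")).hexdigest()
        if key not in seen:
            seen.add(key)
            kept.append(rec)
    return kept


def ngrams(text, n):
    """Return the set of word n-grams of the normalized text."""
    tokens = normalize(text).split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def decontaminate(records, benchmark_questions, n=5):
    """Drop records that share any word n-gram with a benchmark question (hypothetical rule)."""
    banned = set()
    for question in benchmark_questions:
        banned |= ngrams(question, n)
    return [rec for rec in records if not (ngrams(rec["question"], n) & banned)]


def stratified_sample(records, per_stratum, key="domain", seed=0):
    """Sample up to `per_stratum` records from each stratum, visiting strata in sorted order."""
    rng = random.Random(seed)
    buckets = defaultdict(list)
    for rec in records:
        buckets[rec[key]].append(rec)
    sample = []
    for name in sorted(buckets):
        bucket = buckets[name]
        sample.extend(rng.sample(bucket, min(per_stratum, len(bucket))))
    return sample


def curate(records, benchmark_questions, per_stratum):
    """Chain the three illustrated stages: dedup, then decontaminate, then sample."""
    clean = decontaminate(deduplicate(records), benchmark_questions)
    return stratified_sample(clean, per_stratum)
```

Running deduplication before decontamination keeps the n-gram scan from repeating work on identical questions, and sampling last means the per-stratum quota is drawn only from records that survived both filters.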

Code & Models

Datasets

Logics-MLLM/Logics-STEM-SFT-Dataset-Open-1.6M
dataset · 2.2k downloads

Taxonomy

Topics: Topic Modeling · Advanced Graph Neural Networks · Machine Learning in Materials Science