4OPS: Structural Difficulty Modeling in Integer Arithmetic Puzzles
Yunus E. Zeytuncu

TL;DR
This paper introduces a structural difficulty model for integer arithmetic puzzles, using an exact solver to analyze features that determine puzzle difficulty, enabling explainable and scalable difficulty estimation.
Contribution
It develops an exact dynamic programming solver and a large dataset to identify structural attributes that fully determine puzzle difficulty, advancing explainable difficulty modeling.
Findings
Difficulty correlates with minimal construction size.
Baseline models struggle to distinguish easy puzzles.
Structural features fully explain puzzle difficulty.
Abstract
Arithmetic puzzle games provide a controlled setting for studying difficulty in mathematical reasoning tasks, a core challenge in adaptive learning systems. We investigate the structural determinants of difficulty in a class of integer arithmetic puzzles inspired by number games. We formalize the problem and develop an exact dynamic-programming solver that enumerates reachable targets, extracts minimal-operation witnesses, and enables large-scale labeling. Using this solver, we construct a dataset of over 3.4 million instances and define difficulty via the minimum number of operations required to reach a target. We analyze the relationship between difficulty and solver-derived features. While baseline machine learning models based on bag- and target-level statistics can partially predict solvability, they fail to reliably distinguish easy instances. In contrast, we show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive and developmental aspects of mathematical skills · Intelligent Tutoring Systems and Adaptive Learning · Mathematics Education and Teaching Techniques
