{\dag}DAGGER: Distractor-Aware Graph Generation for Executable Reasoning in Math Problems

Zabir Al Nazi; Shubhashis Roy Dipta; Sudipta Kar

arXiv:2601.06853·cs.CL·March 31, 2026

{\dag}DAGGER: Distractor-Aware Graph Generation for Executable Reasoning in Math Problems

Zabir Al Nazi, Shubhashis Roy Dipta, Sudipta Kar

PDF

6 Models 1 Datasets

TL;DR

DAGGER introduces a graph-based approach to improve the robustness and efficiency of mathematical reasoning models in noisy, low-resource environments by explicitly modeling distractors.

Contribution

The paper proposes DAGGER, a novel method that reformulates math problem solving as executable graph generation, enhancing robustness without training on distractor-augmented data.

Findings

01

Models degrade significantly with distractors, up to 41 points.

02

DAGGER achieves comparable accuracy with 89% fewer tokens.

03

Structured representations improve robustness in noisy settings.

Abstract

Chain-of-Thought (CoT) prompting is widely adopted for mathematical problem solving, including in low-resource languages, yet its behavior under irrelevant context remains underexplored. To systematically study this challenge, we introduce DISTRACTMATH-BN, a Bangla benchmark that augments MGSM and MSVAMP with semantically coherent but computationally irrelevant information. Evaluating seven models ranging from 3B to 12B parameters, we observe substantial performance degradation under distractors: standard models drop by up to 41 points, while reasoning-specialized models decline by 14 to 20 points despite consuming five times more tokens. We propose {\dag}DAGGER, which reformulates mathematical problem solving as executable computational graph generation with explicit modeling of distractor nodes. Fine-tuning Gemma-3 models using supervised fine-tuning followed by Group Relative Policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

dipta007/DistractMath-Bn
dataset· 32 dl
32 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.