Learning the Boundary of Solvability: Aligning LLMs to Detect Unsolvable Problems

Dengyun Peng; Qiguang Chen; Bofei Liu; Jiannan Guan; Libo Qin; Zheng Yan; Jinhao Liu; Jianshu Zhang; Wanxiang Che

arXiv:2512.01661·cs.CL·February 3, 2026

Learning the Boundary of Solvability: Aligning LLMs to Detect Unsolvable Problems

Dengyun Peng, Qiguang Chen, Bofei Liu, Jiannan Guan, Libo Qin, Zheng Yan, Jinhao Liu, Jianshu Zhang, Wanxiang Che

PDF

Open Access

TL;DR

This paper introduces UnsolvableQA and UnsolvableRL to improve LLMs' ability to distinguish between unsolvable problems and capability limitations, reducing hallucinations and enhancing reliability.

Contribution

The paper presents a new dataset and reinforcement learning framework for better detection of unsolvable problems in LLMs, addressing a key reliability challenge.

Findings

01

Achieves over 85% unsolvability detection rate

02

Increases reasoning accuracy from 43.4% to 69.4%

03

Identifies data-training interaction effects on model robustness

Abstract

Ensuring large language model (LLM) reliability requires distinguishing objective unsolvability (inherent contradictions) from subjective capability limitations (tasks exceeding model competence). Current LLMs often conflate these dimensions, leading to hallucinations in which they return confident answers to inherently unsolvable queries. To address this issue, we propose a multi-domain dataset containing both solvable and unsolvable questions, UnsolvableQA, together with an alignment framework, UnsolvableRL. First, we construct UnsolvableQA by "Reverse Construction" that systematically injects logical contradictions into otherwise valid reasoning chains. Second, we introduce UnsolvableRL, a reinforcement learning paradigm that balances objective unsolvability detection with calibrated confidence under capability limits. Empirically, our approach achieves robust unsolvability detection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Text Readability and Simplification