ImpRIF: Stronger Implicit Reasoning Leads to Better Complex Instruction Following
Yuancheng Yang, Lin Yang, Xu Wang, Chao Tong, Haihua Yang

TL;DR
This paper introduces ImpRIF, a method that improves large language models' ability to follow complex instructions involving implicit reasoning by formalizing instructions as verifiable reasoning graphs and training models accordingly.
Contribution
ImpRIF formalizes implicit reasoning instructions as verifiable graphs and employs graph-driven training to enhance LLMs' complex instruction following capabilities.
Findings
Models with ImpRIF outperform base models on five benchmarks.
Synthesized large-scale data improves reasoning accuracy.
Reinforcement learning further enhances implicit reasoning skills.
Abstract
As applications of large language models (LLMs) become increasingly complex, the demand for robust complex instruction following capabilities is growing accordingly. We argue that a thorough understanding of the instruction itself, especially the latent reasoning structure embedded between the lines, is crucial for improving instruction following. Therefore we target complex instructions that involve implicit reasoning, intricate logical relations, and multi-constraint dependencies. We propose ImpRIF, a method to enhance LLMs' understanding of implicit reasoning instructions, thereby improving its ability to follow complex instructions. We formalize such instructions as verifiable reasoning graphs, enabling programmatic verification and graph-driven chain-of-thought reasoning. Based on this formulation, we synthesize large-scale single- and multi-turn data, propose fine-tuning with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
