LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Qingyu Ren; Qianyu He; Jingwen Chang; Jie Zeng; Jiaqing Liang; Yanghua Xiao; Han Xia; Zeye Sun; Fei Yu

arXiv:2601.06431·cs.AI·January 15, 2026

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Qingyu Ren, Qianyu He, Jingwen Chang, Jie Zeng, Jiaqing Liang, Yanghua Xiao, Han Xia, Zeye Sun, Fei Yu

PDF

Open Access

TL;DR

This paper introduces LSRIF, a framework that explicitly models logical structures in instructions to improve large language models' ability to follow complex, structured instructions and enhance reasoning capabilities.

Contribution

The paper presents a novel logic-structured training framework and dataset that explicitly incorporate logical dependencies, improving instruction following and reasoning in language models.

Findings

01

Significant improvements in instruction-following accuracy

02

Enhanced reasoning capabilities in models trained with LSRIF

03

Parameter updates focus on attention layers and logical operators

Abstract

Instruction-following is critical for large language models, but real-world instructions often contain logical structures such as sequential dependencies and conditional branching. Existing methods typically construct datasets with parallel constraints and optimize average rewards, ignoring logical dependencies and yielding noisy signals. We propose a logic-structured training framework LSRIF that explicitly models instruction logic. We first construct a dataset LSRInstruct with constraint structures such as parallel, sequential, and conditional types, and then design structure-aware rewarding method LSRIF including average aggregation for parallel structures, failure-penalty propagation for sequential structures, and selective rewards for conditional branches. Experiments show LSRIF brings significant improvements in instruction-following (in-domain and out-of-domain) and general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications