RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving

Zepeng Ding; Dixuan Wang; Ziqin Luo; Guochao Jiang; Deqing Yang; Jiaqing Liang

arXiv:2505.11893·cs.CL·May 20, 2025

RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving

Zepeng Ding, Dixuan Wang, Ziqin Luo, Guochao Jiang, Deqing Yang, Jiaqing Liang

PDF

Open Access

TL;DR

RLAP introduces a reinforcement learning-based adaptive planning framework that models multi-step NLP tasks as MDPs, enabling LLMs to determine optimal subtask sequences by considering linguistic features, improving task performance.

Contribution

The paper presents a novel RL-enhanced adaptive planning framework that models NLP tasks as MDPs and trains an Actor to optimize subtask ordering based on linguistic features.

Findings

01

RLAP outperforms baseline methods on multiple NLP datasets.

02

Incorporating linguistic features improves subtask sequencing.

03

RLAP demonstrates robustness across different NLP tasks.

Abstract

Multi-step planning has been widely employed to enhance the performance of large language models (LLMs) on downstream natural language processing (NLP) tasks, which decomposes the original task into multiple subtasks and guide LLMs to solve them sequentially without additional training. When addressing task instances, existing methods either preset the order of steps or attempt multiple paths at each step. However, these methods overlook instances' linguistic features and rely on the intrinsic planning capabilities of LLMs to evaluate intermediate feedback and then select subtasks, resulting in suboptimal outcomes. To better solve multi-step NLP tasks with LLMs, in this paper we propose a Reinforcement Learning enhanced Adaptive Planning framework (RLAP). In our framework, we model an NLP task as a Markov decision process (MDP) and employ an LLM directly into the environment. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications