PRIMA: Planner-Reasoner Inside a Multi-task Reasoning Agent
Daoming Lyu, Bo Liu, and Jianshu Chen

TL;DR
This paper introduces PRIMA, a Planner-Reasoner framework that enables multi-task reasoning with high efficiency by selecting relevant skills for each task, trained via deep reinforcement learning and validated across various domains.
Contribution
The paper presents a novel Planner-Reasoner architecture that balances reasoning capability and efficiency in multi-task logic reasoning, trained end-to-end with reinforcement learning.
Findings
Achieves state-of-the-art multi-task reasoning performance.
Effectively balances reasoning skills and task-specific efficiency.
Validated across multiple domains with strong results.
Abstract
We consider the problem of multi-task reasoning (MTR), where an agent can solve multiple tasks via (first-order) logic reasoning. This capability is essential for human-like intelligence due to its strong generalizability and simplicity for handling multiple tasks. However, a major challenge in developing effective MTR is the intrinsic conflict between reasoning capability and efficiency. An MTR-capable agent must master a large set of "skills" to tackle diverse tasks, but executing a particular task at the inference stage requires only a small subset of immediately relevant skills. How can we maintain broad reasoning capability and also efficient specific-task performance? To address this problem, we propose a Planner-Reasoner framework capable of state-of-the-art MTR capability and high efficiency. The Reasoner models shareable (first-order) logic deduction rules, from which the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Topic Modeling · Reinforcement Learning in Robotics
