PRIMA: Planner-Reasoner Inside a Multi-task Reasoning Agent

Daoming Lyu; Bo Liu; and Jianshu Chen

arXiv:2202.00531·cs.AI·February 15, 2022

PRIMA: Planner-Reasoner Inside a Multi-task Reasoning Agent

Daoming Lyu, Bo Liu, and Jianshu Chen

PDF

Open Access

TL;DR

This paper introduces PRIMA, a Planner-Reasoner framework that enables multi-task reasoning with high efficiency by selecting relevant skills for each task, trained via deep reinforcement learning and validated across various domains.

Contribution

The paper presents a novel Planner-Reasoner architecture that balances reasoning capability and efficiency in multi-task logic reasoning, trained end-to-end with reinforcement learning.

Findings

01

Achieves state-of-the-art multi-task reasoning performance.

02

Effectively balances reasoning skills and task-specific efficiency.

03

Validated across multiple domains with strong results.

Abstract

We consider the problem of multi-task reasoning (MTR), where an agent can solve multiple tasks via (first-order) logic reasoning. This capability is essential for human-like intelligence due to its strong generalizability and simplicity for handling multiple tasks. However, a major challenge in developing effective MTR is the intrinsic conflict between reasoning capability and efficiency. An MTR-capable agent must master a large set of "skills" to tackle diverse tasks, but executing a particular task at the inference stage requires only a small subset of immediately relevant skills. How can we maintain broad reasoning capability and also efficient specific-task performance? To address this problem, we propose a Planner-Reasoner framework capable of state-of-the-art MTR capability and high efficiency. The Reasoner models shareable (first-order) logic deduction rules, from which the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Topic Modeling · Reinforcement Learning in Robotics