ProSpec RL: Plan Ahead, then Execute

Liangliang Liu; Yi Guan; BoRan Wang; Rujia Shen; Yi Lin; Chaoran Kong,; Lian Yan; Jingchi Jiang

arXiv:2407.21359·cs.LG·August 1, 2024

ProSpec RL: Plan Ahead, then Execute

Liangliang Liu, Yi Guan, BoRan Wang, Rujia Shen, Yi Lin, Chaoran Kong,, Lian Yan, Jingchi Jiang

PDF

Open Access

TL;DR

ProSpec RL introduces a prospective planning approach in reinforcement learning by imagining future trajectories, improving decision-making, safety, and data efficiency, validated on DMControl benchmarks.

Contribution

The paper presents ProSpec RL, a novel method integrating future trajectory imagination and cycle consistency for safer, more efficient reinforcement learning.

Findings

01

Achieved significant performance improvements on DMControl benchmarks.

02

Enhanced safety by avoiding irreversible states through cycle consistency.

03

Improved data efficiency via virtual trajectory augmentation.

Abstract

Imagining potential outcomes of actions before execution helps agents make more informed decisions, a prospective thinking ability fundamental to human cognition. However, mainstream model-free Reinforcement Learning (RL) methods lack the ability to proactively envision future scenarios, plan, and guide strategies. These methods typically rely on trial and error to adjust policy functions, aiming to maximize cumulative rewards or long-term value, even if such high-reward decisions place the environment in extremely dangerous states. To address this, we propose the Prospective (ProSpec) RL method, which makes higher-value, lower-risk optimal decisions by imagining future n-stream trajectories. Specifically, ProSpec employs a dynamic model to predict future states (termed "imagined states") based on the current state and a series of sampled actions. Furthermore, we integrate the concept…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning