Learning to Generate All Feasible Actions

Mirco Theile; Daniele Bernardini; Raphael Trumpp; Cristina Piazza,; Marco Caccamo; Alberto L. Sangiovanni-Vincentelli

arXiv:2301.11461·cs.LG·July 8, 2024

Learning to Generate All Feasible Actions

Mirco Theile, Daniele Bernardini, Raphael Trumpp, Cristina Piazza,, Marco Caccamo, Alberto L. Sangiovanni-Vincentelli

PDF

Open Access

TL;DR

This paper introduces a novel method for reinforcement learning in complex cyber-physical systems by generating all feasible actions through self-supervised learning, improving safety and efficiency in constrained environments.

Contribution

It proposes action mapping, a two-step learning approach that first models feasibility and then maps actions, enhancing the generation of feasible actions in RL.

Findings

01

Successfully generates all feasible actions in disconnected sets

02

Demonstrates effectiveness in robotic path planning and grasping

03

Addresses safety constraints in reinforcement learning

Abstract

Modern cyber-physical systems are becoming increasingly complex to model, thus motivating data-driven techniques such as reinforcement learning (RL) to find appropriate control agents. However, most systems are subject to hard constraints such as safety or operational bounds. Typically, to learn to satisfy these constraints, the agent must violate them systematically, which is computationally prohibitive in most systems. Recent efforts aim to utilize feasibility models that assess whether a proposed action is feasible to avoid applying the agent's infeasible action proposals to the system. However, these efforts focus on guaranteeing constraint satisfaction rather than the agent's learning efficiency. To improve the learning process, we introduce action mapping, a novel approach that divides the learning process into two steps: first learn feasibility and subsequently, the objective by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robot Manipulation and Learning