Learning to Generate All Feasible Actions
Mirco Theile, Daniele Bernardini, Raphael Trumpp, Cristina Piazza,, Marco Caccamo, Alberto L. Sangiovanni-Vincentelli

TL;DR
This paper introduces a novel method for reinforcement learning in complex cyber-physical systems by generating all feasible actions through self-supervised learning, improving safety and efficiency in constrained environments.
Contribution
It proposes action mapping, a two-step learning approach that first models feasibility and then maps actions, enhancing the generation of feasible actions in RL.
Findings
Successfully generates all feasible actions in disconnected sets
Demonstrates effectiveness in robotic path planning and grasping
Addresses safety constraints in reinforcement learning
Abstract
Modern cyber-physical systems are becoming increasingly complex to model, thus motivating data-driven techniques such as reinforcement learning (RL) to find appropriate control agents. However, most systems are subject to hard constraints such as safety or operational bounds. Typically, to learn to satisfy these constraints, the agent must violate them systematically, which is computationally prohibitive in most systems. Recent efforts aim to utilize feasibility models that assess whether a proposed action is feasible to avoid applying the agent's infeasible action proposals to the system. However, these efforts focus on guaranteeing constraint satisfaction rather than the agent's learning efficiency. To improve the learning process, we introduce action mapping, a novel approach that divides the learning process into two steps: first learn feasibility and subsequently, the objective by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Robot Manipulation and Learning
