Resilient Computing with Reinforcement Learning on a Dynamical System: Case Study in Sorting
Aleksandra Faust, James B. Aimone, Conrad D. James, Lydia Tapia

TL;DR
This paper presents a reinforcement learning-based approach to resilient computation, framing sorting as a feedback control problem, demonstrating robustness and efficiency improvements over traditional algorithms.
Contribution
It introduces a novel formulation of computation as a feedback-control problem solved with RL, enhancing resilience and efficiency in array sorting tasks.
Findings
RL sorting agent achieves asymptotic stability and steady progress.
The approach is resilient to faulty components.
It performs fewer array manipulations than traditional algorithms.
Abstract
Robots and autonomous agents often complete goal-based tasks with limited resources, relying on imperfect models and sensor measurements. In particular, reinforcement learning (RL) and feedback control can be used to help a robot achieve a goal. Taking advantage of this body of work, this paper formulates general computation as a feedback-control problem, which allows the agent to autonomously overcome some limitations of standard procedural language programming: resilience to errors and early program termination. Our formulation considers computation to be trajectory generation in the program's variable space. The computing then becomes a sequential decision making problem, solved with reinforcement learning (RL), and analyzed with Lyapunov stability theory to assess the agent's resilience and progression to the goal. We do this through a case study on a quintessential computer science…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Optimization and Search Problems · Evolutionary Algorithms and Applications
