Using reinforcement learning to autonomously identify sources of error for agents in group missions
Keishu Utimula, Ken-taro Hayaschi, Trevor J. Bihl, Kenta Hongo, Ryo, Maezono

TL;DR
This paper explores using Q-table reinforcement learning to enable agents in a swarm to autonomously identify whether failures are due to actuators or sensors by generating action plans that induce observable displacements.
Contribution
It introduces a novel application of reinforcement learning for autonomous cause identification in agent failures, overcoming gradient limitations in traditional optimization methods.
Findings
Reinforcement learning successfully generated human-like failure cause pinpointing actions.
Q-table approach effectively handled sparse gradient scenarios.
Demonstrated potential for autonomous failure analysis in swarm systems.
Abstract
When agents swarm to execute a mission, some of them frequently exhibit sudden failure, as observed from the command base. It is generally difficult to determine whether a failure is caused by actuators (hypothesis, ) or sensors (hypothesis, ) by solely relying on the communication between the command base and concerning agent. However, by instigating collusion between the agents, the cause of failure can be identified; in other words, we expect to detect corresponding displacements for but not for . In this study, we considered the question as to whether artificial intelligence can autonomously generate an action plan to pinpoint the cause as aforedescribed. Because the expected response to generally depends upon the adopted hypothesis [let the difference be denoted by ], a formulation that uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Systems Engineering Methodologies and Applications · Infrastructure Resilience and Vulnerability Analysis
