Using reinforcement learning to autonomously identify sources of error   for agents in group missions

Keishu Utimula; Ken-taro Hayaschi; Trevor J. Bihl; Kenta Hongo; Ryo; Maezono

arXiv:2107.09232·cs.RO·November 7, 2023

Using reinforcement learning to autonomously identify sources of error for agents in group missions

Keishu Utimula, Ken-taro Hayaschi, Trevor J. Bihl, Kenta Hongo, Ryo, Maezono

PDF

Open Access

TL;DR

This paper explores using Q-table reinforcement learning to enable agents in a swarm to autonomously identify whether failures are due to actuators or sensors by generating action plans that induce observable displacements.

Contribution

It introduces a novel application of reinforcement learning for autonomous cause identification in agent failures, overcoming gradient limitations in traditional optimization methods.

Findings

01

Reinforcement learning successfully generated human-like failure cause pinpointing actions.

02

Q-table approach effectively handled sparse gradient scenarios.

03

Demonstrated potential for autonomous failure analysis in swarm systems.

Abstract

When agents swarm to execute a mission, some of them frequently exhibit sudden failure, as observed from the command base. It is generally difficult to determine whether a failure is caused by actuators (hypothesis, $h_{a}$ ) or sensors (hypothesis, $h_{s}$ ) by solely relying on the communication between the command base and concerning agent. However, by instigating collusion between the agents, the cause of failure can be identified; in other words, we expect to detect corresponding displacements for $h_{a}$ but not for $h_{s}$ . In this study, we considered the question as to whether artificial intelligence can autonomously generate an action plan $g$ to pinpoint the cause as aforedescribed. Because the expected response to $g$ generally depends upon the adopted hypothesis [let the difference be denoted by $D (g)$ ], a formulation that uses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Systems Engineering Methodologies and Applications · Infrastructure Resilience and Vulnerability Analysis