A2: Extracting Cyclic Switchings from DOB-nets for Rejecting Excessive   Disturbances

Wenjie Lu; Dikai Liu

arXiv:1911.00165·cs.RO·November 4, 2019

A2: Extracting Cyclic Switchings from DOB-nets for Rejecting Excessive Disturbances

Wenjie Lu, Dikai Liu

PDF

Open Access

TL;DR

This paper introduces an attention-based method to extract finite-state automata from DOB-nets trained with RL, revealing cyclic switching patterns that enhance understanding and robustness against external disturbances.

Contribution

It proposes the A² approach to extract a Key Moore Machine Network from DOB-nets, providing insights into their switching mechanisms under external disturbances.

Findings

01

Identified three cyclic switching patterns in DOB-net behavior.

02

Discovered control saturation synchronized with unknown disturbances.

03

Linked switching mechanisms to hybrid control design principles.

Abstract

Reinforcement Learning (RL) is limited in practice by its gray-box nature, which is responsible for insufficient trustiness from users, unsatisfied interpretation for human intervention, inadequate analysis for future improvement, etc. This paper seeks to partially characterize the interplay between dynamical environments and the DOB-net. The DOB-net obtained from RL solves a set of Partially Observable Markovian Decision Processes (POMDPs). The transition function of each POMDP is largely determined by the environments, which are excessive external disturbances in this research. This paper proposes an Attention-based Abstraction (A $^{2}$ ) approach to extract a finite-state automaton, referred to as a Key Moore Machine Network (KMMN), to capture the switching mechanisms exhibited by the DOB-net in dealing with multiple such POMDPs. This approach first quantizes the controlled platform…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Fault Detection and Control Systems · Smart Grid Security and Resilience