A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks
Maria Grammatopoulou, Aris Kanellopoulos, Kyriakos G.~Vamvoudakis,, Nathan Lau

TL;DR
This paper introduces a multi-step, resilient Q-learning algorithm for IoT water networks that incorporates human feedback to improve fault detection and containment, demonstrated on Arlington's water system.
Contribution
It presents a novel predictive and resilient Q-learning approach that integrates human operator feedback for IoT network management under faults and attacks.
Findings
Effective in avoiding attack locations and faults
Adapts to irregular operations using historical data
Enhances water network resilience and containment
Abstract
We consider the problem of recommending resilient and predictive actions for an IoT network in the presence of faulty components, considering the presence of human operators manipulating the information of the environment the agent sees for containment purposes. The IoT network is formulated as a directed graph with a known topology whose objective is to maintain a constant and resilient flow between a source and a destination node. The optimal route through this network is evaluated via a predictive and resilient Q-learning algorithm which takes into account historical data about irregular operation, due to faults, as well as the feedback from the human operators that are considered to have extra information about the status of the network concerning locations likely to be targeted by attacks. To showcase our method, we utilize anonymized data from Arlington County, Virginia, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Data Stream Mining Techniques · Smart Grid Energy Management
MethodsQ-Learning
