Anomaly Detection Under Controlled Sensing Using Actor-Critic   Reinforcement Learning

Geethu Joseph; M. Cenk Gursoy; Pramod K. Varshney

arXiv:2006.01044·eess.SP·June 2, 2020

Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning

Geethu Joseph, M. Cenk Gursoy, Pramod K. Varshney

PDF

TL;DR

This paper introduces a deep reinforcement learning-based method for sequential anomaly detection in multiple processes, optimizing sensor selection to balance detection speed and sensing costs without prior knowledge of the number of anomalies.

Contribution

It formulates the anomaly detection as a Markov decision process and applies actor-critic reinforcement learning to develop an adaptive, low-complexity sensor selection policy.

Findings

01

Effective detection with minimal sensing cost

02

Adaptability to unknown process dependence patterns

03

Good detection accuracy demonstrated in experiments

Abstract

We consider the problem of detecting anomalies among a given set of processes using their noisy binary sensor measurements. The noiseless sensor measurement corresponding to a normal process is 0, and the measurement is 1 if the process is anomalous. The decision-making algorithm is assumed to have no knowledge of the number of anomalous processes. The algorithm is allowed to choose a subset of the sensors at each time instant until the confidence level on the decision exceeds the desired value. Our objective is to design a sequential sensor selection policy that dynamically determines which processes to observe at each time and when to terminate the detection algorithm. The selection policy is designed such that the anomalous processes are detected with the desired confidence level while incurring minimum cost which comprises the delay in detection and the cost of sensing. We cast this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.