PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring

Thanveer Shaik; Xiaohui Tao; Lin Li; Haoran Xie; U R Acharya; Raj; Gururajan; Xujuan Zhou

arXiv:2309.10576·cs.LG·October 28, 2024

PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring

Thanveer Shaik, Xiaohui Tao, Lin Li, Haoran Xie, U R Acharya, Raj, Gururajan, Xujuan Zhou

PDF

Open Access

TL;DR

This paper introduces PDRL, a multi-agent deep reinforcement learning framework for predictive monitoring in complex environments, demonstrating improved performance in health, traffic, and weather forecasting tasks.

Contribution

The study presents a novel multi-agent deep reinforcement learning system that enhances predictive monitoring accuracy and adaptability in uncertain environments.

Findings

01

Outperforms baseline models in health monitoring tasks.

02

Achieves state-of-the-art results in time series forecasting.

03

Demonstrates transfer learning potential for traffic and weather prediction.

Abstract

Reinforcement learning has been increasingly applied in monitoring applications because of its ability to learn from previous experiences and can make adaptive decisions. However, existing machine learning-based health monitoring applications are mostly supervised learning algorithms, trained on labels and they cannot make adaptive decisions in an uncertain complex environment. This study proposes a novel and generic system, predictive deep reinforcement learning (PDRL) with multiple RL agents in a time series forecasting environment. The proposed generic framework accommodates virtual Deep Q Network (DQN) agents to monitor predicted future states of a complex environment with a well-defined reward policy so that the agent learns existing knowledge while maximizing their rewards. In the evaluation process of the proposed framework, three DRL agents were deployed to monitor a subject's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAir Quality Monitoring and Forecasting · Traffic Prediction and Management Techniques

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM