PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring
Thanveer Shaik, Xiaohui Tao, Lin Li, Haoran Xie, U R Acharya, Raj, Gururajan, Xujuan Zhou

TL;DR
This paper introduces PDRL, a multi-agent deep reinforcement learning framework for predictive monitoring in complex environments, demonstrating improved performance in health, traffic, and weather forecasting tasks.
Contribution
The study presents a novel multi-agent deep reinforcement learning system that enhances predictive monitoring accuracy and adaptability in uncertain environments.
Findings
Outperforms baseline models in health monitoring tasks.
Achieves state-of-the-art results in time series forecasting.
Demonstrates transfer learning potential for traffic and weather prediction.
Abstract
Reinforcement learning has been increasingly applied in monitoring applications because of its ability to learn from previous experiences and can make adaptive decisions. However, existing machine learning-based health monitoring applications are mostly supervised learning algorithms, trained on labels and they cannot make adaptive decisions in an uncertain complex environment. This study proposes a novel and generic system, predictive deep reinforcement learning (PDRL) with multiple RL agents in a time series forecasting environment. The proposed generic framework accommodates virtual Deep Q Network (DQN) agents to monitor predicted future states of a complex environment with a well-defined reward policy so that the agent learns existing knowledge while maximizing their rewards. In the evaluation process of the proposed framework, three DRL agents were deployed to monitor a subject's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality Monitoring and Forecasting · Traffic Prediction and Management Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM
