Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing
Adarsh M. Subramaniam, Argyrios Gerogiannis, James Z. Hare, Venugopal, V. Veeravalli

TL;DR
This paper introduces Track-MDP, a novel Markov Decision Process formulation for target tracking with controlled sensing, enabling reinforcement learning solutions even when the target's motion model is unknown, and guarantees effective tracking.
Contribution
The paper proposes Track-MDP, a new MDP-based framework for target tracking that simplifies POMDP complexities and is solvable via reinforcement learning, ensuring reliable tracking performance.
Findings
Track-MDP achieves the same infinite horizon reward as the optimal POMDP.
RL-based Track-MDP policies can accurately track targets in simulations.
The method guarantees tracking of all significant target paths with certainty.
Abstract
State of the art methods for target tracking with sensor management (or controlled sensing) are model-based and are obtained through solutions to Partially Observable Markov Decision Process (POMDP) formulations. In this paper a Reinforcement Learning (RL) approach to the problem is explored for the setting where the motion model for the object/target to be tracked is unknown to the observer. It is assumed that the target dynamics are stationary in time, the state space and the observation space are discrete, and there is complete observability of the location of the target under certain (a priori unknown) sensor control actions. Then, a novel Markov Decision Process (MDP) rather than POMDP formulation is proposed for the tracking problem with controlled sensing, which is termed as Track-MDP. In contrast to the POMDP formulation, the Track-MDP formulation is amenable to an RL based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality Monitoring and Forecasting · Target Tracking and Data Fusion in Sensor Networks · Water Quality Monitoring Technologies
