Optimal Policies Search for Sensor Management

Thomas Br\'ehard (INRIA Futurs); Emmanuel Duflos (INRIA Futurs,; LAGIS); Philippe Vanheeghe (LAGIS); Pierre-Arnaud Coquelin (INRIA Futurs)

arXiv:0903.3329·cs.LG·March 20, 2009

Optimal Policies Search for Sensor Management

Thomas Br\'ehard (INRIA Futurs), Emmanuel Duflos (INRIA Futurs,, LAGIS), Philippe Vanheeghe (LAGIS), Pierre-Arnaud Coquelin (INRIA Futurs)

PDF

Open Access

TL;DR

This paper presents a novel method for sensor management that learns optimal policies offline using stochastic gradient estimation and applies it to radar systems, demonstrating promising simulation results.

Contribution

It introduces a new stochastic gradient-based approach for deriving optimal sensor management policies using IPA, applicable in simulation-based offline learning.

Findings

01

Effective policy learning in radar management demonstrated

02

New gradient approximation method based on IPA introduced

03

Simulation results show promising performance

Abstract

This paper introduces a new approach to solve sensor management problems. Classically sensor management problems can be well formalized as Partially-Observed Markov Decision Processes (POMPD). The original approach developped here consists in deriving the optimal parameterized policy based on a stochastic gradient estimation. We assume in this work that it is possible to learn the optimal policy off-line (in simulation) using models of the environement and of the sensor(s). The learned policy can then be used to manage the sensor(s). In order to approximate the gradient in a stochastic context, we introduce a new method to approximate the gradient, based on Infinitesimal Perturbation Approximation (IPA). The effectiveness of this general framework is illustrated by the managing of an Electronically Scanned Array Radar. First simulations results are finally proposed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTarget Tracking and Data Fusion in Sensor Networks · Distributed Sensor Networks and Detection Algorithms · Simulation Techniques and Applications