Active Perception with Initial-State Uncertainty: A Policy Gradient Method

Chongyang Shi, Shuo Han, Michael Dorothy, and Jie Fu

arXiv:2409.16439 · eess.SY · September 26, 2024

TL;DR

This paper introduces a policy gradient method for active perception in stochastic systems modeled as hidden Markov models (HMMs). The policy selects sensor query actions to maximize information leakage about the initial state, using Shannon conditional entropy as the planning objective.

Contribution

The paper develops a policy gradient method with convergence guarantees for active perception in HMMs, using a variant of observable operators to compute the gradient efficiently.

Findings

01 Effective in a stochastic grid world environment

02 Convergence guarantees for the proposed method

03 Improved initial state inference accuracy

Abstract

This paper studies the synthesis of an active perception policy that maximizes the information leakage of the initial state in a stochastic system modeled as a hidden Markov model (HMM). Specifically, the emission function of the HMM is controllable with a set of perception or sensor query actions. Given that the goal is to infer the initial state from partial observations in the HMM, we use Shannon conditional entropy as the planning objective and develop a novel policy gradient method with convergence guarantees. By leveraging a variant of observable operators in HMMs, we prove several important properties of the gradient of the conditional entropy with respect to the policy parameters, which allow efficient computation of the policy gradient and stable, fast convergence. We demonstrate the effectiveness of our solution by applying it to an inference problem in a stochastic grid world…
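To make the planning objective concrete, the following is a minimal sketch of how the conditional entropy of the initial state could be evaluated for a tiny HMM with a controllable emission function. It is not the paper's algorithm: it assumes a memoryless policy that picks each sensor action independently of past observations, it enumerates every action-observation sequence by brute force rather than sampling or differentiating, and all names (`initial_state_entropy`, `mu0`, `P`, `E`, `pi`) and the two-state model are hypothetical. The per-step update `(M * E[a, :, o]) @ P` plays the role of an observable-operator product.

```python
import itertools
import numpy as np


def softmax(theta):
    """Turn policy parameters into action probabilities."""
    z = np.exp(theta - np.max(theta))
    return z / z.sum()


def initial_state_entropy(mu0, P, E, pi, horizon):
    """H(S_0 | actions, observations) in nats, by exhaustive enumeration.

    mu0: (n,) initial distribution; P: (n, n) row-stochastic transitions;
    E[a, s, o] = P(o | s, a); pi: (n_actions,) memoryless sensing policy
    (an assumption of this sketch, not the paper's policy class).
    """
    n_actions, n_states, n_obs = E.shape
    steps = list(itertools.product(range(n_actions), range(n_obs)))
    entropy = 0.0
    for seq in itertools.product(steps, repeat=horizon):
        M = np.eye(n_states)  # row s0 follows the chain started in s0
        weight = 1.0
        for a, o in seq:
            # Weight by the emission likelihood, then propagate one step.
            M = (M * E[a, :, o]) @ P
            weight *= pi[a]
        joint = mu0 * weight * M.sum(axis=1)  # P(s0, actions, observations)
        p_y = joint.sum()
        if p_y > 0:
            nz = joint > 0
            entropy -= np.sum(joint[nz] * np.log(joint[nz] / p_y))
    return entropy


# Hypothetical model: action 0 is an uninformative sensor, action 1 is exact.
mu0 = np.array([0.5, 0.5])
P = np.array([[0.9, 0.1], [0.2, 0.8]])
E = np.array([
    [[0.5, 0.5], [0.5, 0.5]],  # action 0: observation independent of state
    [[1.0, 0.0], [0.0, 1.0]],  # action 1: observation reveals the state
])

for theta in (np.array([2.0, -2.0]), np.zeros(2), np.array([-2.0, 2.0])):
    pi = softmax(theta)
    print(pi, initial_state_entropy(mu0, P, E, pi, horizon=2))
```

Lower entropy means the observations reveal more about the initial state, so shifting probability toward the exact sensor should drive the printed value down. Enumeration grows exponentially with the horizon, which is why a gradient-based method is needed for anything beyond toy sizes.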

Taxonomy

Topics: Auction Theory and Applications · Capital Investment and Risk Analysis

Methods: Sparse Evolutionary Training