# Comparative Evaluation of Bandit-Style Heuristic Policies for Moving Target Detection in a Linear Grid Environment

**Authors:** Hyunmin Kang, Minho Ahn, Yongduek Seo

PMC · DOI: 10.3390/s26010226 · Sensors (Basel, Switzerland) · 2025-12-29

## TL;DR

This paper compares two simple decision rules for detecting a moving target in a grid environment, finding that a greedy approach detects the target faster than alternatives.

## Contribution

The study provides a comparative evaluation of greedy and belief-proportional sampling heuristics for moving target detection in a constrained sensing environment.

## Key findings

- The greedy policy achieves a 17–20% faster expected time to detection compared to BPS and random probing.
- BPS offers better exploration under model mismatch at the cost of reduced average efficiency.
- Monte Carlo simulations quantify the trade-off between exploration and exploitation for the two policies.

## Abstract

Moving-target detection under strict sensing constraints is a recurring subproblem in surveillance, search-and-rescue, and autonomous robotics. We study a canonical one-dimensional finite grid in which a sensor probes one location per time step with binary observations while the target follows reflecting random-walk dynamics. The objective is to minimize the expected time to detection using transparent, training-free decision rules defined on the belief state of the target location. We compare two belief-driven heuristics with purely online implementation: a greedy rule that always probes the most probable location and a belief-proportional sampling (BPS, probability matching) rule that samples sensing locations according to the belief distribution (i.e., posterior probability of the target location). Repeated Monte Carlo simulations quantify the exploitation–exploration trade-off and provide a self-comparison between the two policies. Across tested grid sizes, the greedy policy consistently yields the shortest expected time to detection, improving by roughly 17–20% over BPS and uniform random probing in representative settings. BPS trades some average efficiency for stochastic exploration, which can be beneficial under model mismatch. This study provides an interpretable baseline and quantitative reference for extensions to noisy sensing and higher-dimensional search.

## Full-text entities

- **Genes:** PODXL2 (podocalyxin like 2) [NCBI Gene 50512] {aka EG, PODLX2}
- **Diseases:** injury to (MESH:D014947)
- **Chemicals:** Pd (MESH:D010165), Pfa (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** stop at 1000, A3C

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12788252/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12788252/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12788252/full.md

---
Source: https://tomesphere.com/paper/PMC12788252