# Intelligent Knowledge Distribution: Constrained-Action POMDPs for   Resource-Aware Multi-Agent Communication

**Authors:** Michael C. Fowler, T. Charles Clancy, Ryan K. Williams

arXiv: 1903.03086 · 2019-03-08

## TL;DR

This paper introduces constrained-action POMDPs and an intelligent knowledge distribution framework for resource-aware multi-agent communication, enabling efficient information sharing with minimal constraint violations.

## Contribution

It develops a novel CA-POMDP framework with soft probabilistic constraints and proposes IKD for optimized multi-agent knowledge sharing under resource limitations.

## Key findings

- IKD maintained asset tracking with only 3% constraint violations.
- Compared to naive approaches, IKD reduced constraint violations from over 60%.
- The framework effectively balances information sharing and resource constraints.

## Abstract

This paper addresses a fundamental question of multi-agent knowledge distribution: what information should be sent to whom and when, with the limited resources available to each agent? Communication requirements for multi-agent systems can be rather high when an accurate picture of the environment and the state of other agents must be maintained. To reduce the impact of multi-agent coordination on networked systems, e.g., power and bandwidth, this paper introduces two concepts for partially observable Markov decision processes (POMDPs): 1) action-based constraints which yield constrained-action POMDPs (CA-POMDPs); and 2) soft probabilistic constraint satisfaction for the resulting infinite-horizon controllers. To enable constraint analysis over an infinite horizon, an unconstrained policy is first represented as a Finite State Controller (FSC) and optimized with policy iteration. The FSC representation then allows for a combination of Markov chain Monte Carlo and discrete optimization to improve the probabilistic constraint satisfaction of the controller while minimizing the impact to the value function. Within the CA-POMDP framework we then propose Intelligent Knowledge Distribution (IKD) which yields per-agent policies for distributing knowledge between agents subject to interaction constraints. Finally, the CA-POMDP and IKD concepts are validated using an asset tracking problem where multiple unmanned aerial vehicles (UAVs) with heterogeneous sensors collaborate to localize a ground asset to assist in avoiding unseen obstacles in a disaster area. The IKD model was able to maintain asset tracking through multi-agent communications while only violating soft power and bandwidth constraints 3% of the time, while greedy and naive approaches violated constraints more than 60% of the time.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.03086/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1903.03086/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1903.03086/full.md

---
Source: https://tomesphere.com/paper/1903.03086