Agent policies from higher-order causal functions
Matt Wilson

TL;DR
This paper establishes a theoretical link between agent policies in deterministic POMDPs and higher-order process functions, revealing how indefinite causal structures can enhance policy performance.
Contribution
It introduces a novel correspondence between agent policies and process functions, and demonstrates the advantage of indefinite causality in multi-agent decision-making.
Findings
Indefinite causal policies can outperform fixed causal policies in certain POMDPs.
A categorical framework supports interpretation of policies and causal structures.
Strict separation shown between indefinite and definite causal process functions.
Abstract
We establish a correspondence between equivalence classes of agent-state policies for deterministic POMDPs and one-input process functions (the classical-deterministic limit of higher-order quantum operations). We use this correspondence to build a bridge between the agent-environment interaction in artificial intelligence, causal structure in the foundations of physics, and logic in computer science. We construct a *-autonomous category PF of types which supports an interpretation of one-step evaluation of policies, and multi-agent observation constraints, into cuts and monoidal products. In terms of types, we develop the correspondence further by identifying observation-independent decentralised POMDPs as the natural domain for the multi-input process functions used to model indefinite causality. We then prove a strict separation between general multi-input process function and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, Reasoning, and Knowledge · Formal Methods in Verification · Computability, Logic, AI Algorithms
