Detecting danger in gridworlds using Gromov's Link Condition
Thomas F Burns, Robert Tang

TL;DR
This paper uses topological and geometric methods to analyze gridworlds in AI, revealing safety limitations by detecting failures in Gromov's Link Condition that correspond to dangerous states.
Contribution
It introduces a modified state complex framework that captures agent braiding, links geometric defects (failures of Gromov's Link Condition) to unsafe states in gridworlds, and applies tools from geometric group theory to AI safety analysis.
Findings
Failures in Gromov's Link Condition indicate dangerous states.
Modified state complexes effectively identify safety limitations.
Topological methods provide new insights into gridworld safety analysis.
Abstract
Gridworlds have long been utilised in AI research, particularly in reinforcement learning, as they provide simple yet scalable models for many real-world applications such as robot navigation, emergent behaviour, and operations research. We initiate a study of gridworlds using the mathematical framework of reconfigurable systems and state complexes due to Abrams, Ghrist & Peterson. State complexes represent all possible configurations of a system as a single geometric space, which makes them amenable to study using geometric, topological, or combinatorial methods. The main contribution of this work is a modification to the original Abrams, Ghrist & Peterson setup, which we introduce to capture agent braiding and thereby represent the topology of gridworlds more naturally. With this modification, the state complexes may exhibit geometric defects (failure of Gromov's Link Condition).…
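For readers unfamiliar with the condition named in the abstract: Gromov's Link Condition says that a cube complex is non-positively curved exactly when the link of every vertex is a flag simplicial complex, meaning every set of pairwise-adjacent vertices in the link spans a simplex. The following is a minimal sketch of that flagness check, not the authors' code. It assumes the link has already been extracted and is given as a list of simplices; the function name `is_flag` and the input format are illustrative choices made here. The check enumerates all vertex subsets, so it is only suitable for small links.

```python
from itertools import combinations


def is_flag(simplices):
    """Return True if the simplicial complex generated by `simplices` is flag.

    `simplices` is an iterable of vertex tuples (hypothetical input format).
    A complex is flag when every clique in its 1-skeleton spans a simplex.
    """
    # Close the input under taking non-empty faces.
    closed = set()
    for simplex in simplices:
        simplex = tuple(simplex)
        for k in range(1, len(simplex) + 1):
            for face in combinations(simplex, k):
                closed.add(frozenset(face))

    vertices = sorted({v for face in closed for v in face})
    edges = {face for face in closed if len(face) == 2}

    # Look for a clique of size >= 3 that is not itself a simplex.
    for k in range(3, len(vertices) + 1):
        for candidate in combinations(vertices, k):
            is_clique = all(
                frozenset(pair) in edges for pair in combinations(candidate, 2)
            )
            if is_clique and frozenset(candidate) not in closed:
                return False
    return True


# Three pairwise-adjacent vertices with no triangle: the link condition fails.
empty_triangle = [("a", "b"), ("b", "c"), ("a", "c")]
# The same three vertices spanning a filled triangle: the link is flag.
filled_triangle = [("a", "b", "c")]
```

In a state complex, vertices of the link at a state correspond to the moves available there, and a simplex corresponds to a cube containing those moves. An empty triangle therefore represents three moves that commute pairwise but do not jointly span a 3-cube, which is the kind of local defect the abstract refers to.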
Taxonomy
Topics: Logic, Reasoning, and Knowledge · Topological and Geometric Data Analysis · Computability, Logic, AI Algorithms
