A Framework for Understanding and Visualizing Strategies of RL Agents
Pedro Sequeira, Daniel Elenius, Jesse Hostetler, Melinda Gervasio

TL;DR
This paper introduces a framework that uses temporal logic and clustering to understand and visualize strategies of reinforcement learning agents in complex environments like StarCraft II, enhancing explainability.
Contribution
The novel framework combines trace clustering with logical formula inference to produce interpretable models of agent strategies in sequential decision tasks.
Findings
Successfully separates agent behaviors into distinct clusters
Produces consistent and meaningful strategy descriptions
Enhances understanding of RL agent strategies in complex environments
Abstract
Recent years have seen significant advances in explainable AI as the need to understand deep learning models has gained importance with the increased emphasis on trust and ethics in AI. Comprehensible models for sequential decision tasks are a particular challenge as they require understanding not only individual predictions but a series of predictions that interact with environmental dynamics. We present a framework for learning comprehensible models of sequential decision tasks in which agent strategies are characterized using temporal logic formulas. Given a set of agent traces, we first cluster the traces using a novel embedding method that captures frequent action patterns. We then search for logical formulas that explain the agent strategies in the different clusters. We evaluate our framework on combat scenarios in StarCraft II (SC2), using traces from a handcrafted expert policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI)
