Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction

Tom Bewley; Jonathan Lawry; Arthur Richards

arXiv:2201.07749 · cs.AI · June 22, 2022 · 1 cites

Open Access

TL;DR

This paper presents a data-driven, model-agnostic method for summarizing and contrasting the dynamics of evolving systems like reinforcement learning agents, using spatiotemporal aggregation based on information divergence.

Contribution

The paper introduces a technique for generating human-interpretable summaries of agent dynamics, with a practical algorithm outlined for continuous state spaces.

Findings

01 Effective summarization of deep RL learning histories
02 Complementary to existing interpretability methods
03 Applicable to continuous state spaces

Abstract

We introduce a data-driven, model-agnostic technique for generating a human-interpretable summary of the salient points of contrast within an evolving dynamical system, such as the learning process of a control agent. It involves the aggregation of transition data along both spatial and temporal dimensions according to an information-theoretic divergence measure. A practical algorithm is outlined for continuous state spaces, and deployed to summarise the learning histories of deep reinforcement learning agents with the aid of graphical and textual communication methods. We expect our method to be complementary to existing techniques in the realm of agent interpretability.
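
The aggregation idea in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a fixed one-dimensional split of the state space into two regions, uses the Jensen-Shannon divergence as a stand-in for the unspecified information-theoretic measure, and all names (`transition_counts`, `window_divergence`, `bin_state`) are hypothetical. It compares the abstract transition distributions of two time windows of a toy learning history.

```python
import math
from collections import Counter

def transition_counts(episodes, bin_state):
    """Count abstract transitions (region -> region) over state sequences."""
    counts = Counter()
    for states in episodes:
        abstract = [bin_state(s) for s in states]
        for a, b in zip(abstract, abstract[1:]):
            counts[(a, b)] += 1
    return counts

def normalise(counts, support):
    """Turn counts into a probability vector over a shared support."""
    total = sum(counts.values())
    return [counts.get(k, 0) / total for k in support]

def js_divergence(p, q):
    """Jensen-Shannon divergence in bits (assumed measure, not the paper's)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]

    def kl(a, b):
        return sum(ai * math.log2(ai / bi) for ai, bi in zip(a, b) if ai > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def window_divergence(early, late, bin_state):
    """Divergence between the transition distributions of two time windows."""
    c1 = transition_counts(early, bin_state)
    c2 = transition_counts(late, bin_state)
    support = sorted(set(c1) | set(c2))
    return js_divergence(normalise(c1, support), normalise(c2, support))

# Toy data: one episode per window, states are scalars in [0, 1].
bin_state = lambda s: int(s >= 0.5)   # hypothetical spatial abstraction
early = [[0.1, 0.2, 0.3, 0.4]]        # early in learning: stays in region 0
late = [[0.1, 0.6, 0.7, 0.9]]         # later in learning: moves into region 1

print(window_divergence(early, late, bin_state))
```

In this toy case the two windows share no abstract transitions, so the divergence reaches its maximum of 1 bit; comparing a window with itself gives 0. A contrastive method would, under these assumptions, prefer spatial splits and temporal boundaries that make such divergences large.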

Taxonomy

Topics: Neural Networks and Applications · Explainable Artificial Intelligence (XAI) · Statistical and Computational Modeling