Learning Interpretable Classifiers for PDDL Planning

Arnaud Lequen

arXiv:2410.10011·cs.AI·October 15, 2024

Learning Interpretable Classifiers for PDDL Planning

Arnaud Lequen

PDF

Open Access

TL;DR

This paper introduces a method for learning interpretable, human-readable logical formulas that describe agent behavior in PDDL planning tasks, enabling understanding and generalization of policies.

Contribution

It presents a novel topology-guided MaxSAT approach to efficiently learn interpretable behavior classifiers from planning examples.

Findings

01

Formulas are human-readable and generalize to unseen instances.

02

Learning is computationally intractable without approximation.

03

The MaxSAT-based method produces accurate formulas in reasonable time.

Abstract

We consider the problem of synthesizing interpretable models that recognize the behaviour of an agent compared to other agents, on a whole set of similar planning tasks expressed in PDDL. Our approach consists in learning logical formulas, from a small set of examples that show how an agent solved small planning instances. These formulas are expressed in a version of First-Order Temporal Logic (FTL) tailored to our planning formalism. Such formulas are human-readable, serve as (partial) descriptions of an agent's policy, and generalize to unseen instances. We show that learning such formulas is computationally intractable, as it is an NP-hard problem. As such, we propose to learn these behaviour classifiers through a topology-guided compilation to MaxSAT, which allows us to generate a wide range of different formulas. Experiments show that interesting and accurate formulas can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Fuzzy Logic and Control Systems

MethodsSparse Evolutionary Training