Engineering Benchmarks for Planning: the Domains Used in the Deterministic Part of IPC-4
S. Edelkamp, R. Englert, J. Hoffmann, F. Liporace, S. Thiebaux, S., Trueg

TL;DR
This paper details the creation and analysis of diverse, real-world inspired benchmarks for AI planning in IPC-4, including domain adaptations, theoretical complexity insights, and empirical evaluations of heuristic performance.
Contribution
It introduces five new benchmark domains derived from real-world applications, discusses their encoding and simplification, and provides empirical analysis of heuristic effectiveness and domain properties.
Findings
Heuristic functions vary in quality across domains.
Propositional representations grow rapidly with instance size.
Different planners achieve varying success depending on domain features.
Abstract
In a field of research about general reasoning mechanisms, it is essential to have appropriate benchmarks. Ideally, the benchmarks should reflect possible applications of the developed technology. In AI Planning, researchers more and more tend to draw their testing examples from the benchmark collections used in the International Planning Competition (IPC). In the organization of (the deterministic part of) the fourth IPC, IPC-4, the authors therefore invested significant effort to create a useful set of benchmarks. They come from five different (potential) real-world applications of planning: airport ground traffic control, oil derivative transportation in pipeline networks, model-checking safety properties, power supply restoration, and UMTS call setup. Adapting and preparing such an application for use as a benchmark in the IPC involves, at the time, inevitable (often drastic)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
