Decision-Focused Learning to Predict Action Costs for Planning
Jayanta Mandi, Marco Foschini, Daniel Holler, Sylvie Thiebaux, Jorg, Hoffmann, Tias Guns

TL;DR
This paper introduces decision-focused learning for predicting action costs in automated planning, addressing unique challenges in gradient computation and scalability, and demonstrating improved planning outcomes over traditional prediction methods.
Contribution
It pioneers the application of decision-focused learning to automated planning, proposing novel gradient computation methods and caching techniques to enhance scalability and effectiveness.
Findings
Decision-focused learning outperforms traditional prediction in planning quality.
Proposed gradient methods effectively handle negative action costs.
Caching mechanisms improve training efficiency.
Abstract
In many automated planning applications, action costs can be hard to specify. An example is the time needed to travel through a certain road segment, which depends on many factors, such as the current weather conditions. A natural way to address this issue is to learn to predict these parameters based on input features (e.g., weather forecasts) and use the predicted action costs in automated planning afterward. Decision-Focused Learning (DFL) has been successful in learning to predict the parameters of combinatorial optimization problems in a way that optimizes solution quality rather than prediction quality. This approach yields better results than treating prediction and optimization as separate tasks. In this paper, we investigate for the first time the challenges of implementing DFL for automated planning in order to learn to predict the action costs. There are two main challenges…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making
MethodsEmirates Airlines Office in Dubai · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
