On Using Admissible Bounds for Learning Forward Search Heuristics
Carlos N\'u\~nez-Molina, Masataro Asai, Pablo Mesejo, Juan, Fern\'andez-Olivares

TL;DR
This paper introduces a novel approach for learning heuristic functions in forward search by modeling admissible heuristics as a truncated Gaussian, leading to improved training convergence and heuristic quality.
Contribution
It proposes a new loss function based on modeling heuristics as a truncated Gaussian, addressing limitations of the MSE approach in heuristic learning.
Findings
Our method converges faster during training.
The learned heuristics are of higher quality.
The approach outperforms MSE-based methods in experiments.
Abstract
In recent years, there has been growing interest in utilizing modern machine learning techniques to learn heuristic functions for forward search algorithms. Despite this, there has been little theoretical understanding of what they should learn, how to train them, and why we do so. This lack of understanding has resulted in the adoption of diverse training targets (suboptimal vs optimal costs vs admissible heuristics) and loss functions (e.g., square vs absolute errors) in the literature. In this work, we focus on how to effectively utilize the information provided by admissible heuristics in heuristic learning. We argue that learning from poly-time admissible heuristics by minimizing mean square errors (MSE) is not the correct approach, since its result is merely a noisy, inadmissible copy of an efficiently computable heuristic. Instead, we propose to model the learned heuristic as a…
Peer Reviews
Decision·Submitted to ICLR 2024
1. The paper is enjoyable to read and fairly well organized by raising three core questions (what, how and why). The problem of choice of training target and loss function is also well-motivated. 2. The experiments are relatively comprehensive and the results are pleasing.
1. In my view, one of the main contributions of this paper is to use the NLL as their training loss instead of MSE, and NLL adds the prediction of $\sigma$ (the variance) where MSE uses the fixed $\sigma$. However, in my opinion, this technique will improve the performance very trivially since the model will predict better with more parameters. 2. The authors explain the reason for using Gaussian distribution by giving the principle of maximum entropy, but the reason for using Truncated Gaussia
Strengths: - Novel approach for learning heuristics that makes principled use of admissible estimates. - The approach is compatible with residual heuristic learning and is agnostic of the neural architecture. - Experiments show increased accuracy and a larger number of planning instances solved under 10^4 evaluations.
Weaknesses: - Missing state-of-the-art recent baseline: [Chrestien et al., 2022] is an alternative approach that also argues against using MSE and proposes an alternative approach. This baseline should be compared to the proposed approach for learning heuristics. - In the experiments, it looks that the standard h^FF baseline outperforms the proposed approach in terms of problem solved in two out of the three domains. - There is no analysis of performance vs. problem size. It would be very useful
The paper is well-written, the presented argument is convincing, and the empirical results further support it. This is a great paper that successfully challenges the accepted wisdom of using MSE when learning heuristics.
I have no concerns or questions regarding the work.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Timetabling Solutions · Metaheuristic Optimization Algorithms Research · Machine Learning and Algorithms
MethodsFocus
