A novel approach to model exploration for value function learning
Zlatan Ajanovic, Halil Beglerovic, Bakir Lacevic

TL;DR
This paper introduces a new method for exploring models to learn value functions that enhance planning efficiency and robustness by extending search regions and managing heuristic admissibility.
Contribution
It proposes a search-inspired model exploration technique for value function learning that extends beyond optimal paths and combines ML heuristics with admissible bounds.
Findings
Improved planning efficiency through extended search regions.
Enhanced robustness of planning with learned heuristics.
Effective management of heuristic admissibility loss.
Abstract
Planning and Learning are complementary approaches. Planning relies on deliberative reasoning about the current state and sequence of future reachable states to solve the problem. Learning, on the other hand, is focused on improving system performance based on experience or available data. Learning to improve the performance of planning based on experience in similar, previously solved problems, is ongoing research. One approach is to learn Value function (cost-to-go) which can be used as heuristics for speeding up search-based planning. Existing approaches in this direction use the results of the previous search for learning the heuristics. In this work, we present a search-inspired approach of systematic model exploration for the learning of the value function which does not stop when a plan is available but rather prolongs search such that not only resulting optimal path is used but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge · Machine Learning and Algorithms
