Refining Manually-Designed Symbol Grounding and High-Level Planning by   Policy Gradients

Takuya Hiraoka; Takashi Onishi; Takahisa Imagawa; Yoshimasa Tsuruoka

arXiv:1810.00177·cs.AI·October 2, 2018·1 cites

Refining Manually-Designed Symbol Grounding and High-Level Planning by Policy Gradients

Takuya Hiraoka, Takashi Onishi, Takahisa Imagawa, Yoshimasa Tsuruoka

PDF

Open Access

TL;DR

This paper introduces a framework that automatically refines manually-designed symbol grounding functions and high-level planners using policy gradients, reducing human effort while maintaining interpretability.

Contribution

The proposed method automatically improves hierarchical planners by refining symbol grounding and planning modules through a combined reinforcement and penalty approach.

Findings

01

Successfully refined modules in the Mountain car problem

02

Improved plan appropriateness while maintaining interpretability

03

Reduced manual effort in designing hierarchical planners

Abstract

Hierarchical planners that produce interpretable and appropriate plans are desired, especially in its application to supporting human decision making. In the typical development of the hierarchical planners, higher-level planners and symbol grounding functions are manually created, and this manual creation requires much human effort. In this paper, we propose a framework that can automatically refine symbol grounding functions and a high-level planner to reduce human effort for designing these modules. In our framework, symbol grounding and high-level planning, which are based on manually-designed knowledge bases, are modeled with semi-Markov decision processes. A policy gradient method is then applied to refine the modules, in which two terms for updating the modules are considered. The first term, called a reinforcement term, contributes to updating the modules to improve the overall…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling