Sample-Efficient Behavior Cloning Using General Domain Knowledge

Feiyu Zhu; Jean Oh; Reid Simmons

arXiv:2501.16546·cs.AI·February 5, 2026

Sample-Efficient Behavior Cloning Using General Domain Knowledge

Feiyu Zhu, Jean Oh, Reid Simmons

PDF

Open Access

TL;DR

This paper introduces KIM, a method that leverages large language models to incorporate expert domain knowledge into behavior cloning, significantly improving sample efficiency and robustness in decision-making tasks.

Contribution

The paper proposes a novel approach to embed domain knowledge into policy structures using language models, enhancing learning efficiency and generalization in behavior cloning.

Findings

01

Learns with as few as 5 demonstrations

02

Outperforms baseline models without domain knowledge

03

Shows robustness to action noise

Abstract

Behavior cloning has shown success in many sequential decision-making tasks by learning from expert demonstrations, yet they can be very sample inefficient and fail to generalize to unseen scenarios. One approach to these problems is to introduce general domain knowledge, such that the policy can focus on the essential features and may generalize to unseen states by applying that knowledge. Although this knowledge is easy to acquire from the experts, it is hard to be combined with learning from individual examples due to the lack of semantic structure in neural networks and the time-consuming nature of feature engineering. To enable learning from both general knowledge and specific demonstration trajectories, we use a large language model's coding capability to instantiate a policy structure based on expert domain knowledge expressed in natural language and tune the parameters in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications

MethodsFocus