Imitation Learning via Focused Satisficing

Rushit N. Shah; Nikolaos Agadakos; Synthia Sasulski; Ali Farajzadeh; Sanjiban Choudhury; Brian Ziebart

arXiv:2505.14820·cs.LG·May 27, 2025

Imitation Learning via Focused Satisficing

Rushit N. Shah, Nikolaos Agadakos, Synthia Sasulski, Ali Farajzadeh, Sanjiban Choudhury, Brian Ziebart

PDF

Open Access

TL;DR

This paper introduces a focused satisficing approach to imitation learning that aims to surpass demonstrator aspiration levels on unseen data, leading to better imitation quality and higher acceptability in various environments.

Contribution

It proposes a novel margin-based objective for deep reinforcement learning that emphasizes surpassing aspiration levels without explicitly modeling them.

Findings

01

Outperforms existing imitation learning methods in imitation quality.

02

Achieves higher guaranteed acceptability to the demonstrator.

03

Provides competitive true returns across multiple environments.

Abstract

Imitation learning often assumes that demonstrations are close to optimal according to some fixed, but unknown, cost function. However, according to satisficing theory, humans often choose acceptable behavior based on their personal (and potentially dynamic) levels of aspiration, rather than achieving (near-) optimality. For example, a lunar lander demonstration that successfully lands without crashing might be acceptable to a novice despite being slow or jerky. Using a margin-based objective to guide deep reinforcement learning, our focused satisficing approach to imitation learning seeks a policy that surpasses the demonstrator's aspiration levels -- defined over trajectories or portions of trajectories -- on unseen demonstrations without explicitly learning those aspirations. We show experimentally that this focuses the policy to imitate the highest quality (portions of)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInnovative Teaching and Learning Methods · Intelligent Tutoring Systems and Adaptive Learning