KOI: Accelerating Online Imitation Learning via Hybrid Key-state   Guidance

Jingxian Lu; Wenke Xia; Dong Wang; Zhigang Wang; Bin Zhao; Di Hu,; Xuelong Li

arXiv:2408.02912·cs.RO·October 18, 2024

KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance

Jingxian Lu, Wenke Xia, Dong Wang, Zhigang Wang, Bin Zhao, Di Hu,, Xuelong Li

PDF

Open Access

TL;DR

KOI introduces a hybrid key-state guidance approach for online imitation learning, combining semantic and motion cues to improve reward estimation, accelerate learning, and enhance sample efficiency in robotic tasks.

Contribution

The paper proposes a novel hybrid key-state guided method that integrates visual-language models and optical flow for better reward estimation in imitation learning.

Findings

01

Improved success rates in Meta-World and LIBERO environments.

02

Enhanced sample efficiency during online imitation learning.

03

Validated effectiveness through real-world robotic experiments.

Abstract

Online Imitation Learning struggles with the gap between extensive online exploration space and limited expert trajectories, hindering efficient exploration due to inaccurate reward estimation. Inspired by the findings from cognitive neuroscience, we hypothesize that an agent could estimate precise task-aware reward for efficient online exploration, through decomposing the target task into the objectives of "what to do" and the mechanisms of "how to do". In this work, we introduce the hybrid Key-state guided Online Imitation (KOI) learning method, which leverages the integration of semantic and motion key states as guidance for reward estimation. Initially, we utilize visual-language models to extract semantic key states from expert trajectory, indicating the objectives of "what to do". Within the intervals between semantic key states, optical flow is employed to capture motion key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTeleoperation and Haptic Systems