NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations

Myunsoo Kim; Hayeong Lee; Seong-Woong Shim; JunHo Seo; Byung-Jun Lee

arXiv:2501.12668·cs.LG·May 21, 2025

NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations

Myunsoo Kim, Hayeong Lee, Seong-Woong Shim, JunHo Seo, Byung-Jun Lee

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces NBDI, a novel method for identifying decision points in skill learning using a state-action novelty module, which improves performance in complex, long-horizon tasks and adapts well to environment variations.

Contribution

The paper proposes NBDI, a simple termination condition based on state-action novelty, enhancing skill extraction and decision point identification in long-horizon, variable environments.

Findings

01

NBDI outperforms previous methods in complex tasks.

02

NBDI remains effective despite environment variations.

03

Improves exploration and policy learning efficiency.

Abstract

Intelligent agents are able to make decisions based on different levels of granularity and duration. Recent advances in skill learning enabled the agent to solve complex, long-horizon tasks by effectively guiding the agent in choosing appropriate skills. However, the practice of using fixed-length skills can easily result in skipping valuable decision points, which ultimately limits the potential for further exploration and faster policy learning. In this work, we propose to learn a simple and effective termination condition that identifies decision points through a state-action novelty module that leverages agent experience data. Our approach, Novelty-based Decision Point Identification (NBDI), outperforms previous baselines in complex, long-horizon tasks, and remains effective even in the presence of significant variations in the environment configurations of downstream tasks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ku-dmlab/nbdi
pytorchOfficial

Videos

NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations· slideslive

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Scheduling and Timetabling Solutions