TACO: Learning Task Decomposition via Temporal Alignment for Control

Kyriacos Shiarlis; Markus Wulfmeier; Sasha Salter; Shimon Whiteson,; Ingmar Posner

arXiv:1803.01840·cs.LG·August 13, 2018·29 cites

TACO: Learning Task Decomposition via Temporal Alignment for Control

Kyriacos Shiarlis, Markus Wulfmeier, Sasha Salter, Shimon Whiteson,, Ingmar Posner

PDF

Open Access 1 Repo

TL;DR

TACO introduces a weakly supervised, domain-agnostic method for learning task decomposition and sub-policies from demonstrations using temporal alignment, reducing annotation effort while maintaining high performance.

Contribution

It presents a novel approach that aligns task sketches with demonstrations to learn sub-policies without extensive supervision or domain knowledge.

Findings

01

Performs comparably to fully supervised methods

02

Requires significantly less annotation effort

03

Effective on multiple domains including image-based robot control

Abstract

Many advanced Learning from Demonstration (LfD) methods consider the decomposition of complex, real-world tasks into simpler sub-tasks. By reusing the corresponding sub-policies within and between tasks, they provide training data for each policy from different high-level tasks and compose them to perform novel ones. Existing approaches to modular LfD focus either on learning a single high-level task or depend on domain knowledge and temporal segmentation. In contrast, we propose a weakly supervised, domain-agnostic approach based on task sketches, which include only the sequence of sub-tasks performed in each demonstration. Our approach simultaneously aligns the sketches with the observed demonstrations and learns the required sub-policies. This improves generalisation in comparison to separate optimisation procedures. We evaluate the approach on multiple domains, including a simulated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KyriacosShiarli/taco
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · AI-based Problem Solving and Planning