Open-World Multi-Task Control Through Goal-Aware Representation Learning   and Adaptive Horizon Prediction

Shaofei Cai; Zihao Wang; Xiaojian Ma; Anji Liu; Yitao Liang

arXiv:2301.10034·cs.AI·October 16, 2023·1 cites

Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction

Shaofei Cai, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang

PDF

Open Access 2 Repos

TL;DR

This paper introduces a goal-aware representation learning approach with adaptive horizon prediction to improve goal-conditioned multi-task policies in Minecraft, achieving significant performance gains and zero-shot generalization in complex open-ended environments.

Contribution

The paper proposes a novel Goal-Sensitive Backbone and adaptive horizon prediction module to address task indistinguishability and non-stationary dynamics in open-world multi-task learning.

Findings

01

Outperforms baseline methods on 20 Minecraft tasks

02

Doubles performance in many tasks

03

Achieves zero-shot generalization to new scenes

Abstract

We study the problem of learning goal-conditioned policies in Minecraft, a popular, widely accessible yet challenging open-ended environment for developing human-level multi-task agents. We first identify two main challenges of learning such policies: 1) the indistinguishability of tasks from the state distribution, due to the vast scene diversity, and 2) the non-stationary nature of environment dynamics caused by partial observability. To tackle the first challenge, we propose Goal-Sensitive Backbone (GSB) for the policy to encourage the emergence of goal-relevant visual state representations. To tackle the second challenge, the policy is further fueled by an adaptive horizon prediction module that helps alleviate the learning uncertainty brought by the non-stationary dynamics. Experiments on 20 Minecraft tasks show that our method significantly outperforms the best baseline so far; in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition