Adaptformer: Sequence models as adaptive iterative planners

Akash Karthikeyan; Yash Vardhan Pant

arXiv:2412.00293·cs.RO·December 3, 2024

Adaptformer: Sequence models as adaptive iterative planners

Akash Karthikeyan, Yash Vardhan Pant

PDF

Open Access

TL;DR

Adaptformer is a novel adaptive sequence model-based planner that improves decision-making in complex, multi-task autonomous missions by generalizing to out-of-distribution tasks and environments, outperforming existing methods.

Contribution

The paper introduces Adaptformer, a stochastic, adaptive sequence model that incorporates an energy-based heuristic and intrinsic sub-goal curriculum for efficient long-horizon planning.

Findings

01

Outperforms state-of-the-art by up to 25% in maze tasks

02

Successfully generalizes to out-of-distribution tasks

03

Effectively adapts to multi-task missions with limited demonstrations

Abstract

Despite recent advances in learning-based behavioral planning for autonomous systems, decision-making in multi-task missions remains a challenging problem. For instance, a mission might require a robot to explore an unknown environment, locate the goals, and navigate to them, even if there are obstacles along the way. Such problems are difficult to solve due to: a) sparse rewards, meaning a reward signal is available only once all the tasks in a mission have been satisfied, and b) the agent having to perform tasks at run-time that are not covered in the training data, e.g., demonstrations only from an environment where all doors were unlocked. Consequently, state-of-the-art decision-making methods in such settings are limited to missions where the required tasks are well-represented in the training demonstrations and can be solved within a short planning horizon. To overcome these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConstraint Satisfaction and Optimization · Music and Audio Processing · Artificial Intelligence in Games