Adaptformer: Sequence models as adaptive iterative planners
Akash Karthikeyan, Yash Vardhan Pant

TL;DR
Adaptformer is a novel adaptive sequence model-based planner that improves decision-making in complex, multi-task autonomous missions by generalizing to out-of-distribution tasks and environments, outperforming existing methods.
Contribution
The paper introduces Adaptformer, a stochastic, adaptive sequence model that incorporates an energy-based heuristic and intrinsic sub-goal curriculum for efficient long-horizon planning.
Findings
Outperforms state-of-the-art by up to 25% in maze tasks
Successfully generalizes to out-of-distribution tasks
Effectively adapts to multi-task missions with limited demonstrations
Abstract
Despite recent advances in learning-based behavioral planning for autonomous systems, decision-making in multi-task missions remains a challenging problem. For instance, a mission might require a robot to explore an unknown environment, locate the goals, and navigate to them, even if there are obstacles along the way. Such problems are difficult to solve due to: a) sparse rewards, meaning a reward signal is available only once all the tasks in a mission have been satisfied, and b) the agent having to perform tasks at run-time that are not covered in the training data, e.g., demonstrations only from an environment where all doors were unlocked. Consequently, state-of-the-art decision-making methods in such settings are limited to missions where the required tasks are well-represented in the training demonstrations and can be solved within a short planning horizon. To overcome these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization · Music and Audio Processing · Artificial Intelligence in Games
