TAMPC: A Controller for Escaping Traps in Novel Environments
Sheng Zhong (1), Zhenyuan Zhang (1), Nima Fazeli (1), Dmitry Berenson, (1) ((1) Robotics Institute, University of Michigan)

TL;DR
This paper introduces TAMPC, a hierarchical control algorithm that adapts to unexpected trap states in hybrid dynamics environments, enabling robots to escape traps and continue tasks effectively.
Contribution
The paper presents TAMPC, a novel trap-aware control method that recognizes and adapts to out-of-distribution dynamics to escape traps in hybrid systems.
Findings
Outperforms adaptive control and reinforcement learning baselines in trap scenarios.
Effectively recognizes nominal dynamics even with out-of-distribution data.
Achieves comparable performance to existing trap-handling methods on easier tasks.
Abstract
We propose an approach to online model adaptation and control in the challenging case of hybrid and discontinuous dynamics where actions may lead to difficult-to-escape "trap" states, under a given controller. We first learn dynamics for a system without traps from a randomly collected training set (since we do not know what traps will be encountered online). These "nominal" dynamics allow us to perform tasks in scenarios where the dynamics matches the training data, but when unexpected traps arise in execution, we must find a way to adapt our dynamics and control strategy and continue attempting the task. Our approach, Trap-Aware Model Predictive Control (TAMPC), is a two-level hierarchical control algorithm that reasons about traps and non-nominal dynamics to decide between goal-seeking and recovery policies. An important requirement of our method is the ability to recognize nominal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Model Reduction and Neural Networks
