A Novel Model for Arbitration between Planning and Habitual Control Systems
Farzaneh S. Fard, and Thomas P. Trappenberg

TL;DR
This paper introduces an integrated arbitration model combining habitual and deliberate control systems, demonstrating rapid learning and robustness in a robotic target-reaching task, especially under changing conditions and occlusion.
Contribution
It presents a novel architecture with an arbitrator that dynamically switches between habitual and planning control, improving adaptability and learning speed in robotic tasks.
Findings
The model learns system kinematics rapidly without prior knowledge.
It maintains performance despite environmental changes and occlusion.
The integrated system outperforms pure habitual or planning models.
Abstract
It is well established that humans decision making and instrumental control uses multiple systems, some which use habitual action selection and some which require deliberate planning. Deliberate planning systems use predictions of action-outcomes using an internal model of the agent's environment, while habitual action selection systems learn to automate by repeating previously rewarded actions. Habitual control is computationally efficient but may be inflexible in changing environments. Conversely, deliberate planning may be computationally expensive, but flexible in dynamic environments. This paper proposes a general architecture comprising both control paradigms by introducing an arbitrator that controls which subsystem is used at any time. This system is implemented for a target-reaching task with a simulated two-joint robotic arm that comprises a supervised internal model and deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
