A Novel Model for Arbitration between Planning and Habitual Control   Systems

Farzaneh S. Fard; and Thomas P. Trappenberg

arXiv:1712.02441·cs.SY·December 11, 2017

A Novel Model for Arbitration between Planning and Habitual Control Systems

Farzaneh S. Fard, and Thomas P. Trappenberg

PDF

TL;DR

This paper introduces an integrated arbitration model combining habitual and deliberate control systems, demonstrating rapid learning and robustness in a robotic target-reaching task, especially under changing conditions and occlusion.

Contribution

It presents a novel architecture with an arbitrator that dynamically switches between habitual and planning control, improving adaptability and learning speed in robotic tasks.

Findings

01

The model learns system kinematics rapidly without prior knowledge.

02

It maintains performance despite environmental changes and occlusion.

03

The integrated system outperforms pure habitual or planning models.

Abstract

It is well established that humans decision making and instrumental control uses multiple systems, some which use habitual action selection and some which require deliberate planning. Deliberate planning systems use predictions of action-outcomes using an internal model of the agent's environment, while habitual action selection systems learn to automate by repeating previously rewarded actions. Habitual control is computationally efficient but may be inflexible in changing environments. Conversely, deliberate planning may be computationally expensive, but flexible in dynamic environments. This paper proposes a general architecture comprising both control paradigms by introducing an arbitrator that controls which subsystem is used at any time. This system is implemented for a target-reaching task with a simulated two-joint robotic arm that comprises a supervised internal model and deep…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.