Optimizing Attention and Cognitive Control Costs Using Temporally-Layered Architectures
Devdhar Patel, Terrence Sejnowski, Hava Siegelmann

TL;DR
This paper introduces a biologically-inspired, temporally layered architecture for reinforcement learning that optimizes computational energy and decision costs, achieving high performance with reduced energy expenditure.
Contribution
The paper proposes a novel Temporally Layered Architecture (TLA) that manages computational costs in reinforcement learning, addressing limitations of existing algorithms in decision-bounded environments.
Findings
TLA achieves optimal performance with lower computational costs.
Existing RL algorithms struggle under decision and energy constraints.
TLA matches state-of-the-art performance while reducing compute energy use.
Abstract
The current reinforcement learning framework focuses exclusively on performance, often at the expense of efficiency. In contrast, biological control achieves remarkable performance while also optimizing computational energy expenditure and decision frequency. We propose a Decision Bounded Markov Decision Process (DB-MDP), that constrains the number of decisions and computational energy available to agents in reinforcement learning environments. Our experiments demonstrate that existing reinforcement learning algorithms struggle within this framework, leading to either failure or suboptimal performance. To address this, we introduce a biologically-inspired, Temporally Layered Architecture (TLA), enabling agents to manage computational costs through two layers with distinct time scales and energy requirements. TLA achieves optimal performance in decision-bounded environments and in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗devdharpatel/tla-Pendulum-v1model· ♡ 1♡ 1
- 🤗devdharpatel/tla-MountainCarContinuous-v0model· ♡ 1♡ 1
- 🤗devdharpatel/tla_InvertedPendulum-v2model· ♡ 1♡ 1
- 🤗devdharpatel/tla-InvertedDoublePendulum-v2model· ♡ 1♡ 1
- 🤗devdharpatel/tla-Hopper-v2model· ♡ 1♡ 1
- 🤗devdharpatel/tla-Walker2d-v2model· ♡ 1♡ 1
- 🤗devdharpatel/tla-Ant-v2model· ♡ 1♡ 1
- 🤗devdharpatel/tla-HalfCheetah-v2model· ♡ 1♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Flow Experience in Various Fields
MethodsTemporally Layered Architecture · Target Policy Smoothing · Clipped Double Q-learning · Experience Replay · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · Adam · Twin Delayed Deep Deterministic
