Optimizing Attention and Cognitive Control Costs Using Temporally-Layered Architectures

Devdhar Patel; Terrence Sejnowski; Hava Siegelmann

arXiv:2305.18701 · cs.AI · November 1, 2024 · 1 citation

TL;DR

This paper introduces a biologically-inspired, temporally layered architecture for reinforcement learning that optimizes computational energy and decision costs, achieving high performance with reduced energy expenditure.

Contribution

The paper proposes a novel Temporally Layered Architecture (TLA) that manages computational costs in reinforcement learning, addressing limitations of existing algorithms in decision-bounded environments.

Findings

01. TLA achieves optimal performance in decision-bounded environments at lower computational cost.
02. Existing RL algorithms struggle under decision and energy constraints.
03. TLA matches state-of-the-art performance while reducing compute energy use.

Abstract

The current reinforcement learning framework focuses exclusively on performance, often at the expense of efficiency. In contrast, biological control achieves remarkable performance while also optimizing computational energy expenditure and decision frequency. We propose a Decision Bounded Markov Decision Process (DB-MDP) that constrains the number of decisions and computational energy available to agents in reinforcement learning environments. Our experiments demonstrate that existing reinforcement learning algorithms struggle within this framework, leading to either failure or suboptimal performance. To address this, we introduce a biologically inspired Temporally Layered Architecture (TLA) that enables agents to manage computational costs through two layers with distinct time scales and energy requirements. TLA achieves optimal performance in decision-bounded environments and in…
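To make the idea of a decision bound concrete, here is a minimal Python sketch. It is an illustration only, not the paper's formal DB-MDP definition and not the TLA implementation. The names `DecisionBudget`, `run_episode`, `toy_step`, and `greedy_policy` are hypothetical, and the rule that the previous action is repeated once the budget is exhausted is an assumption made for this example.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class DecisionBudget:
    """Hypothetical counter for the fresh decisions allowed in one episode."""
    max_decisions: int
    used: int = 0

    def can_decide(self) -> bool:
        return self.used < self.max_decisions

    def spend(self) -> None:
        self.used += 1


def run_episode(
    step_fn: Callable[[float, float], Tuple[float, float]],
    policy: Callable[[float], float],
    initial_state: float,
    budget: DecisionBudget,
    horizon: int,
) -> Tuple[float, int]:
    """Roll out `horizon` steps, querying the policy only while budget remains.

    Assumption for this sketch: when no decision can be made, the most
    recent action is simply repeated.
    """
    state = initial_state
    action = 0.0  # default action before the first decision
    total_reward = 0.0
    for _ in range(horizon):
        if budget.can_decide():
            action = policy(state)  # a fresh decision costs one unit of budget
            budget.spend()
        state, reward = step_fn(state, action)
        total_reward += reward
    return total_reward, budget.used


def toy_step(state: float, action: float) -> Tuple[float, float]:
    """Toy 1-D task: the action shifts the state; reward is minus the distance to 0."""
    next_state = state + action
    return next_state, -abs(next_state)


def greedy_policy(state: float) -> float:
    """Jump straight to the origin; sensible only if a new decision follows every step."""
    return -state


# One decision for three steps: the first action is repeated and overshoots.
bounded_return, bounded_used = run_episode(
    toy_step, greedy_policy, 3.0, DecisionBudget(max_decisions=1), horizon=3
)
# One decision per step: the agent reaches the origin and stays there.
free_return, free_used = run_episode(
    toy_step, greedy_policy, 3.0, DecisionBudget(max_decisions=3), horizon=3
)
print(bounded_return, bounded_used)
print(free_return, free_used)
```

In this toy setting, a policy that assumes it can decide at every step does noticeably worse once decisions are rationed, which is the kind of failure the abstract attributes to existing algorithms under a decision bound.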

Code & Models

Repositories

dee0512/Temporally-Layered-Architecture
PyTorch · Official

Taxonomy

Topics: Reinforcement Learning in Robotics · Flow Experience in Various Fields

Methods: Temporally Layered Architecture · Target Policy Smoothing · Clipped Double Q-learning · Experience Replay · Dense Connections · Adam · Twin Delayed Deep Deterministic