Insect-inspired modular architectures as inductive biases for reinforcement learning

Anne E. Staples

arXiv:2604.22081·cs.LG·April 27, 2026

Insect-inspired modular architectures as inductive biases for reinforcement learning

Anne E. Staples

PDF

TL;DR

This paper introduces insect-inspired modular architectures for reinforcement learning, demonstrating improved performance and stability in complex navigation tasks by decomposing control into specialized interacting modules.

Contribution

It proposes a novel modular RL policy architecture inspired by insect neural circuits, showing advantages over centralized controllers in dynamic, multi-objective tasks.

Findings

01

Modular policies outperform centralized controllers in navigation tasks.

02

The modular approach achieves lower value loss and more stable PPO training.

03

Highly selective control allocation indicated by low module-assignment entropy.

Abstract

Most reinforcement-learning (RL) controllers used in continuous control are architecturally centralized: observations are compressed into a single latent state from which both value estimates and actions are produced. Biological control systems are often organized differently. Insects, in particular, coordinate navigation, heading stabilization, memory, and context-dependent action selection through distributed circuits rather than a single monolithic controller. Motivated by this contrast, we study an RL policy architecture that decomposes control into interacting modules for sensory encoding, heading representation, sparse associative memory, recurrent command generation, and local motor control, with a learned arbitration mechanism that allocates motor authority across modules. The model is evaluated on a two-dimensional navigation task that require simultaneous food seeking,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.