Efficient Replay Memory Architectures in Multi-Agent Reinforcement Learning for Traffic Congestion Control
Mukul Chodhary, Kevin Octavian, SooJean Han

TL;DR
This paper introduces a dual-memory replay architecture inspired by human cognition to enhance multi-agent reinforcement learning for traffic congestion control, effectively managing memory growth and improving traffic flow.
Contribution
It proposes a novel dual-memory architecture combining semantic and episodic memory with equivalence classes, addressing memory growth and improving congestion control in multi-agent systems.
Findings
Memory growth bounds are established theoretically.
Simulation results show increased vehicle throughput.
Enhanced congestion management in intersection networks.
Abstract
Episodic control, inspired by the role of episodic memory in the human brain, has been shown to improve the sample inefficiency of model-free reinforcement learning by reusing high-return past experiences. However, the memory growth of episodic control is undesirable in large-scale multi-agent problems such as vehicle traffic management. This paper proposes a novel replay memory architecture called Dual-Memory Integrated Learning, to augment to multi-agent reinforcement learning methods for congestion control via adaptive light signal scheduling. Our dual-memory architecture mimics two core capabilities of human decision-making. First, it relies on diverse types of memory--semantic and episodic, short-term and long-term--in order to remember high-return states that occur often in the network and filter out states that don't. Second, it employs equivalence classes to group together…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
