Mitigating the Stability-Plasticity Dilemma in Adaptive Train Scheduling with Curriculum-Driven Continual DQN Expansion
Achref Jaziri, Etienne K\"unzel, Visvanathan Ramesh

TL;DR
This paper introduces a curriculum-driven continual DQN expansion method to address the stability-plasticity dilemma in adaptive train scheduling, improving learning efficiency and adaptability in dynamic multi-agent environments.
Contribution
It proposes a novel algorithm, CDE, that dynamically adjusts Q-function subspaces and balances plasticity and stability using EWC and adaptive activation functions.
Findings
Significant improvements in learning efficiency over baselines
Enhanced adaptability in non-stationary environments
Effective mitigation of catastrophic forgetting
Abstract
A continual learning agent builds on previous experiences to develop increasingly complex behaviors by adapting to non-stationary and dynamic environments while preserving previously acquired knowledge. However, scaling these systems presents significant challenges, particularly in balancing the preservation of previous policies with the adaptation of new ones to current environments. This balance, known as the stability-plasticity dilemma, is especially pronounced in complex multi-agent domains such as the train scheduling problem, where environmental and agent behaviors are constantly changing, and the search space is vast. In this work, we propose addressing these challenges in the train scheduling problem using curriculum learning. We design a curriculum with adjacent skills that build on each other to improve generalization performance. Introducing a curriculum with distinct tasks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRailway Systems and Energy Efficiency
MethodsElastic Weight Consolidation
