TL;DR
TSN-Affinity introduces a similarity-guided architectural reuse method for continual offline reinforcement learning, effectively balancing task retention and knowledge sharing without relying on replay buffers.
Contribution
The paper proposes TSN-Affinity, a novel CORL approach using TinySubNetworks and Decision Transformer for task-specific parameterization and RL-aware knowledge sharing.
Findings
Strong retention achieved with sparse SubNetworks.
Routing based on action compatibility improves multi-task performance.
Similarity-guided reuse is a viable alternative to replay strategies.
Abstract
Continual offline reinforcement learning (CORL) aims to learn a sequence of tasks from datasets collected over time while preserving performance on previously learned tasks. This setting corresponds to domains where new tasks arise over time, but adapting the model in live environment interactions is expensive, risky, or impossible. However, CORL inherits the dual difficulty of offline reinforcement learning and adapting while preventing catastrophic forgetting. Replay-based continual learning approaches remain a strong baseline but incur memory overhead and suffer from a distribution mismatch between replayed samples and newly learned policies. At the same time, architectural continual learning methods have shown strong potential in supervised learning but remain underexplored in CORL. In this work, we propose TSN-Affinity, a novel CORL method based on TinySubNetworks and Decision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
