Soft Conflict-Resolution Decision Transformer for Offline Multi-Task Reinforcement Learning

Shudong Wang; Xinfei Wang; Chenhao Zhang; Shanchen Pang; Haiyuan Gui; Wenhao Ji; Xiaojian Liao

arXiv:2511.13133·cs.LG·November 18, 2025

Soft Conflict-Resolution Decision Transformer for Offline Multi-Task Reinforcement Learning

Shudong Wang, Xinfei Wang, Chenhao Zhang, Shanchen Pang, Haiyuan Gui, Wenhao Ji, Xiaojian Liao

PDF

Open Access

TL;DR

This paper introduces SoCo-DT, a novel soft conflict-resolution method for offline multi-task reinforcement learning that dynamically adjusts parameter importance and sparsity to mitigate gradient conflicts and enhance learning efficiency.

Contribution

The paper proposes a dynamic, importance-based masking and sparsity adjustment strategy for multi-task RL, improving over fixed masks and enhancing knowledge sharing across tasks.

Findings

01

SoCo-DT outperforms state-of-the-art methods by 7.6% on MT50.

02

It achieves a 10.5% improvement on the suboptimal dataset.

03

Experimental results validate the effectiveness of dynamic conflict mitigation.

Abstract

Multi-task reinforcement learning (MTRL) seeks to learn a unified policy for diverse tasks, but often suffers from gradient conflicts across tasks. Existing masking-based methods attempt to mitigate such conflicts by assigning task-specific parameter masks. However, our empirical study shows that coarse-grained binary masks have the problem of over-suppressing key conflicting parameters, hindering knowledge sharing across tasks. Moreover, different tasks exhibit varying conflict levels, yet existing methods use a one-size-fits-all fixed sparsity strategy to keep training stability and performance, which proves inadequate. These limitations hinder the model's generalization and learning efficiency. To address these issues, we propose SoCo-DT, a Soft Conflict-resolution method based by parameter importance. By leveraging Fisher information, mask values are dynamically adjusted to retain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications