Learning Representation for Multitask learning through Self Supervised Auxiliary learning
Seokwon Shin, Hyungrok Do, and Youngdoo Son

TL;DR
This paper introduces Dummy Gradient norm Regularization (DGR), a simple and efficient method to enhance the universality of shared encoder representations in multi-task learning, leading to improved prediction performance across benchmarks.
Contribution
The paper proposes DGR, a novel regularization technique that improves shared encoder representations in multi-task learning, compatible with existing methods and computationally efficient.
Findings
DGR improves multi-task prediction accuracy on benchmark datasets.
Shared representations with DGR outperform existing multi-task learning methods.
DGR is simple to implement and seamlessly integrates with current algorithms.
Abstract
Multi-task learning is a popular machine learning approach that enables simultaneous learning of multiple related tasks, improving algorithmic efficiency and effectiveness. In the hard parameter sharing approach, an encoder shared through multiple tasks generates data representations passed to task-specific predictors. Therefore, it is crucial to have a shared encoder that provides decent representations for every and each task. However, despite recent advances in multi-task learning, the question of how to improve the quality of representations generated by the shared encoder remains open. To address this gap, we propose a novel approach called Dummy Gradient norm Regularization that aims to improve the universality of the representations generated by the shared encoder. Specifically, the method decreases the norm of the gradient of the loss function with repect to dummy task-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline and Blended Learning · Online Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning
