Learning Representation for Multitask learning through Self Supervised   Auxiliary learning

Seokwon Shin; Hyungrok Do; and Youngdoo Son

arXiv:2409.16651·stat.ML·September 26, 2024

Learning Representation for Multitask learning through Self Supervised Auxiliary learning

Seokwon Shin, Hyungrok Do, and Youngdoo Son

PDF

Open Access

TL;DR

This paper introduces Dummy Gradient norm Regularization (DGR), a simple and efficient method to enhance the universality of shared encoder representations in multi-task learning, leading to improved prediction performance across benchmarks.

Contribution

The paper proposes DGR, a novel regularization technique that improves shared encoder representations in multi-task learning, compatible with existing methods and computationally efficient.

Findings

01

DGR improves multi-task prediction accuracy on benchmark datasets.

02

Shared representations with DGR outperform existing multi-task learning methods.

03

DGR is simple to implement and seamlessly integrates with current algorithms.

Abstract

Multi-task learning is a popular machine learning approach that enables simultaneous learning of multiple related tasks, improving algorithmic efficiency and effectiveness. In the hard parameter sharing approach, an encoder shared through multiple tasks generates data representations passed to task-specific predictors. Therefore, it is crucial to have a shared encoder that provides decent representations for every and each task. However, despite recent advances in multi-task learning, the question of how to improve the quality of representations generated by the shared encoder remains open. To address this gap, we propose a novel approach called Dummy Gradient norm Regularization that aims to improve the universality of the representations generated by the shared encoder. Specifically, the method decreases the norm of the gradient of the loss function with repect to dummy task-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline and Blended Learning · Online Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning