On the relationship between disentanglement and multi-task learning

{\L}ukasz Maziarka; Aleksandra Nowak; Maciej Wo{\l}czyk; Andrzej; Bedychaj

arXiv:2110.03498·cs.LG·October 8, 2021

On the relationship between disentanglement and multi-task learning

{\L}ukasz Maziarka, Aleksandra Nowak, Maciej Wo{\l}czyk, Andrzej, Bedychaj

PDF

Open Access

TL;DR

This paper investigates how disentangled representations naturally emerge during multi-task neural network training with hard parameter sharing, suggesting a close relationship between the two concepts.

Contribution

It provides an empirical analysis demonstrating that disentanglement occurs naturally in multi-task learning, highlighting the potential for reusing representations across tasks.

Findings

01

Disentanglement appears naturally during multi-task training

02

Neural networks trained on multiple tasks develop disentangled representations

03

Standard metrics confirm the emergence of disentanglement in this setting

Abstract

One of the main arguments behind studying disentangled representations is the assumption that they can be easily reused in different tasks. At the same time finding a joint, adaptable representation of data is one of the key challenges in the multi-task learning setting. In this paper, we take a closer look at the relationship between disentanglement and multi-task learning based on hard parameter sharing. We perform a thorough empirical study of the representations obtained by neural networks trained on automatically generated supervised tasks. Using a set of standard metrics we show that disentanglement appears naturally during the process of multi-task neural network training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Anomaly Detection Techniques and Applications