Synergies between Disentanglement and Sparsity: Generalization and   Identifiability in Multi-Task Learning

S\'ebastien Lachapelle; Tristan Deleu; Divyat Mahajan; Ioannis; Mitliagkas; Yoshua Bengio; Simon Lacoste-Julien; Quentin Bertrand

arXiv:2211.14666·cs.LG·June 8, 2023·1 cites

Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning

S\'ebastien Lachapelle, Tristan Deleu, Divyat Mahajan, Ioannis, Mitliagkas, Yoshua Bengio, Simon Lacoste-Julien, Quentin Bertrand

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper demonstrates that combining disentangled representations with sparse predictors enhances generalization in multi-task learning, supported by theoretical identifiability results and practical algorithms with competitive few-shot classification performance.

Contribution

It provides a new theoretical identifiability result linking sparsity and disentanglement, and proposes a practical sparsity-promoting optimization method for learning disentangled representations.

Findings

01

Sparse predictors improve generalization in multi-task learning.

02

Theoretical conditions for disentanglement with sparse predictors are established.

03

The proposed method achieves competitive few-shot classification results.

Abstract

Although disentangled representations are often said to be beneficial for downstream tasks, current empirical and theoretical understanding is limited. In this work, we provide evidence that disentangled representations coupled with sparse base-predictors improve generalization. In the context of multi-task learning, we prove a new identifiability result that provides conditions under which maximally sparse base-predictors yield disentangled representations. Motivated by this theoretical result, we propose a practical approach to learn disentangled representations based on a sparsity-promoting bi-level optimization problem. Finally, we explore a meta-learning version of this algorithm based on group Lasso multiclass SVM base-predictors, for which we derive a tractable dual formulation. It obtains competitive results on standard few-shot classification benchmarks, while each task is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tristandeleu/synergies-disentanglement-sparsity
jaxOfficial

Videos

Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Machine Learning and ELM

MethodsSupport Vector Machine