Interpretable Multi-task Learning with Shared Variable Embeddings
Maciej \.Zelaszczyk, Jacek Ma\'ndziuk

TL;DR
This paper introduces a multi-task learning system using shared variable embeddings that enhances interpretability without sacrificing accuracy, and demonstrates that sparsity in attention mechanisms improves both performance and training efficiency.
Contribution
It presents a novel multi-task learning framework with shared variable embeddings, incorporating attention sparsity for improved accuracy and interpretability.
Findings
Shared embeddings do not reduce performance compared to vanilla methods.
Sparsity in attention increases accuracy and reduces training steps.
Shared embeddings enable interpretability through concept mapping.
Abstract
This paper proposes a general interpretable predictive system with shared information. The system is able to perform predictions in a multi-task setting where distinct tasks are not bound to have the same input/output structure. Embeddings of input and output variables in a common space are obtained, where the input embeddings are produced through attending to a set of shared embeddings, reused across tasks. All the embeddings are treated as model parameters and learned. Specific restrictions on the space of shared embedings and the sparsity of the attention mechanism are considered. Experiments show that the introduction of shared embeddings does not deteriorate the results obtained from a vanilla variable embeddings method. We run a number of further ablations. Inducing sparsity in the attention mechanism leads to both an increase in accuracy and a significant decrease in the number…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
MethodsSparse Evolutionary Training
