InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning
Babak Ehteshami Bejnordi, Gaurav Kumar, Amelie Royer, Christos, Louizos, Tijmen Blankevoort, Mohsen Ghafoorian

TL;DR
InterroGate introduces a learnable gating mechanism for multi-task learning that dynamically balances shared and task-specific features, leading to improved performance and efficiency across multiple benchmarks.
Contribution
The paper presents InterroGate, a novel architecture with automatic, learnable parameter sharing and specialization for multi-task learning, reducing manual design effort and optimizing inference.
Findings
Achieves state-of-the-art results on CelebA, NYUD-v2, and PASCAL-Context.
Effectively mitigates task interference with dynamic sharing patterns.
Improves computational efficiency while maintaining high accuracy.
Abstract
Jointly learning multiple tasks with a unified model can improve accuracy and data efficiency, but it faces the challenge of task interference, where optimizing one task objective may inadvertently compromise the performance of another. A solution to mitigate this issue is to allocate task-specific parameters, free from interference, on top of shared features. However, manually designing such architectures is cumbersome, as practitioners need to balance between the overall performance across all tasks and the higher computational cost induced by the newly added parameters. In this work, we propose \textit{InterroGate}, a novel multi-task learning (MTL) architecture designed to mitigate task interference while optimizing inference computational efficiency. We employ a learnable gating mechanism to automatically balance the shared and task-specific representations while preserving the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Seismology and Earthquake Studies · Machine Learning and Algorithms
