Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels
Felix J.S. Bragman, Ryutaro Tanno, Sebastien Ourselin, Daniel C., Alexander, M. Jorge Cardoso

TL;DR
This paper introduces stochastic filter groups (SFG), a probabilistic method for learning task-specific and shared convolutional kernels in multi-task CNNs, improving performance without manual architecture design.
Contribution
The paper proposes a novel SFG mechanism with variational inference to automatically learn sharing patterns in multi-task CNNs, reducing manual tuning and enhancing flexibility.
Findings
SFG improves multi-task CNN performance over baselines.
The method generalizes well across different tasks.
It effectively learns task-specific and shared kernels.
Abstract
The performance of multi-task learning in Convolutional Neural Networks (CNNs) hinges on the design of feature sharing between tasks within the architecture. The number of possible sharing patterns are combinatorial in the depth of the network and the number of tasks, and thus hand-crafting an architecture, purely based on the human intuitions of task relationships can be time-consuming and suboptimal. In this paper, we present a probabilistic approach to learning task-specific and shared representations in CNNs for multi-task learning. Specifically, we propose "stochastic filter groups'' (SFG), a mechanism to assign convolution kernels in each layer to "specialist'' or "generalist'' groups, which are specific to or shared across different tasks, respectively. The SFG modules determine the connectivity between layers and the structures of task-specific and shared representations in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
