Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and   Generalist Convolution Kernels

Felix J.S. Bragman; Ryutaro Tanno; Sebastien Ourselin; Daniel C.; Alexander; M. Jorge Cardoso

arXiv:1908.09597·cs.CV·August 27, 2019

Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels

Felix J.S. Bragman, Ryutaro Tanno, Sebastien Ourselin, Daniel C., Alexander, M. Jorge Cardoso

PDF

TL;DR

This paper introduces stochastic filter groups (SFG), a probabilistic method for learning task-specific and shared convolutional kernels in multi-task CNNs, improving performance without manual architecture design.

Contribution

The paper proposes a novel SFG mechanism with variational inference to automatically learn sharing patterns in multi-task CNNs, reducing manual tuning and enhancing flexibility.

Findings

01

SFG improves multi-task CNN performance over baselines.

02

The method generalizes well across different tasks.

03

It effectively learns task-specific and shared kernels.

Abstract

The performance of multi-task learning in Convolutional Neural Networks (CNNs) hinges on the design of feature sharing between tasks within the architecture. The number of possible sharing patterns are combinatorial in the depth of the network and the number of tasks, and thus hand-crafting an architecture, purely based on the human intuitions of task relationships can be time-consuming and suboptimal. In this paper, we present a probabilistic approach to learning task-specific and shared representations in CNNs for multi-task learning. Specifically, we propose "stochastic filter groups'' (SFG), a mechanism to assign convolution kernels in each layer to "specialist'' or "generalist'' groups, which are specific to or shared across different tasks, respectively. The SFG modules determine the connectivity between layers and the structures of task-specific and shared representations in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution