Cross-stitch Networks for Multi-task Learning

Ishan Misra; Abhinav Shrivastava; Abhinav Gupta; Martial; Hebert

arXiv:1604.03539·cs.CV·April 13, 2016

Cross-stitch Networks for Multi-task Learning

Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial, Hebert

PDF

4 Repos 1 Video

TL;DR

This paper introduces cross-stitch units for multi-task learning in ConvNets, enabling the network to learn optimal shared and task-specific features, leading to improved performance across multiple tasks.

Contribution

It proposes a novel cross-stitch unit that allows end-to-end learning of shared representations in multi-task ConvNets, generalizing across tasks and improving accuracy.

Findings

01

Significant performance improvements over baselines.

02

Effective sharing of representations across multiple tasks.

03

Better generalization with fewer training examples.

Abstract

Multi-task learning in Convolutional Networks has displayed remarkable success in the field of recognition. This success can be largely attributed to learning shared representations from multiple supervisory tasks. However, existing multi-task approaches rely on enumerating multiple network architectures specific to the tasks at hand, that do not generalize. In this paper, we propose a principled approach to learn shared representations in ConvNets using multi-task learning. Specifically, we propose a new sharing unit: "cross-stitch" unit. These units combine the activations from multiple networks and can be trained end-to-end. A network with cross-stitch units can learn an optimal combination of shared and task-specific representations. Our proposed method generalizes across multiple tasks and shows dramatically improved performance over baseline methods for categories with few…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Cross-Stitch Networks for Multi-Task Learning· youtube