Hybrid Task Cascade for Instance Segmentation
Kai Chen, Jiangmiao Pang, Jiaqi Wang, Yu Xiong, Xiaoxiao Li, Shuyang, Sun, Wansen Feng, Ziwei Liu, Jianping Shi, Wanli Ouyang, Chen Change Loy,, Dahua Lin

TL;DR
The paper introduces Hybrid Task Cascade (HTC), a novel framework that interweaves detection and segmentation tasks in a multi-stage process, significantly improving instance segmentation performance on COCO dataset.
Contribution
It proposes a joint multi-stage framework that fully leverages the reciprocal relationship between detection and segmentation tasks, with a convolutional branch for spatial context.
Findings
Achieves 38.4 AP over Cascade Mask R-CNN baseline.
Attains 48.6 mask AP on COCO test-challenge split.
Ranks 1st in COCO 2018 Challenge Object Detection Task.
Abstract
Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question. A simple combination of Cascade R-CNN and Mask R-CNN only brings limited gain. In exploring a more effective approach, we find that the key to a successful instance segmentation cascade is to fully leverage the reciprocal relationship between detection and segmentation. In this work, we propose a new framework, Hybrid Task Cascade (HTC), which differs in two important aspects: (1) instead of performing cascaded refinement on these two tasks separately, it interweaves them for a joint multi-stage processing; (2) it adopts a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background. Overall, this framework can learn more discriminative features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsRegion Proposal Network · Average Pooling · ResNeXt Block · Sigmoid Activation · Softmax · Dense Connections · Squeeze-and-Excitation Block · SENet · Feature Pyramid Network · Bottom-up Path Augmentation
