BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks   for Semantic Segmentation

Jifeng Dai; Kaiming He; Jian Sun

arXiv:1503.01640·cs.CV·May 19, 2015·147 cites

BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

Jifeng Dai, Kaiming He, Jian Sun

PDF

Open Access

TL;DR

BoxSup is a novel method that uses bounding box annotations instead of pixel-level masks to train convolutional networks for semantic segmentation, achieving competitive and state-of-the-art results.

Contribution

We introduce BoxSup, a technique that iteratively generates segmentation masks from bounding boxes to train deep networks without requiring pixel-level annotations.

Findings

01

Achieves results comparable to fully supervised methods using only bounding boxes.

02

Outperforms previous weakly supervised approaches on PASCAL datasets.

03

Enables leveraging large amounts of bounding box data for improved segmentation.

Abstract

Recent leading approaches to semantic segmentation rely on deep convolutional networks trained with human-annotated, pixel-level segmentation masks. Such pixel-accurate supervision demands expensive labeling effort and limits the performance of deep networks that usually benefit from more training data. In this paper, we propose a method that achieves competitive accuracy but only requires easily obtained bounding box annotations. The basic idea is to iterate between automatically generating region proposals and training convolutional networks. These two steps gradually recover segmentation masks for improving the networks, and vise versa. Our method, called BoxSup, produces competitive results supervised by boxes only, on par with strong baselines fully supervised by masks under the same setting. By leveraging a large amount of bounding boxes, BoxSup further unleashes the power of deep…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning