PixelNet: Representation of the pixels, by the pixels, and for the   pixels

Aayush Bansal; Xinlei Chen; Bryan Russell; Abhinav Gupta; Deva Ramanan

arXiv:1702.06506·cs.CV·February 27, 2017·94 cites

PixelNet: Representation of the pixels, by the pixels, and for the pixels

Aayush Bansal, Xinlei Chen, Bryan Russell, Abhinav Gupta, Deva Ramanan

PDF

Open Access 1 Repo

TL;DR

PixelNet introduces a pixel sampling strategy that enhances learning efficiency and accuracy across various pixel-level prediction tasks, achieving state-of-the-art results by training from scratch.

Contribution

The paper proposes stratified pixel sampling to improve training efficiency and model performance for diverse pixel prediction tasks, enabling training from scratch.

Findings

01

Achieved state-of-the-art results on PASCAL-Context, NYUDv2, and BSDS datasets.

02

Stratified sampling speeds up learning and improves accuracy.

03

Effective training of complex models from scratch.

Abstract

We explore design principles for general pixel-level prediction problems, from low-level edge detection to mid-level surface normal estimation to high-level semantic segmentation. Convolutional predictors, such as the fully-convolutional network (FCN), have achieved remarkable success by exploiting the spatial redundancy of neighboring pixels through convolutional processing. Though computationally efficient, we point out that such approaches are not statistically efficient during learning precisely because spatial redundancy limits the information learned from neighboring pixels. We demonstrate that stratified sampling of pixels allows one to (1) add diversity during batch updates, speeding up learning; (2) explore complex nonlinear predictors, improving accuracy; and (3) efficiently train state-of-the-art models tabula rasa (i.e., "from scratch") for diverse pixel-labeling tasks. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bdecost/pixelnet
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Domain Adaptation and Few-Shot Learning