PixelNet: Representation of the pixels, by the pixels, and for the pixels
Aayush Bansal, Xinlei Chen, Bryan Russell, Abhinav Gupta, Deva Ramanan

TL;DR
PixelNet introduces a pixel sampling strategy that enhances learning efficiency and accuracy across various pixel-level prediction tasks, achieving state-of-the-art results by training from scratch.
Contribution
The paper proposes stratified pixel sampling to improve training efficiency and model performance for diverse pixel prediction tasks, enabling training from scratch.
Findings
Achieved state-of-the-art results on PASCAL-Context, NYUDv2, and BSDS datasets.
Stratified sampling speeds up learning and improves accuracy.
Effective training of complex models from scratch.
Abstract
We explore design principles for general pixel-level prediction problems, from low-level edge detection to mid-level surface normal estimation to high-level semantic segmentation. Convolutional predictors, such as the fully-convolutional network (FCN), have achieved remarkable success by exploiting the spatial redundancy of neighboring pixels through convolutional processing. Though computationally efficient, we point out that such approaches are not statistically efficient during learning precisely because spatial redundancy limits the information learned from neighboring pixels. We demonstrate that stratified sampling of pixels allows one to (1) add diversity during batch updates, speeding up learning; (2) explore complex nonlinear predictors, improving accuracy; and (3) efficiently train state-of-the-art models tabula rasa (i.e., "from scratch") for diverse pixel-labeling tasks. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Domain Adaptation and Few-Shot Learning
