ParseNet: Looking Wider to See Better
Wei Liu, Andrew Rabinovich, Alexander C. Berg

TL;DR
ParseNet introduces a simple global context augmentation to deep networks for semantic segmentation, significantly improving accuracy with minimal additional computation, and achieves state-of-the-art results on multiple benchmarks.
Contribution
The paper proposes ParseNet, a method that incorporates global average features into convolutional networks, enhancing segmentation performance beyond existing baselines.
Findings
Achieves state-of-the-art on SiftFlow and PASCAL-Context.
Improves baseline performance significantly with global features.
Maintains low additional computational cost.
Abstract
We present a technique for adding global context to deep convolutional networks for semantic segmentation. The approach is simple, using the average feature for a layer to augment the features at each location. In addition, we study several idiosyncrasies of training, significantly increasing the performance of baseline networks (e.g. from FCN). When we add our proposed global feature, and a technique for learning normalization parameters, accuracy increases consistently even over our improved versions of the baselines. Our proposed approach, ParseNet, achieves state-of-the-art performance on SiftFlow and PASCAL-Context with small additional computational cost over baselines, and near current state-of-the-art performance on PASCAL VOC 2012 semantic segmentation with a simple approach. Code is available at https://github.com/weiliu89/caffe/tree/fcn .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
