TL;DR
This paper introduces higher order potentials into deep neural network-based CRFs for semantic segmentation, enabling end-to-end training and achieving state-of-the-art results on PASCAL VOC.
Contribution
It presents a novel way to incorporate higher order potentials based on object detections and superpixels into CRFs within deep networks, allowing end-to-end learning.
Findings
Achieved state-of-the-art segmentation on PASCAL VOC
Demonstrated effective integration of higher order potentials
Enabled end-to-end training of the CRF with CNNs
Abstract
We address the problem of semantic segmentation using deep learning. Most segmentation systems include a Conditional Random Field (CRF) to produce a structured output that is consistent with the image's visual features. Recent deep learning approaches have incorporated CRFs into Convolutional Neural Networks (CNNs), with some even training the CRF end-to-end with the rest of the network. However, these approaches have not employed higher order potentials, which have previously been shown to significantly improve segmentation performance. In this paper, we demonstrate that two types of higher order potential, based on object detections and superpixels, can be included in a CRF embedded within a deep network. We design these higher order potentials to allow inference with the differentiable mean field algorithm. As a result, all the parameters of our richer CRF model can be learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConditional Random Field
