TL;DR
This paper introduces CRF-RNN, a novel deep neural network that integrates Conditional Random Fields with CNNs, enabling end-to-end training for improved pixel-level image segmentation.
Contribution
It formulates CRF inference as a Recurrent Neural Network, allowing seamless integration with CNNs for end-to-end training in semantic segmentation tasks.
Findings
Achieved top results on Pascal VOC 2012 segmentation benchmark.
Successfully integrated CRF modeling into CNNs for pixel-level labelling.
Enabled end-to-end training of the combined model.
Abstract
Pixel-level labelling tasks, such as semantic segmentation, play a central role in image understanding. Recent approaches have attempted to harness the capabilities of deep learning techniques for image recognition to tackle pixel-level labelling tasks. One central issue in this methodology is the limited capacity of deep learning techniques to delineate visual objects. To solve this problem, we introduce a new form of convolutional neural network that combines the strengths of Convolutional Neural Networks (CNNs) and Conditional Random Fields (CRFs)-based probabilistic graphical modelling. To this end, we formulate mean-field approximate inference for the Conditional Random Fields with Gaussian pairwise potentials as Recurrent Neural Networks. This network, called CRF-RNN, is then plugged in as a part of a CNN to obtain a deep network that has desirable properties of both CNNs and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMax Pooling · Convolution · Softmax · SGD with Momentum · Fully Convolutional Network · CRF-RNN · Conditional Random Field
