Semantic Instance Segmentation with a Discriminative Loss Function
Bert De Brabandere, Davy Neven, Luc Van Gool

TL;DR
This paper introduces a simple yet effective pixel-level discriminative loss function for semantic instance segmentation, enabling clustering-based post-processing without complex mechanisms, achieving competitive results on standard benchmarks.
Contribution
Proposes a novel discriminative loss function for pixel embeddings that simplifies instance segmentation without relying on object proposals or recurrent models.
Findings
Achieves competitive performance on Cityscapes and CVPPP benchmarks.
Demonstrates effectiveness of simple clustering-based approach.
Avoids limitations of detect-and-segment methods.
Abstract
Semantic instance segmentation remains a challenging task. In this work we propose to tackle the problem with a discriminative loss function, operating at the pixel level, that encourages a convolutional network to produce a representation of the image that can easily be clustered into instances with a simple post-processing step. The loss function encourages the network to map each pixel to a point in feature space so that pixels belonging to the same instance lie close together while different instances are separated by a wide margin. Our approach of combining an off-the-shelf network with a principled loss function inspired by a metric learning objective is conceptually simple and distinct from recent efforts in instance segmentation. In contrast to previous works, our method does not rely on object proposals or recurrent mechanisms. A key contribution of our work is to demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques
