End-to-End Segmentation via Patch-wise Polygons Prediction
Tal Shaharabany, Lior Wolf

TL;DR
This paper introduces a novel segmentation approach that models object edges as patch-wise polygons, optimized via a differentiable renderer, achieving state-of-the-art results across multiple benchmarks.
Contribution
The paper proposes an alternative to pixel-based segmentation by representing object boundaries as polygons per image patch, optimized through a differentiable rendering process.
Findings
Achieved 76.26% mIoU on Cityscapes validation
Attained 90.92% IoU on Vaihingen building benchmark
Secured 66.82% IoU on MoNU microscopy dataset
Abstract
The leading segmentation methods represent the output map as a pixel grid. We study an alternative representation in which the object edges are modeled, per image patch, as a polygon with vertices that is coupled with per-patch label probabilities. The vertices are optimized by employing a differentiable neural renderer to create a raster image. The delineated region is then compared with the ground truth segmentation. Our method obtains multiple state-of-the-art results: 76.26\% mIoU on the Cityscapes validation, 90.92\% IoU on the Vaihingen building segmentation benchmark, 66.82\% IoU for the MoNU microscopy dataset, and 90.91\% for the bird benchmark CUB. Our code for training and reproducing these results is attached as supplementary.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Remote Sensing and LiDAR Applications · Advanced Image and Video Retrieval Techniques
