End-to-End Segmentation via Patch-wise Polygons Prediction

Tal Shaharabany; Lior Wolf

arXiv:2112.02535·cs.CV·December 7, 2021

End-to-End Segmentation via Patch-wise Polygons Prediction

Tal Shaharabany, Lior Wolf

PDF

Open Access

TL;DR

This paper introduces a novel segmentation approach that models object edges as patch-wise polygons, optimized via a differentiable renderer, achieving state-of-the-art results across multiple benchmarks.

Contribution

The paper proposes an alternative to pixel-based segmentation by representing object boundaries as polygons per image patch, optimized through a differentiable rendering process.

Findings

01

Achieved 76.26% mIoU on Cityscapes validation

02

Attained 90.92% IoU on Vaihingen building benchmark

03

Secured 66.82% IoU on MoNU microscopy dataset

Abstract

The leading segmentation methods represent the output map as a pixel grid. We study an alternative representation in which the object edges are modeled, per image patch, as a polygon with $k$ vertices that is coupled with per-patch label probabilities. The vertices are optimized by employing a differentiable neural renderer to create a raster image. The delineated region is then compared with the ground truth segmentation. Our method obtains multiple state-of-the-art results: 76.26\% mIoU on the Cityscapes validation, 90.92\% IoU on the Vaihingen building segmentation benchmark, 66.82\% IoU for the MoNU microscopy dataset, and 90.91\% for the bird benchmark CUB. Our code for training and reproducing these results is attached as supplementary.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Remote Sensing and LiDAR Applications · Advanced Image and Video Retrieval Techniques