SIN:Superpixel Interpolation Network

Qing Yuan; Songfeng Lu; Yan Huang; Wuxin Sha

arXiv:2110.08702·cs.CV·October 19, 2021

SIN:Superpixel Interpolation Network

Qing Yuan, Songfeng Lu, Yan Huang, Wuxin Sha

PDF

Open Access 1 Repo

TL;DR

SIN introduces a deep learning-based superpixel segmentation method that enforces spatial connectivity from the start, enabling end-to-end integration with downstream tasks and achieving real-time performance.

Contribution

The paper presents SIN, a novel superpixel segmentation algorithm that is fully differentiable, enforces spatial connectivity, and operates in real-time, unlike traditional methods.

Findings

01

Runs at about 80fps, enabling real-time applications.

02

Performs favorably against state-of-the-art superpixel methods.

03

Effective loss function reduces training time.

Abstract

Superpixels have been widely used in computer vision tasks due to their representational and computational efficiency. Meanwhile, deep learning and end-to-end framework have made great progress in various fields including computer vision. However, existing superpixel algorithms cannot be integrated into subsequent tasks in an end-to-end way. Traditional algorithms and deep learning-based algorithms are two main streams in superpixel segmentation. The former is non-differentiable and the latter needs a non-differentiable post-processing step to enforce connectivity, which constraints the integration of superpixels and downstream tasks. In this paper, we propose a deep learning-based superpixel segmentation algorithm SIN which can be integrated with downstream tasks in an end-to-end way. Owing to some downstream tasks such as visual tracking require real-time speed, the speed of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuanqqq/sin
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Medical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings