Fast Soft Color Segmentation

Naofumi Akimoto; Huachun Zhu; Yanghua Jin; Yoshimitsu Aoki

arXiv:2004.08096·cs.CV·April 20, 2020·1 cites

Fast Soft Color Segmentation

Naofumi Akimoto, Huachun Zhu, Yanghua Jin, Yoshimitsu Aoki

PDF

Open Access 1 Video

TL;DR

This paper introduces a neural network-based method for fast soft color image segmentation into RGBA layers, enabling real-time applications like video editing with high quality and significantly improved speed.

Contribution

The authors propose a single-pass neural network approach for soft color segmentation that outperforms previous iterative methods in speed while maintaining comparable quality.

Findings

01

Achieves 300,000x faster inference than previous methods.

02

Produces qualitative and quantitative results comparable to state-of-the-art.

03

Demonstrates effective application in video editing scenarios.

Abstract

We address the problem of soft color segmentation, defined as decomposing a given image into several RGBA layers, each containing only homogeneous color regions. The resulting layers from decomposition pave the way for applications that benefit from layer-based editing, such as recoloring and compositing of images and videos. The current state-of-the-art approach for this problem is hindered by slow processing time due to its iterative nature, and consequently does not scale to certain real-world scenarios. To address this issue, we propose a neural network based method for this task that decomposes a given image into multiple layers in a single forward pass. Furthermore, our method separately decomposes the color layers and the alpha channel layers. By leveraging a novel training objective, our method achieves proper assignment of colors amongst layers. As a consequence, our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Fast Soft Color Segmentation· youtube

Taxonomy

TopicsImage Enhancement Techniques · Advanced Vision and Imaging · Advanced Image Processing Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings