Fast Soft Color Segmentation
Naofumi Akimoto, Huachun Zhu, Yanghua Jin, Yoshimitsu Aoki

TL;DR
This paper introduces a neural network-based method for fast soft color image segmentation into RGBA layers, enabling real-time applications like video editing with high quality and significantly improved speed.
Contribution
The authors propose a single-pass neural network approach for soft color segmentation that outperforms previous iterative methods in speed while maintaining comparable quality.
Findings
Achieves 300,000x faster inference than previous methods.
Produces qualitative and quantitative results comparable to state-of-the-art.
Demonstrates effective application in video editing scenarios.
Abstract
We address the problem of soft color segmentation, defined as decomposing a given image into several RGBA layers, each containing only homogeneous color regions. The resulting layers from decomposition pave the way for applications that benefit from layer-based editing, such as recoloring and compositing of images and videos. The current state-of-the-art approach for this problem is hindered by slow processing time due to its iterative nature, and consequently does not scale to certain real-world scenarios. To address this issue, we propose a neural network based method for this task that decomposes a given image into multiple layers in a single forward pass. Furthermore, our method separately decomposes the color layers and the alpha channel layers. By leveraging a novel training objective, our method achieves proper assignment of colors amongst layers. As a consequence, our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Fast Soft Color Segmentation· youtube
Taxonomy
TopicsImage Enhancement Techniques · Advanced Vision and Imaging · Advanced Image Processing Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
