Deep Automatic Natural Image Matting
Jizhizi Li, Jing Zhang, Dacheng Tao

TL;DR
This paper introduces a novel end-to-end deep learning model for automatic natural image matting that predicts a generalized trimap and effectively handles diverse foreground types, outperforming existing methods.
Contribution
We propose a new end-to-end neural network that generates a generalized trimap and guides the matting process, addressing limitations of prior methods on complex natural images.
Findings
Our model outperforms existing methods in objective and subjective evaluations.
The constructed AIM-500 dataset enables benchmarking of generalization ability.
The approach effectively handles diverse foreground types including transparent and non-salient objects.
Abstract
Automatic image matting (AIM) refers to estimating the soft foreground from an arbitrary natural image without any auxiliary input like trimap, which is useful for image editing. Prior methods try to learn semantic features to aid the matting process while being limited to images with salient opaque foregrounds such as humans and animals. In this paper, we investigate the difficulties when extending them to natural images with salient transparent/meticulous foregrounds or non-salient foregrounds. To address the problem, a novel end-to-end matting network is proposed, which can predict a generalized trimap for any image of the above types as a unified semantic representation. Simultaneously, the learned semantic features guide the matting network to focus on the transition areas via an attention mechanism. We also construct a test set AIM-500 that contains 500 diverse natural images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Visual Attention and Saliency Detection · Generative Adversarial Networks and Image Synthesis
