SGM-Net: Semantic Guided Matting Net
Qing Song, Wenfeng Sun, Donghan Yang, Mengjie Hu, Chun Liu

TL;DR
SGM-Net introduces a semantic guidance module to improve human image matting using only a single image, achieving significant performance gains over existing methods.
Contribution
The paper proposes a novel semantic guidance module added to MODNet, enabling high-quality human matting from a single image without extra inputs or complex models.
Findings
Significant improvement in evaluation metrics on P3M-10k dataset.
Effective human matting with only one image input.
Outperforms benchmark methods like MODNet.
Abstract
Human matting refers to extracting human parts from natural images with high quality, including human detail information such as hair, glasses, hat, etc. This technology plays an essential role in image synthesis and visual effects in the film industry. When the green screen is not available, the existing human matting methods need the help of additional inputs (such as trimap, background image, etc.), or the model with high computational cost and complex network structure, which brings great difficulties to the application of human matting in practice. To alleviate such problems, most existing methods (such as MODNet) use multi-branches to pave the way for matting through segmentation, but these methods do not make full use of the image features and only utilize the prediction results of the network as guidance information. Therefore, we propose a module to generate foreground…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Color Science and Applications · Visual Attention and Saliency Detection
MethodsMODNet
