Toward Real-World High-Precision Image Matting and Segmentation

Haipeng Zhou; Zhaohu Xing; Hongqiu Wang; Jun Ma; Ping Li; Lei Zhu

arXiv:2601.12080·cs.CV·January 21, 2026

Toward Real-World High-Precision Image Matting and Segmentation

Haipeng Zhou, Zhaohu Xing, Hongqiu Wang, Jun Ma, Ping Li, Lei Zhu

PDF

Open Access 1 Video

TL;DR

This paper introduces FCLM, a novel model for high-precision image matting and segmentation that leverages depth-aware distillation, domain-invariant learning, and an object-oriented decoder to improve real-world performance and generalization.

Contribution

The paper proposes FCLM, integrating depth-aware distillation, domain-invariant learning, and an object-oriented decoder for enhanced scene parsing and interactive segmentation.

Findings

01

Outperforms state-of-the-art methods quantitatively.

02

Achieves superior qualitative segmentation results.

03

Effectively generalizes to real-world scenarios.

Abstract

High-precision scene parsing tasks, including image matting and dichotomous segmentation, aim to accurately predict masks with extremely fine details (such as hair). Most existing methods focus on salient, single foreground objects. While interactive methods allow for target adjustment, their class-agnostic design restricts generalization across different categories. Furthermore, the scarcity of high-quality annotation has led to a reliance on inharmonious synthetic data, resulting in poor generalization to real-world scenarios. To this end, we propose a Foreground Consistent Learning model, dubbed as FCLM, to address the aforementioned issues. Specifically, we first introduce a Depth-Aware Distillation strategy where we transfer the depth-related knowledge for better foreground representation. Considering the data dilemma, we term the processing of synthetic data as domain adaptation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Toward Real-World High-Precision Image Matting and Segmentation· underline

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Image Enhancement Techniques