TL;DR
This paper introduces a new dichotomous image segmentation task, creates a large high-resolution dataset, proposes a baseline model, and evaluates multiple models with a new human correction effort metric, advancing segmentation research.
Contribution
The paper presents the first large-scale DIS dataset, a simple intermediate supervision baseline, and a new evaluation metric, facilitating future research and applications in precise object segmentation.
Findings
IS-Net outperforms existing baselines on DIS5K.
HCE metric effectively measures model correction effort.
Benchmark results reveal insights into object complexity and model performance.
Abstract
We present a systematic study on a new task called dichotomous image segmentation (DIS) , which aims to segment highly accurate objects from natural images. To this end, we collected the first large-scale DIS dataset, called DIS5K, which contains 5,470 high-resolution (e.g., 2K, 4K or larger) images covering camouflaged, salient, or meticulous objects in various backgrounds. DIS is annotated with extremely fine-grained labels. Besides, we introduce a simple intermediate supervision baseline (IS-Net) using both feature-level and mask-level guidance for DIS model training. IS-Net outperforms various cutting-edge baselines on the proposed DIS5K, making it a general self-learned supervision network that can facilitate future research in DIS. Further, we design a new metric called human correction efforts (HCE) which approximates the number of mouse clicking operations required to correct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
