ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation
Laurynas Karazija, Iro Laina, Christian Rupprecht

TL;DR
ClevrTex is a new benchmark with complex textured scenes designed to evaluate and challenge unsupervised multi-object segmentation models, revealing their limitations in realistic, texture-rich environments.
Contribution
The paper introduces ClevrTex, a textured scene dataset for benchmarking unsupervised segmentation, and demonstrates current models' struggles with complex textures.
Findings
State-of-the-art models perform poorly on textured scenes.
Models excel on simple scenes but fail with complex textures.
ClevrTex reveals specific shortcomings of existing algorithms.
Abstract
There has been a recent surge in methods that aim to decompose and segment scenes into multiple objects in an unsupervised manner, i.e., unsupervised multi-object segmentation. Performing such a task is a long-standing goal of computer vision, offering to unlock object-level reasoning without requiring dense annotations to train segmentation models. Despite significant progress, current models are developed and trained on visually simple scenes depicting mono-colored objects on plain backgrounds. The natural world, however, is visually complex with confounding aspects such as diverse textures and complicated lighting effects. In this study, we present a new benchmark called ClevrTex, designed as the next challenge to compare, evaluate and analyze algorithms. ClevrTex features synthetic scenes with diverse shapes, textures and photo-mapped materials, created using physically based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Robotics and Sensor-Based Localization
