Data-Driven Scene Understanding with Adaptively Retrieved Exemplars
Xionghao Liu, Wei Yang, Liang Lin, Qing Wang, Zhaoquan Cai, Jianhuang, Lai

TL;DR
This paper introduces a novel data-driven method for scene understanding that retrieves exemplars from an image database and propagates semantics to segment images without pixelwise annotations, using a probabilistic EM framework.
Contribution
It proposes a mutually conditional, EM-based framework for semantic segmentation that jointly retrieves references and propagates labels without requiring pixelwise training.
Findings
Outperforms state-of-the-art in semantic segmentation
Effective for image annotation tasks
Validated on two public datasets
Abstract
This article investigates a data-driven approach for semantically scene understanding, without pixelwise annotation and classifier training. Our framework parses a target image with two steps: (i) retrieving its exemplars (i.e. references) from an image database, where all images are unsegmented but annotated with tags; (ii) recovering its pixel labels by propagating semantics from the references. We present a novel framework making the two steps mutually conditional and bootstrapped under the probabilistic Expectation-Maximization (EM) formulation. In the first step, the references are selected by jointly matching their appearances with the target as well as the semantics (i.e. the assigned labels of the target and the references). We process the second step via a combinatorial graphical representation, in which the vertices are superpixels extracted from the target and its selected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
