Interpretable Foreground Object Search As Knowledge Distillation
Boren Li, Po-Yu Zhuang, Jian Gu, Mingyang Li, Ping Tan

TL;DR
This paper introduces a knowledge distillation approach for foreground object search that enhances interpretability and efficiency by learning interchangeable foreground patterns, resulting in improved accuracy and controllability in object retrieval.
Contribution
It presents a novel pattern-level dataset, a knowledge distillation framework for compatibility learning, and achieves significant performance improvements over previous methods.
Findings
Outperforms previous state-of-the-art by 10.42% in mAP
Introduces a pattern-level dataset for foreground search
Enables efficient and interpretable instance-level search
Abstract
This paper proposes a knowledge distillation method for foreground object search (FoS). Given a background and a rectangle specifying the foreground location and scale, FoS retrieves compatible foregrounds in a certain category for later image composition. Foregrounds within the same category can be grouped into a small number of patterns. Instances within each pattern are compatible with any query input interchangeably. These instances are referred to as interchangeable foregrounds. We first present a pipeline to build pattern-level FoS dataset containing labels of interchangeable foregrounds. We then establish a benchmark dataset for further training and testing following the pipeline. As for the proposed method, we first train a foreground encoder to learn representations of interchangeable foregrounds. We then train a query encoder to learn query-foreground compatibility following a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsKnowledge Distillation
