Object-Level Targeted Selection via Deep Template Matching
Suraj Kothawade, Donna Roy, Michele Fenzi, Elmar Haussmann, Jose M., Alvarez, Christoph Angerer

TL;DR
This paper introduces a fast, robust object-level template matching algorithm in the DNN feature space for semantic image retrieval, especially effective for small, occluded objects in cluttered scenes, without needing extra labeled data.
Contribution
The authors propose a novel template matching method in DNN feature space that improves object-level retrieval for small, occluded objects without additional labeled data.
Findings
High recall in mining images with small objects
Effective retrieval in cluttered scenes with multiple objects
Outperforms existing semantic retrieval methods
Abstract
Retrieving images with objects that are semantically similar to objects of interest (OOI) in a query image has many practical use cases. A few examples include fixing failures like false negatives/positives of a learned model or mitigating class imbalance in a dataset. The targeted selection task requires finding the relevant data from a large-scale pool of unlabeled data. Manual mining at this scale is infeasible. Further, the OOI are often small and occupy less than 1% of image area, are occluded, and co-exist with many semantically different objects in cluttered scenes. Existing semantic image retrieval methods often focus on mining for larger sized geographical landmarks, and/or require extra labeled data, such as images/image-pairs with similar objects, for mining images with generic objects. We propose a fast and robust template matching algorithm in the DNN feature space, that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications
