Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval
Dingrong Wang, Hitesh Sapkota, Xumin Liu, Qi Yu

TL;DR
This paper introduces a deep reinforcement learning framework for partial sketch-based image retrieval that enhances accuracy and robustness to noise by focusing attention on important sketch regions, outperforming existing methods.
Contribution
The paper proposes a novel deep reinforcement learning model with dual-level exploration and attention mechanisms specifically designed for partial, noisy sketches in image retrieval tasks.
Findings
Achieves state-of-the-art performance on three public datasets.
Significantly improves retrieval accuracy with partial and noisy sketches.
Robustness to unnecessary strokes and noise in sketches.
Abstract
Fine-Grained Sketch-Based Image Retrieval (FG-SBIR) aims at finding a specific image from a large gallery given a query sketch. Despite the widespread applicability of FG-SBIR in many critical domains (e.g., crime activity tracking), existing approaches still suffer from a low accuracy while being sensitive to external noises such as unnecessary strokes in the sketch. The retrieval performance will further deteriorate under a more practical on-the-fly setting, where only a partially complete sketch with only a few (noisy) strokes are available to retrieve corresponding images. We propose a novel framework that leverages a uniquely designed deep reinforcement learning model that performs a dual-level exploration to deal with partial sketch training and attention region selection. By enforcing the model's attention on the important regions of the original sketches, it remains robust to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
