BDA-SketRet: Bi-Level Domain Adaptation for Zero-Shot SBIR
Ushasi Chaudhuri, Ruchika Chavan, Biplab Banerjee, Anjan Dutta, Zeynep, Akata

TL;DR
This paper introduces BDA-SketRet, a bi-level domain adaptation framework for zero-shot sketch-based image retrieval that aligns spatial and semantic features to improve retrieval accuracy across multiple datasets.
Contribution
It proposes a novel bi-level domain adaptation approach with a symmetric information bottleneck loss and a topology-preserving semantic projection network for ZS-SBIR.
Findings
Significant performance improvements on Sketchy, TU-Berlin, and QuickDraw datasets.
Effective alignment of spatial and semantic features reduces intra-class variance.
Enhanced discriminativeness of the shared latent space.
Abstract
The efficacy of zero-shot sketch-based image retrieval (ZS-SBIR) models is governed by two challenges. The immense distributions-gap between the sketches and the images requires a proper domain alignment. Moreover, the fine-grained nature of the task and the high intra-class variance of many categories necessitates a class-wise discriminative mapping among the sketch, image, and the semantic spaces. Under this premise, we propose BDA-SketRet, a novel ZS-SBIR framework performing a bi-level domain adaptation for aligning the spatial and semantic features of the visual data pairs progressively. In order to highlight the shared features and reduce the effects of any sketch or image-specific artifacts, we propose a novel symmetric loss function based on the notion of information bottleneck for aligning the semantic features while a cross-entropy-based adversarial loss is introduced to align…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
