Semi-Heterogeneous Three-Way Joint Embedding Network for Sketch-Based Image Retrieval
Jianjun Lei, Yuxin Song, Bo Peng, Zhanyu Ma, Ling Shao, Yi-Zhe Song

TL;DR
This paper introduces Semi3-Net, a novel three-branch network that effectively aligns sketches, natural images, and edgemaps into a shared semantic space, significantly improving sketch-based image retrieval performance.
Contribution
The paper proposes a semi-heterogeneous three-way joint embedding network with a co-attention mechanism and hybrid loss for better cross-domain feature alignment in SBIR.
Findings
Outperforms state-of-the-art on Sketchy and TU-Berlin datasets
Effectively aligns sketches, images, and edgemaps in a shared semantic space
Demonstrates superior retrieval accuracy compared to existing methods
Abstract
Sketch-based image retrieval (SBIR) is a challenging task due to the large cross-domain gap between sketches and natural images. How to align abstract sketches and natural images into a common high-level semantic space remains a key problem in SBIR. In this paper, we propose a novel semi-heterogeneous three-way joint embedding network (Semi3-Net), which integrates three branches (a sketch branch, a natural image branch, and an edgemap branch) to learn more discriminative cross-domain feature representations for the SBIR task. The key insight lies with how we cultivate the mutual and subtle relationships amongst the sketches, natural images, and edgemaps. A semi-heterogeneous feature mapping is designed to extract bottom features from each domain, where the sketch and edgemap branches are shared while the natural image branch is heterogeneous to the other branches. In addition, a joint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
