# Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based   Image Retrieval

**Authors:** Anjan Dutta, Zeynep Akata

arXiv: 1903.03372 · 2019-03-11

## TL;DR

This paper introduces SEM-PCYC, a novel generative model for zero-shot sketch-based image retrieval that aligns visual and semantic information without requiring paired data, significantly improving performance.

## Contribution

The paper proposes a semantically aligned cycle-consistent generative model that uses adversarial training and feature selection auto-encoder for zero-shot SBIR, avoiding paired data dependency.

## Key findings

- Significant performance boost over state-of-the-art on Sketchy and TU-Berlin datasets.
- Effective semantic alignment without aligned sketch-image pairs.
- Utilizes combined textual and hierarchical side information.

## Abstract

Zero-shot sketch-based image retrieval (SBIR) is an emerging task in computer vision, allowing to retrieve natural images relevant to sketch queries that might not been seen in the training phase. Existing works either require aligned sketch-image pairs or inefficient memory fusion layer for mapping the visual information to a semantic space. In this work, we propose a semantically aligned paired cycle-consistent generative (SEM-PCYC) model for zero-shot SBIR, where each branch maps the visual information to a common semantic space via an adversarial training. Each of these branches maintains a cycle consistency that only requires supervision at category levels, and avoids the need of highly-priced aligned sketch-image pairs. A classification criteria on the generators' outputs ensures the visual to semantic space mapping to be discriminating. Furthermore, we propose to combine textual and hierarchical side information via a feature selection auto-encoder that selects discriminating side information within a same end-to-end model. Our results demonstrate a significant boost in zero-shot SBIR performance over the state-of-the-art on the challenging Sketchy and TU-Berlin datasets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.03372/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1903.03372/full.md

## References

68 references — full list in the complete paper: https://tomesphere.com/paper/1903.03372/full.md

---
Source: https://tomesphere.com/paper/1903.03372