Deep Cross-modality Adaptation via Semantics Preserving Adversarial   Learning for Sketch-based 3D Shape Retrieval

Jiaxin Chen; Yi Fang

arXiv:1807.01806·cs.CV·July 6, 2018·1 cites

Deep Cross-modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-based 3D Shape Retrieval

Jiaxin Chen, Yi Fang

PDF

Open Access

TL;DR

This paper introduces a deep learning framework that uses adversarial training to align features from 2D sketches and 3D shapes, significantly improving cross-modality retrieval accuracy.

Contribution

It proposes a novel cross-modality adaptation model with a transformation network trained via adversarial learning to bridge the gap between sketches and 3D shapes.

Findings

01

Outperforms state-of-the-art methods on SHREC datasets

02

Effectively reduces modality discrepancy in feature space

03

Enhances sketch-based 3D shape retrieval accuracy

Abstract

Due to the large cross-modality discrepancy between 2D sketches and 3D shapes, retrieving 3D shapes by sketches is a significantly challenging task. To address this problem, we propose a novel framework to learn a discriminative deep cross-modality adaptation model in this paper. Specifically, we first separately adopt two metric networks, following two deep convolutional neural networks (CNNs), to learn modality-specific discriminative features based on an importance-aware metric learning method. Subsequently, we explicitly introduce a cross-modality transformation network to compensate for the divergence between two modalities, which can transfer features of 2D sketches to the feature space of 3D shapes. We develop an adversarial learning based method to train the transformation model, by simultaneously enhancing the holistic correlations between data distributions of two modalities,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Human Pose and Action Recognition