Deep Cross-modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-based 3D Shape Retrieval
Jiaxin Chen, Yi Fang

TL;DR
This paper introduces a deep learning framework that uses adversarial training to align features from 2D sketches and 3D shapes, significantly improving cross-modality retrieval accuracy.
Contribution
It proposes a novel cross-modality adaptation model with a transformation network trained via adversarial learning to bridge the gap between sketches and 3D shapes.
Findings
Outperforms state-of-the-art methods on SHREC datasets
Effectively reduces modality discrepancy in feature space
Enhances sketch-based 3D shape retrieval accuracy
Abstract
Due to the large cross-modality discrepancy between 2D sketches and 3D shapes, retrieving 3D shapes by sketches is a significantly challenging task. To address this problem, we propose a novel framework to learn a discriminative deep cross-modality adaptation model in this paper. Specifically, we first separately adopt two metric networks, following two deep convolutional neural networks (CNNs), to learn modality-specific discriminative features based on an importance-aware metric learning method. Subsequently, we explicitly introduce a cross-modality transformation network to compensate for the divergence between two modalities, which can transfer features of 2D sketches to the feature space of 3D shapes. We develop an adversarial learning based method to train the transformation model, by simultaneously enhancing the holistic correlations between data distributions of two modalities,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Human Pose and Action Recognition
