Self-supervised Spatial Reasoning on Multi-View Line Drawings

Siyuan Xiang; Anbang Yang; Yanfei Xue; Yaoqing Yang; Chen Feng

arXiv:2104.13433·cs.CV·May 17, 2022

Self-supervised Spatial Reasoning on Multi-View Line Drawings

Siyuan Xiang, Anbang Yang, Yanfei Xue, Yaoqing Yang, Chen Feng

PDF

Open Access 1 Repo

TL;DR

This paper introduces two self-supervised learning methods to enhance spatial reasoning on multi-view line drawings, significantly improving performance on the SPARE3D dataset where supervised methods underperform.

Contribution

The paper proposes novel self-supervised approaches for view consistency and camera pose reasoning, addressing limitations of supervised deep networks on 3D line drawing data.

Findings

01

Significant performance improvement on SPARE3D dataset

02

Effective learning of view-invariant line drawing representations

03

Enhanced generalization to unseen camera poses

Abstract

Spatial reasoning on multi-view line drawings by state-of-the-art supervised deep networks is recently shown with puzzling low performances on the SPARE3D dataset. Based on the fact that self-supervised learning is helpful when a large number of data are available, we propose two self-supervised learning approaches to improve the baseline performance for view consistency reasoning and camera pose reasoning tasks on the SPARE3D dataset. For the first task, we use a self-supervised binary classification network to contrast the line drawing differences between various views of any two similar 3D objects, enabling the trained networks to effectively learn detail-sensitive yet view-invariant line drawing representations of 3D objects. For the second type of task, we propose a self-supervised multi-class classification framework to train a model to select the correct corresponding view from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ai4ce/Contrastive-SPARE3D
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Advanced Vision and Imaging

MethodsContrastive Learning