PARN: Pyramidal Affine Regression Networks for Dense Semantic   Correspondence

Sangryul Jeon; Seungryong Kim; Dongbo Min; Kwanghoon Sohn

arXiv:1807.02939·cs.CV·August 2, 2018·1 cites

PARN: Pyramidal Affine Regression Networks for Dense Semantic Correspondence

Sangryul Jeon, Seungryong Kim, Dongbo Min, Kwanghoon Sohn

PDF

Open Access

TL;DR

This paper introduces PARN, a deep pyramidal network that estimates dense, locally-varying affine transformations for semantic correspondence, using a coarse-to-fine approach and weak supervision to handle intra-class variations.

Contribution

PARN is the first deep network to estimate dense affine fields in a coarse-to-fine manner with end-to-end training and weak supervision, improving semantic correspondence accuracy.

Findings

01

Outperforms state-of-the-art on multiple benchmarks.

02

Effectively handles intra-class shape and appearance variations.

03

End-to-end learnable with no need for affine quantization.

Abstract

This paper presents a deep architecture for dense semantic correspondence, called pyramidal affine regression networks (PARN), that estimates locally-varying affine transformation fields across images. To deal with intra-class appearance and shape variations that commonly exist among different instances within the same object category, we leverage a pyramidal model where affine transformation fields are progressively estimated in a coarse-to-fine manner so that the smoothness constraint is naturally imposed within deep networks. PARN estimates residual affine transformations at each level and composes them to estimate final affine transformations. Furthermore, to overcome the limitations of insufficient training data for semantic correspondence, we propose a novel weakly-supervised training scheme that generates progressive supervisions by leveraging a correspondence consistency across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning