PARN: Pyramidal Affine Regression Networks for Dense Semantic Correspondence
Sangryul Jeon, Seungryong Kim, Dongbo Min, Kwanghoon Sohn

TL;DR
This paper introduces PARN, a deep pyramidal network that estimates dense, locally-varying affine transformations for semantic correspondence, using a coarse-to-fine approach and weak supervision to handle intra-class variations.
Contribution
PARN is the first deep network to estimate dense affine fields in a coarse-to-fine manner with end-to-end training and weak supervision, improving semantic correspondence accuracy.
Findings
Outperforms state-of-the-art on multiple benchmarks.
Effectively handles intra-class shape and appearance variations.
End-to-end learnable with no need for affine quantization.
Abstract
This paper presents a deep architecture for dense semantic correspondence, called pyramidal affine regression networks (PARN), that estimates locally-varying affine transformation fields across images. To deal with intra-class appearance and shape variations that commonly exist among different instances within the same object category, we leverage a pyramidal model where affine transformation fields are progressively estimated in a coarse-to-fine manner so that the smoothness constraint is naturally imposed within deep networks. PARN estimates residual affine transformations at each level and composes them to estimate final affine transformations. Furthermore, to overcome the limitations of insufficient training data for semantic correspondence, we propose a novel weakly-supervised training scheme that generates progressive supervisions by leveraging a correspondence consistency across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
