GIFT: Learning Transformation-Invariant Dense Visual Descriptors via   Group CNNs

Yuan Liu; Zehong Shen; Zhixuan Lin; Sida Peng; Hujun Bao; Xiaowei Zhou

arXiv:1911.05932·cs.CV·November 15, 2019·25 cites

GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs

Yuan Liu, Zehong Shen, Zhixuan Lin, Sida Peng, Hujun Bao, Xiaowei Zhou

PDF

Open Access 1 Repo

TL;DR

GIFT introduces a novel group convolution-based descriptor that achieves both high discriminability and invariance to geometric transformations, improving local image correspondence tasks.

Contribution

It proposes a new descriptor using group convolutions to maintain discriminability while ensuring invariance to transformations, unlike traditional pooling methods.

Findings

01

GIFT outperforms existing descriptors on benchmark datasets.

02

It significantly improves relative pose estimation accuracy.

03

The method is theoretically provably invariant to transformation groups.

Abstract

Finding local correspondences between images with different viewpoints requires local descriptors that are robust against geometric transformations. An approach for transformation invariance is to integrate out the transformations by pooling the features extracted from transformed versions of an image. However, the feature pooling may sacrifice the distinctiveness of the resulting descriptors. In this paper, we introduce a novel visual descriptor named Group Invariant Feature Transform (GIFT), which is both discriminative and robust to geometric transformations. The key idea is that the features extracted from the transformed versions of an image can be viewed as a function defined on the group of the transformations. Instead of feature pooling, we use group convolutions to exploit underlying structures of the extracted features on the group, resulting in descriptors that are both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zju3dv/GIFT
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Human Pose and Action Recognition