Content-Aware Unsupervised Deep Homography Estimation
Jirong Zhang, Chuan Wang, Shuaicheng Liu, Lanpeng Jia, Nianjin Ye, Jue, Wang, Ji Zhou, Jian Sun

TL;DR
This paper introduces an unsupervised deep learning approach for homography estimation that effectively handles depth disparities and moving objects by learning reliable regions and using a novel triplet loss, outperforming existing methods.
Contribution
The paper proposes a new architecture for unsupervised deep homography estimation that incorporates outlier masking and a specialized triplet loss, addressing real-world challenges.
Findings
Outperforms state-of-the-art methods in diverse scenes
Effectively handles depth disparities and moving objects
Uses learned deep features for loss calculation
Abstract
Homography estimation is a basic image alignment method in many applications. It is usually conducted by extracting and matching sparse feature points, which are error-prone in low-light and low-texture images. On the other hand, previous deep homography approaches use either synthetic images for supervised learning or aerial images for unsupervised learning, both ignoring the importance of handling depth disparities and moving objects in real world applications. To overcome these problems, in this work we propose an unsupervised deep homography method with a new architecture design. In the spirit of the RANSAC procedure in traditional methods, we specifically learn an outlier mask to only select reliable regions for homography estimation. We calculate loss with respect to our learned deep features instead of directly comparing image content as did previously. To achieve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Robotics and Sensor-Based Localization
MethodsTriplet Loss
