M\"obius Transform for Mitigating Perspective Distortions in Representation Learning
Prakash Chandra Chhipa, Meenakshi Subhash Chippa, Kanjar De, Rajkumar, Saini, Marcus Liwicki, Mubarak Shah

TL;DR
This paper introduces a novel M"obius transform-based method to mitigate perspective distortions in images, improving robustness of vision models without needing camera parameters or distorted training data.
Contribution
It proposes a new approach using a specific family of M"obius transforms for distortion correction, along with a benchmark dataset, enhancing model robustness against perspective distortions.
Findings
Outperforms existing benchmarks like ImageNet-E and ImageNet-X
Improves performance on real-world PD-affected applications
Enhances object detection under perspective distortions
Abstract
Perspective distortion (PD) causes unprecedented changes in shape, size, orientation, angles, and other spatial relationships of visual concepts in images. Precisely estimating camera intrinsic and extrinsic parameters is a challenging task that prevents synthesizing perspective distortion. Non-availability of dedicated training data poses a critical barrier to developing robust computer vision methods. Additionally, distortion correction methods make other computer vision tasks a multi-step approach and lack performance. In this work, we propose mitigating perspective distortion (MPD) by employing a fine-grained parameter control on a specific family of M\"obius transform to model real-world distortion without estimating camera intrinsic and extrinsic parameters and without the need for actual distorted data. Also, we present a dedicated perspectively distorted benchmark dataset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Vision and Imaging · Retinal Imaging and Analysis
