M\"obius Transform for Mitigating Perspective Distortions in   Representation Learning

Prakash Chandra Chhipa; Meenakshi Subhash Chippa; Kanjar De; Rajkumar; Saini; Marcus Liwicki; Mubarak Shah

arXiv:2405.02296·cs.CV·July 16, 2024

M\"obius Transform for Mitigating Perspective Distortions in Representation Learning

Prakash Chandra Chhipa, Meenakshi Subhash Chippa, Kanjar De, Rajkumar, Saini, Marcus Liwicki, Mubarak Shah

PDF

Open Access 1 Models 1 Datasets

TL;DR

This paper introduces a novel M"obius transform-based method to mitigate perspective distortions in images, improving robustness of vision models without needing camera parameters or distorted training data.

Contribution

It proposes a new approach using a specific family of M"obius transforms for distortion correction, along with a benchmark dataset, enhancing model robustness against perspective distortions.

Findings

01

Outperforms existing benchmarks like ImageNet-E and ImageNet-X

02

Improves performance on real-world PD-affected applications

03

Enhances object detection under perspective distortions

Abstract

Perspective distortion (PD) causes unprecedented changes in shape, size, orientation, angles, and other spatial relationships of visual concepts in images. Precisely estimating camera intrinsic and extrinsic parameters is a challenging task that prevents synthesizing perspective distortion. Non-availability of dedicated training data poses a critical barrier to developing robust computer vision methods. Additionally, distortion correction methods make other computer vision tasks a multi-step approach and lack performance. In this work, we propose mitigating perspective distortion (MPD) by employing a fine-grained parameter control on a specific family of M\"obius transform to model real-world distortion without estimating camera intrinsic and extrinsic parameters and without the need for actual distorted data. Also, we present a dedicated perspectively distorted benchmark dataset,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
prakashchhipa/MPD_SSL
model· ♡ 1
♡ 1

Datasets

prakashchhipa/ImageNet-PD
dataset· 52 dl
52 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Vision and Imaging · Retinal Imaging and Analysis