Aligning Motion-Blurred Images Using Contrastive Learning on   Overcomplete Pixels

Leonid Pogorelyuk; Stefan T. Radev

arXiv:2410.07410·cs.CV·November 4, 2024

Aligning Motion-Blurred Images Using Contrastive Learning on Overcomplete Pixels

Leonid Pogorelyuk, Stefan T. Radev

PDF

Open Access 1 Repo

TL;DR

This paper introduces a contrastive learning approach to generate overcomplete pixel features that are invariant to motion blur, enabling effective alignment of video frames captured with moving cameras under challenging conditions.

Contribution

It presents a novel contrastive objective for overcomplete pixel features and demonstrates their effectiveness in aligning motion-blurred video frames.

Findings

01

U-Net trained with the proposed objective aligns frames in challenging videos

02

Overcomplete pixels encode object identity and pixel coordinates

03

Features are invariant to motion blur and other transformations

Abstract

We propose a new contrastive objective for learning overcomplete pixel-level features that are invariant to motion blur. Other invariances (e.g., pose, illumination, or weather) can be learned by applying the corresponding transformations on unlabeled images during self-supervised training. We showcase that a simple U-Net trained with our objective can produce local features useful for aligning the frames of an unseen video captured with a moving camera under realistic and challenging conditions. Using a carefully designed toy example, we also show that the overcomplete pixels can encode the identity of objects in an image and the pixel coordinates relative to these objects.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leonidprinceton/oxels
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Face recognition and analysis

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Convolution · Max Pooling · U-Net