MINR: Implicit Neural Representations with Masked Image Modelling

Sua Lee; Joonhun Lee; Myungjoo Kang

arXiv:2507.22404·cs.CV·July 31, 2025

MINR: Implicit Neural Representations with Masked Image Modelling

Sua Lee, Joonhun Lee, Myungjoo Kang

PDF

TL;DR

MINR introduces a novel framework combining implicit neural representations with masked image modeling, resulting in more robust, generalizable, and efficient image reconstruction for self-supervised learning, outperforming existing methods especially on out-of-distribution data.

Contribution

The paper proposes MINR, a new approach that integrates implicit neural representations with masked image modeling to enhance robustness and generalization in self-supervised image learning.

Findings

01

MINR outperforms MAE in in-domain and out-of-distribution tasks.

02

MINR reduces model complexity compared to existing methods.

03

MINR demonstrates versatility across various self-supervised applications.

Abstract

Self-supervised learning methods like masked autoencoders (MAE) have shown significant promise in learning robust feature representations, particularly in image reconstruction-based pretraining task. However, their performance is often strongly dependent on the masking strategies used during training and can degrade when applied to out-of-distribution data. To address these limitations, we introduce the masked implicit neural representations (MINR) framework that synergizes implicit neural representations with masked image modeling. MINR learns a continuous function to represent images, enabling more robust and generalizable reconstructions irrespective of masking strategies. Our experiments demonstrate that MINR not only outperforms MAE in in-domain scenarios but also in out-of-distribution settings, while reducing model complexity. The versatility of MINR extends to various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.