Unlocking Masked Autoencoders as Loss Function for Image and Video Restoration
Man Zhou, Naishan Zheng, Jie Huang, Chunle Guo, Chongyi Li

TL;DR
This paper proposes using masked Autoencoders as a learned loss function to improve image and video restoration tasks, demonstrating consistent performance gains across multiple applications.
Contribution
It introduces a novel approach of employing MAE as a learned loss function, enhancing neural network training for image and video restoration.
Findings
Consistent performance improvements across multiple restoration tasks.
Learned loss can be directly integrated into existing networks without extra inference costs.
Effective for both image and video restoration, across different architectures.
Abstract
Image and video restoration has achieved a remarkable leap with the advent of deep learning. The success of deep learning paradigm lies in three key components: data, model, and loss. Currently, many efforts have been devoted to the first two while seldom study focuses on loss function. With the question ``are the de facto optimization functions e.g., , , and perceptual losses optimal?'', we explore the potential of loss and raise our belief ``learned loss function empowers the learning capability of neural networks for image and video restoration''. Concretely, we stand on the shoulders of the masked Autoencoders (MAE) and formulate it as a `learned loss function', owing to the fact the pre-trained MAE innately inherits the prior of image reasoning. We investigate the efficacy of our belief from three perspectives: 1) from task-customized MAE to native MAE, 2) from image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Image Processing Techniques and Applications
MethodsMasked autoencoder · Convolution
