TL;DR
This paper introduces a multi-objective optimization framework for training super-resolution models that balances perceptual quality and distortion, automating hyperparameter tuning and improving results over existing methods.
Contribution
It proposes MOBOSR, a novel Bayesian optimization-based method that automates loss weight tuning for better perception-distortion trade-offs in super-resolution.
Findings
MOBOSR outperforms state-of-the-art methods in perceptual quality and distortion.
The approach effectively automates hyperparameter tuning, reducing computational costs.
It advances the perception-distortion Pareto frontier in super-resolution tasks.
Abstract
Training Single-Image Super-Resolution (SISR) models using pixel-based regression losses can achieve high distortion metrics scores (e.g., PSNR and SSIM), but often results in blurry images due to insufficient recovery of high-frequency details. Conversely, using GAN or perceptual losses can produce sharp images with high perceptual metric scores (e.g., LPIPS), but may introduce artifacts and incorrect textures. Balancing these two types of losses can help achieve a trade-off between distortion and perception, but the challenge lies in tuning the loss function weights. To address this issue, we propose a novel method that incorporates Multi-Objective Optimization (MOO) into the training process of SISR models to balance perceptual quality and distortion. We conceptualize the relationship between loss weights and image quality assessment (IQA) metrics as black-box objective functions to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
