TL;DR
This paper presents a simple method for HDR video generation by leveraging logarithmic encoding to align HDR imagery with pretrained generative models' priors, enabling high-quality results with minimal fine-tuning.
Contribution
It introduces a novel approach using logarithmic encoding and lightweight fine-tuning to generate HDR videos without retraining models or designing new architectures.
Findings
Achieves high-quality HDR video generation with minimal adaptation.
Logarithmic encoding aligns HDR data with pretrained model priors effectively.
Demonstrates robustness across diverse scenes and lighting conditions.
Abstract
High dynamic range (HDR) imagery offers a rich and faithful representation of scene radiance, but remains challenging for generative models due to its mismatch with the bounded, perceptually compressed data on which these models are trained. A natural solution is to learn new representations for HDR, which introduces additional complexity and data requirements. In this work, we show that HDR generation can be achieved in a much simpler way by leveraging the strong visual priors already captured by pretrained generative models. We observe that a logarithmic encoding widely used in cinematic pipelines maps HDR imagery into a distribution that is naturally aligned with the latent space of these models, enabling direct adaptation via lightweight fine-tuning without retraining an encoder. To recover details that are not directly observable in the input, we further introduce a training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
