LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR
Pedram Fekri, WenChen Li, William Chen, Peter Altamirano

TL;DR
LatentHDR introduces a novel latent space approach for efficient, structurally consistent HDR image synthesis from text or images, outperforming diffusion-based methods in quality and computational cost.
Contribution
It decouples scene and exposure modeling in latent space using a pretrained diffusion backbone and a lightweight mapping head, enabling scalable HDR generation.
Findings
Achieves state-of-the-art dynamic range on benchmarks.
Reduces computational cost by an order of magnitude.
Produces structurally consistent exposure stacks in a single pass.
Abstract
High Dynamic Range (HDR) generation remains challenging for generative models, which are largely limited to low dynamic range outputs. Recent diffusionbased approaches approximate HDR by generating multiple exposure-conditioned samples, incurring high computational cost and structural inconsistencies across exposures. We propose LatentHDR, a framework that decouples scene generation from exposure modeling in latent space. A pretrained diffusion backbone produces a single coherent scene representation, while a lightweight conditional latent to-latent head deterministically maps it to exposure-specific representations. This enables the generation of a dense, structurally consistent exposure stack in a single pass. This design eliminates multi-pass diffusion, ensures cross-exposure alignment, and enables scalable HDR synthesis. LatentHDR supports both textand image-conditioned HDR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
