LumaFlux: Lifting 8-Bit Worlds to HDR Reality with Physically-Guided Diffusion Transformers
Shreshth Saini, Hakan Gedik, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

TL;DR
LumaFlux is a novel diffusion transformer model that converts SDR content into perceptually and physically accurate HDR by integrating physical cues, perceptual modulation, and a new HDR dataset and benchmark.
Contribution
It introduces a physically-guided diffusion transformer with novel modules for SDR-to-HDR conversion and provides the first large-scale SDR-HDR training corpus and evaluation benchmark.
Findings
LumaFlux outperforms state-of-the-art methods in luminance reconstruction.
It achieves superior perceptual color fidelity with minimal additional parameters.
The model effectively stabilizes chroma and texture during HDR reconstruction.
Abstract
The rapid adoption of HDR-capable devices has created a pressing need to convert the 8-bit Standard Dynamic Range (SDR) content into perceptually and physically accurate 10-bit High Dynamic Range (HDR). Existing inverse tone-mapping (ITM) methods often rely on fixed tone-mapping operators that struggle to generalize to real-world degradations, stylistic variations, and camera pipelines, frequently producing clipped highlights, desaturated colors, or unstable tone reproduction. We introduce LumaFlux, a first physically and perceptually guided diffusion transformer (DiT) for SDR-to-HDR reconstruction by adapting a large pretrained DiT. Our LumaFlux introduces (1) a Physically-Guided Adaptation (PGA) module that injects luminance, spatial descriptors, and frequency cues into attention through low-rank residuals; (2) a Perceptual Cross-Modulation (PCM) layer that stabilizes chroma and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
