Configurable Holography: Towards Display and Scene Adaptation
Yicheng Zhan, Liang Shi, Wojciech Matusik, Qi Sun, Kaan Ak\c{s}it

TL;DR
This paper presents a highly configurable learned holography model that can synthesize 3D holograms interactively across diverse display and scene parameters, improving flexibility and speed over existing methods.
Contribution
Introduces a novel model structure enabling continuous conditioning on multiple display-scene parameters for real-time, high-quality hologram synthesis.
Findings
Achieves 2x faster hologram synthesis compared to state-of-the-art methods.
Uncovers a correlation between depth estimation and hologram synthesis tasks.
Validates models on simulations and two holographic display prototypes.
Abstract
Emerging learned holography approaches have enabled faster and high-quality hologram synthesis, setting a new milestone toward practical holographic displays. However, these learned models require training a dedicated model for each set of display-scene parameters. To address this shortcoming, our work introduces a highly configurable learned model structure, synthesizing 3D holograms interactively while supporting diverse display-scene parameters. Our family of models relying on this structure can be conditioned continuously for varying novel scene parameters, including input images, propagation distances, volume depths, peak brightnesses, and novel display parameters of pixel pitches and wavelengths. Uniquely, our findings unearth a correlation between depth estimation and hologram synthesis tasks in the learning domain, leading to a learned model that unlocks accurate 3D hologram…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optical Imaging Technologies · Augmented Reality Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Knowledge Distillation
