Lumina: Real-Time Mobile Neural Rendering by Exploiting Computational Redundancy
Yu Feng, Weikai Lin, Yuge Cheng, Zihan Liu, Jingwen Leng, Minyi Guo, Chen Chen, Shixuan Sun, Yuhao Zhu

TL;DR
Lumina is a hardware-algorithm co-designed system that significantly accelerates neural rendering on mobile devices by exploiting temporal coherence and caching, achieving over four times speedup with minimal quality loss.
Contribution
The paper introduces Lumina, combining novel algorithms S^2 and RC with a specialized accelerator, LuminCore, to enhance mobile neural rendering efficiency.
Findings
4.5x speedup over mobile GPU
5.3x energy reduction
Less than 0.2 dB PSNR loss
Abstract
3D Gaussian Splatting (3DGS) has vastly advanced the pace of neural rendering, but it remains computationally demanding on today's mobile SoCs. To address this challenge, we propose Lumina, a hardware-algorithm co-designed system, which integrates two principal optimizations: a novel algorithm, S^2, and a radiance caching mechanism, RC, to improve the efficiency of neural rendering. S2 algorithm exploits temporal coherence in rendering to reduce the computational overhead, while RC leverages the color integration process of 3DGS to decrease the frequency of intensive rasterization computations. Coupled with these techniques, we propose an accelerator architecture, LuminCore, to further accelerate cache lookup and address the fundamental inefficiencies in Rasterization. We show that Lumina achieves 4.5x speedup and 5.3x energy reduction against a mobile Volta GPU, with a marginal quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Advanced Neural Network Applications · Image Enhancement Techniques
