A Practical Stereo Depth System for Smart Glasses
Jialiang Wang, Daniel Scharstein, Akash Bapat, Kevin Blackburn-Matzen,, Matthew Yu, Jonathan Lehman, Suhib Alsisan, Yanghan Wang, Sam Tsai,, Jan-Michael Frahm, Zijian He, Peter Vajda, Michael F. Cohen, Matt Uyttendaele

TL;DR
This paper introduces a practical, on-device stereo depth sensing system for smart glasses that combines pre-processing, rectification, and depth estimation with fallback mechanisms, enabling real-time 3D effects on mobile hardware.
Contribution
It presents a comprehensive, production-ready stereo depth system optimized for mobile devices, with robustness to calibration changes and input variability, suitable for smart glasses applications.
Findings
Runs in under 1 second on a six-year-old smartphone CPU
Generalizes well to unseen data and real-world images
Achieves good accuracy on Middlebury and in-the-wild images
Abstract
We present the design of a productionized end-to-end stereo depth sensing system that does pre-processing, online stereo rectification, and stereo depth estimation with a fallback to monocular depth estimation when rectification is unreliable. The output of our depth sensing system is then used in a novel view generation pipeline to create 3D computational photography effects using point-of-view images captured by smart glasses. All these steps are executed on-device on the stringent compute budget of a mobile phone, and because we expect the users can use a wide range of smartphones, our design needs to be general and cannot be dependent on a particular hardware or ML accelerator such as a smartphone GPU. Although each of these steps is well studied, a description of a practical system is still lacking. For such a system, all these steps need to work in tandem with one another and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Processing Techniques and Applications
