DeepLight: Robust & Unobtrusive Real-time Screen-Camera Communication for Real-World Displays
Vu Tran, Gihan Jayatilaka, Ashwin Ashok, Archan Misra

TL;DR
DeepLight presents a machine learning-based system for robust, human-imperceptible real-time screen-camera communication that works reliably in real-world conditions, overcoming limitations of previous methods.
Contribution
The paper introduces DeepLight, a novel DNN-based decoder and encoding scheme that enhances robustness and imperceptibility in screen-camera communication under diverse conditions.
Findings
Achieves high decoding accuracy with frame error rate < 0.2
Supports data rates of at least 0.95 Kbps
Maintains IoU >= 83% for screen extraction
Abstract
The paper introduces a novel, holistic approach for robust Screen-Camera Communication (SCC), where video content on a screen is visually encoded in a human-imperceptible fashion and decoded by a camera capturing images of such screen content. We first show that state-of-the-art SCC techniques have two key limitations for in-the-wild deployment: (a) the decoding accuracy drops rapidly under even modest screen extraction errors from the captured images, and (b) they generate perceptible flickers on common refresh rate screens even with minimal modulation of pixel intensity. To overcome these challenges, we introduce DeepLight, a system that incorporates machine learning (ML) models in the decoding pipeline to achieve humanly-imperceptible, moderately high SCC rates under diverse real-world conditions. Deep-Light's key innovation is the design of a Deep Neural Network (DNN) based decoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
