Scale-Gest: Scalable Model-Space Synthesis and Runtime Selection for On-Device Gesture Detection
Abdul Basit, Saim Rehman, Muhammad Shafique

TL;DR
Scale-Gest is an adaptive gesture detection framework that dynamically selects from a family of tiny-YOLO models to optimize performance, energy, and latency on mobile devices in real-time scenarios.
Contribution
It introduces a run-time adaptive system with device-calibrated ACE profiles and a motion-aware ROI gate for efficient on-device gesture detection under constraints.
Findings
Reduces energy per frame by 4x on a laptop
Maintains high gesture detection F1 score of 0.8-0.9
Achieves low latency of 6 ms per frame
Abstract
Realizing on-device ML-based gesture detection under tight real-time performance, energy and memory constraints is challenging, especially when considering mobile devices with varying battery-power levels. Existing EdgeAI deployments typically rely on a single fixed detector, limiting optimization opportunities. We present Scale-Gest, a novel run-time adaptive gesture detection framework that expands the detector space into a dense family of tiny-YOLO architectures. We introduce multiple novel device-calibrated ACE (Accuracy-Complexity-Energy) profiles by analyzing different model-resolution-stride operating points. A lightweight run-time controller selects an appropriate ACE mode under user-defined and battery constraints, while a motion-aware hand-gesture-tracking ROI gate crops the input for reduced complexity detection. To evaluate performance of our system in real-world car driving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
