Scale-Gest: Scalable Model-Space Synthesis and Runtime Selection for On-Device Gesture Detection

Abdul Basit; Saim Rehman; Muhammad Shafique

arXiv:2605.12506·cs.CV·May 14, 2026

Scale-Gest: Scalable Model-Space Synthesis and Runtime Selection for On-Device Gesture Detection

Abdul Basit, Saim Rehman, Muhammad Shafique

PDF

TL;DR

Scale-Gest is an adaptive gesture detection framework that dynamically selects from a family of tiny-YOLO models to optimize performance, energy, and latency on mobile devices in real-time scenarios.

Contribution

It introduces a run-time adaptive system with device-calibrated ACE profiles and a motion-aware ROI gate for efficient on-device gesture detection under constraints.

Findings

01

Reduces energy per frame by 4x on a laptop

02

Maintains high gesture detection F1 score of 0.8-0.9

03

Achieves low latency of 6 ms per frame

Abstract

Realizing on-device ML-based gesture detection under tight real-time performance, energy and memory constraints is challenging, especially when considering mobile devices with varying battery-power levels. Existing EdgeAI deployments typically rely on a single fixed detector, limiting optimization opportunities. We present Scale-Gest, a novel run-time adaptive gesture detection framework that expands the detector space into a dense family of tiny-YOLO architectures. We introduce multiple novel device-calibrated ACE (Accuracy-Complexity-Energy) profiles by analyzing different model-resolution-stride operating points. A lightweight run-time controller selects an appropriate ACE mode under user-defined and battery constraints, while a motion-aware hand-gesture-tracking ROI gate crops the input for reduced complexity detection. To evaluate performance of our system in real-world car driving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.