Covariance-Guided Resource Adaptive Learning for Efficient Edge Inference
Ahmad N. L. Nabhaan, Zaki Sukma, Rakandhiya D. Rachmanto, Muhammad Husni Santriaji, Byungjin Cho, Arief Setyanto, In Kee Kim

TL;DR
CORAL is an online optimization method that efficiently finds near-optimal hardware configurations for edge inference, balancing power and throughput without offline profiling, by leveraging statistical dependency measures.
Contribution
It introduces CORAL, a novel online approach using distance covariance for co-optimizing power and throughput in edge inference, avoiding exhaustive offline profiling.
Findings
Achieves 96-100% of optimal performance compared to exhaustive search.
Effectively finds configurations within power constraints where baselines fail.
Operates efficiently across diverse models and hardware setups.
Abstract
For deep learning inference on edge devices, hardware configurations achieving the same throughput can differ by 2 in power consumption, yet operators often struggle to find the efficient ones without exhaustive profiling. Existing approaches often rely on inefficient static presets or require expensive offline profiling that must be repeated for each new model or device. To address this problem, we present CORAL, an online optimization method that discovers near-optimal configurations without offline profiling. CORAL leverages distance covariance to statistically capture the non-linear dependencies between hardware settings, e.g., DVFS and concurrency levels, and performance metrics. Unlike prior work, we explicitly formulate the challenge as a throughput-power co-optimization problem to satisfy power budgets and throughput targets simultaneously. We evaluate CORAL on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
