EdgeRL: Reinforcement Learning-driven Deep Learning Model Inference Optimization at Edge

Motahare Mounesan; Xiaojie Zhang; Saptarshi Debroy

arXiv:2410.12221·cs.DC·October 17, 2024

Open Access

TL;DR

EdgeRL is a reinforcement learning framework that optimizes deep learning inference at the edge by balancing latency, accuracy, and end-device energy consumption. It is evaluated with a real-world deep learning model on a hardware testbed.

Contribution

The paper introduces EdgeRL, a novel RL-based approach that dynamically selects DNN inference parameters at run time to match application-specific performance trade-offs.

Findings

1. Significant energy savings on end devices.

2. Improved inference accuracy.

3. Reduced end-to-end latency.

Abstract

Balancing mutually diverging performance metrics, such as processing latency, outcome accuracy, and end-device energy consumption, is a challenging undertaking for deep learning model inference in ad-hoc edge environments. In this paper, we propose the EdgeRL framework, which seeks to strike such a balance by using an Advantage Actor-Critic (A2C) Reinforcement Learning (RL) approach that chooses optimal run-time DNN inference parameters and aligns the performance metrics with the application requirements. Using a real-world deep learning model and a hardware testbed, we evaluate the benefits of the EdgeRL framework in terms of end-device energy savings, inference accuracy improvement, and end-to-end inference latency reduction.
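The abstract does not give the reward function or the network design, so the following is only a minimal sketch of how a trade-off between latency, accuracy, and energy might be folded into a single reward and fed to the one-step advantage estimate that A2C methods typically use. The `Outcome` fields, the linear reward form, the normalization constants, and all weights are assumptions made for illustration, not values taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Outcome:
    """Hypothetical measurements for one inference under a chosen configuration."""
    latency_s: float  # end-to-end latency, in seconds
    accuracy: float   # inference accuracy, in [0, 1]
    energy_j: float   # end-device energy, in joules


def reward(o, w_latency, w_accuracy, w_energy,
           max_latency_s=1.0, max_energy_j=10.0):
    """Weighted scalar reward: accuracy is a gain, latency and energy are costs.

    The linear form and the normalization caps are assumptions for this sketch.
    """
    lat = min(o.latency_s / max_latency_s, 1.0)  # normalize to [0, 1]
    eng = min(o.energy_j / max_energy_j, 1.0)    # normalize to [0, 1]
    return w_accuracy * o.accuracy - w_latency * lat - w_energy * eng


def advantage(r, value_s, value_next, gamma=0.99, done=False):
    """One-step advantage estimate: A = r + gamma * V(s') - V(s)."""
    bootstrap = 0.0 if done else gamma * value_next
    return r + bootstrap - value_s


# Two made-up configurations: run everything on the device vs. offload part of it.
local = Outcome(latency_s=0.8, accuracy=0.90, energy_j=8.0)
offload = Outcome(latency_s=0.3, accuracy=0.85, energy_j=2.0)

# An energy-focused application and an accuracy-focused one rank them differently.
energy_first = dict(w_latency=0.1, w_accuracy=0.2, w_energy=0.7)
accuracy_first = dict(w_latency=0.0, w_accuracy=1.0, w_energy=0.0)

prefers_offload = reward(offload, **energy_first) > reward(local, **energy_first)
prefers_local = reward(local, **accuracy_first) > reward(offload, **accuracy_first)
```

In this sketch, changing only the weights flips which configuration scores higher, which is one way an agent could be steered toward different application requirements. In an actor-critic setup the critic would supply `value_s` and `value_next`, and the actor's policy gradient would be scaled by the resulting advantage.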

Taxonomy

Topics: Data Stream Mining Techniques