Reducing the Deployment-Time Inference Control Costs of Deep   Reinforcement Learning Agents via an Asymmetric Architecture

Chin-Jui Chang; Yu-Wei Chu; Chao-Hsien Ting; Hao-Kang Liu; Zhang-Wei; Hong; Chun-Yi Lee

arXiv:2105.14471·cs.AI·June 1, 2021

Reducing the Deployment-Time Inference Control Costs of Deep Reinforcement Learning Agents via an Asymmetric Architecture

Chin-Jui Chang, Yu-Wei Chu, Chao-Hsien Ting, Hao-Kang Liu, Zhang-Wei, Hong, Chun-Yi Lee

PDF

Open Access

TL;DR

This paper introduces an asymmetric neural network architecture for deep reinforcement learning that switches between a high-cost and low-cost policy to reduce inference energy consumption on mobile robots without sacrificing performance.

Contribution

The paper proposes a novel asymmetric architecture that dynamically switches between policies to lower inference costs in DRL, suitable for energy-constrained platforms.

Findings

01

Significant reduction in inference costs demonstrated on robotic control benchmarks.

02

Maintains comparable performance to traditional DRL methods.

03

Applicable to energy-limited robotic systems.

Abstract

Deep reinforcement learning (DRL) has been demonstrated to provide promising results in several challenging decision making and control tasks. However, the required inference costs of deep neural networks (DNNs) could prevent DRL from being applied to mobile robots which cannot afford high energy-consuming computations. To enable DRL methods to be affordable in such energy-limited platforms, we propose an asymmetric architecture that reduces the overall inference costs via switching between a computationally expensive policy and an economic one. The experimental results evaluated on a number of representative benchmark suites for robotic control tasks demonstrate that our method is able to reduce the inference costs while retaining the agent's overall performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Advanced Memory and Neural Computing