A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning

Yonggan Fu; Yongan Zhang; Chaojian Li; Zhongzhi Yu; Yingyan Celine Lin

arXiv:2106.06577 · cs.LG · January 7, 2025 · 1 citation

Open Access

TL;DR

This paper introduces A3C-S, an automated framework that co-searches for optimal deep reinforcement learning agents and hardware accelerators, significantly improving performance and efficiency for resource-constrained devices.

Contribution

A3C-S is the first framework to automatically co-search DRL agents and accelerators, optimizing both test scores and hardware efficiency simultaneously.

Findings

01. A3C-S outperforms existing methods in test scores.

02. A3C-S achieves higher hardware efficiency.

03. Experimental results validate the framework's superiority.

Abstract

Driven by the explosive interest in applying deep reinforcement learning (DRL) agents to numerous real-time control and decision-making applications, there is a growing demand to deploy DRL agents on everyday intelligent devices. However, the prohibitive complexity of DRL is at odds with limited on-device resources. In this work, we propose an Automated Agent Accelerator Co-Search (A3C-S) framework, which to the best of our knowledge is the first to automatically co-search optimally matched DRL agents and accelerators that maximize both test scores and hardware efficiency. Extensive experiments consistently validate the superiority of A3C-S over state-of-the-art techniques.

Taxonomy

Topics: Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Modular Robots and Swarm Intelligence