Honey, I Shrunk The Actor: A Case Study on Preserving Performance with   Smaller Actors in Actor-Critic RL

Siddharth Mysore; Bassel Mabsout; Renato Mancuso; Kate Saenko

arXiv:2102.11893·cs.LG·June 22, 2021·1 cites

Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL

Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

PDF

Open Access

TL;DR

This paper investigates how independently reducing the size of actor networks in actor-critic reinforcement learning can significantly cut computational costs while maintaining performance, especially useful in resource-limited settings.

Contribution

It demonstrates that smaller actor networks can match larger ones' performance, challenging the common assumption of symmetric architectures in actor-critic models.

Findings

01

Up to 99% reduction in network weights for actors.

02

Average 77% weight reduction across multiple algorithms.

03

Smaller actors maintain comparable policy performance.

Abstract

Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures. This case study explores the performance impact of network sizes when considering actor and critic architectures independently. By relaxing the assumption of architectural symmetry, it is often possible for smaller actors to achieve comparable policy performance to their symmetric counterparts. Our experiments show up to 99% reduction in the number of network weights with an average reduction of 77% over multiple actor-critic algorithms on 9 independent tasks. Given that reducing actor complexity results in a direct reduction of run-time inference cost, we believe configurations of actors and critics are aspects of actor-critic design that deserve to be considered independently, particularly in resource-constrained applications or when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvancements in Semiconductor Devices and Circuit Design · Insect symbiosis and bacterial influences · Neural Networks and Reservoir Computing