Continual Vision-based Reinforcement Learning with Group Symmetries
Shiqi Liu, Mengdi Xu, Piede Huang, Yongkang Liu, Kentaro Oguchi, Ding, Zhao

TL;DR
This paper introduces COVERS, a continual vision-based reinforcement learning method that leverages group symmetries to improve sample efficiency and generalization by recognizing task invariances under transformations.
Contribution
COVERS is the first method to incorporate group symmetries into continual RL with visual inputs, enabling task grouping and improved generalization.
Findings
COVERS accurately groups tasks based on invariances.
It outperforms existing methods in generalization on manipulation tasks.
Demonstrates effectiveness on both simulation and real robots.
Abstract
Continual reinforcement learning aims to sequentially learn a variety of tasks, retaining the ability to perform previously encountered tasks while simultaneously developing new policies for novel tasks. However, current continual RL approaches overlook the fact that certain tasks are identical under basic group operations like rotations or translations, especially with visual inputs. They may unnecessarily learn and maintain a new policy for each similar task, leading to poor sample efficiency and weak generalization capability. To address this, we introduce a unique Continual Vision-based Reinforcement Learning method that recognizes Group Symmetries, called COVERS, cultivating a policy for each group of equivalent tasks rather than individual tasks. COVERS employs a proximal policy optimization-based RL algorithm with an equivariant feature extractor and a novel task grouping…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Muscle activation and electromyography studies · Robot Manipulation and Learning
MethodsTest
